FOREWORD


The Tenth Army Conference on Applied Mathematics and Computing was held at the U.S.
Military Academy, West Point, on 16-19 June 1992. This is the third time the Military
Academy has served as the host for this series of Army conferences. For each of these
meetings the heads of the Department of Mathematics served as Chairpersons on Local
Arrangements. Colonel Frank Giordano served twice in this capacity. For the tenth
conference, he was assisted in this task by Lieutenant Colonel Scott Crawford and
Captain Rick Stevens. These individuals are to be commended for their efforts in
coordinating all the details required to conduct this large, successful scientific meeting.

The  1992  conference  was  attended  by  more  than  80  scientists  and  engineers 
representing  various  Army  agencies  and  academia.  The  meeting  featured  seven  invited 
speakers.  These  general  talks  covered  several  topics  of  current  interest,  including  natural 
language  processing,  wavelet  analysis,  variational  methods,  small  sample  asymptotics, 
computational  fluid  dynamics,  digital  control  programs,  and  parallel  programming.  The 
names of these speakers, together with the titles of their addresses, are listed below. The
second  part  of  the  program  consisted  of  special  sessions  on  topics  such  as 
computational  algebraic  geometry,  mathematical  aspects  of  material  sciences,  scalability 
in  high  performance  computing,  and  robust  control  and  nonlinear  systems.  In  addition, 
about 50 contributed papers were presented by both Army and academic participants.


SPEAKER AND AFFILIATION             TITLE OF ADDRESS

Professor Anil Nerode and           Extraction of Digital Control
Alexander Yakhnis                   Programs from Specifications
Cornell University                  for Continuous Plants

Professor Ken Kennedy               Architecture-Independent
Rice University                     Parallel Programming Support
                                    in Fortran D

Professor C. R. Rao                 Current Trends of Research in
Pennsylvania State University       Statistics: Robustness, Small
                                    Sample Asymptotics & Resampling

Professor Aravind K. Joshi          Natural Language Processing
University of Pennsylvania

Professor Charles K. Chui           Wavelet Analysis and Its
Texas A&M University                Applications

Professor David Kinderlehrer        Variational Methods for the
Carnegie Mellon University          Materials Sciences

Professor Paul Woodward             Visualization in Computational
University of Minnesota             Fluid Dynamics


The evening events provided by the host installation added a great
deal to this conference. On Tuesday evening there was a bus tour
of West Point; on the following evening the banquet was held; and
the Thursday evening workshop, put on by the members of the
Mathematics Division, introduced some attendees of the conference
to computer algebra.

This  conference  is  part  of  a  continuing  program  of  Army-wide 
symposia  held  under  the  auspices  of  the  Army  Mathematics  Steering 
Committee  (AMSC)  to  promote  better  communication  between  Army 
scientists and the Army Research Office investigators. In order
that  this  mission  be  accomplished,  a  large  number  of  scientists  had 
to  expend  a  great  deal  of  effort.  The  members  of  the  AMSC  would 
like  to  thank  all  these  individuals  for  their  excellent 
presentations  and  their  valuable  contributions  to  the  field  of 
science. 
BY CONVOLUTION AND/OR RANDOM OBSERVATIONAL ERROR
Arthur  P.  Dempster,  Xianghui  Chen,  Jun  Liu,  and  Patricia  M. 
Meehan,  Harvard  University,  Cambridge,  Massachusetts 

SIMULATED  ANNEALING  ALGORITHMS  FOR  CONTINUOUS 
OPTIMIZATION 

Saul  B.  Gelfand,  Purdue  University,  West  Lafayette,  Indiana 

1530  - 1600  Break 

1600 - 1700 General Session IV - Thayer Hall, Room 342

Chairperson:  David  W.  Hislop,  U.S.  Army  Research  Office, 
Research  Triangle  Park,  North  Carolina 

NATURAL  LANGUAGE  PROCESSING 

Aravind  K.  Joshi,  University  of  Pennsylvania,  Philadelphia, 
Pennsylvania 

xxxxxxxxxxxxxxxxxxxx 


0800  - 1600  Registration  -  Thayer  Hall,  Room  342 


0830  - 1030  Technical  Session  5  -  Applied  Finite  Element  Computations 

-  Thayer  Hall,  Room  342 


Chairperson:  C.J.  Quigley,  U.S.  Army  Materials  Technology  Laboratory, 
Watertown,  Massachusetts 


STRUCTURAL  ANALYSIS  OF  A  CYLINDRICAL  GUN  TUBE  SECTION 
SUBJECTED  TO  AN  INTERNAL  EXPLOSIVE  BLAST 

Aaron  Das  Gupta,  U.S.  Army  Ballistic  Research  Laboratory, 
Aberdeen Proving Ground, Maryland


DEVELOPMENT  OF  A  COMPUTATIONAL  METHOD  FOR  CONVENTIONAL 
WEAPONS  ANALYSIS  OF  BURIED  STRUCTURES 

James  T.  Baylot,  U.S.  Army  Engineer  Waterways  Experiment  Station, 
Vicksburg,  Mississippi 

HIGH  SPEED  FLOW  SIMULATION  USING  A  HIGHER-ORDER  LOCAL 
PROJECTION  FINITE  ELEMENT  METHOD  ON  MASSIVELY  PARALLEL 
COMPUTERS 

Bernardo Cockburn,  University  of  Minnesota,  Minneapolis,  Minnesota, 
and  Nisheeth  Patel  and  Jerry  Clarke,  U.S.  Army  Ballistic  Research 
Laboratory,  Aberdeen  Proving  Ground,  Maryland 

ADAPTIVE  GRIDS  OF  HULL  HYDRODYNAMICS  CODE 

C.  Wayne  Mastin,  U.S.  Army  Engineer  Waterways  Experiment  Station, 
Vicksburg,  Mississippi 


xxxxxxxxxxxxxxxxxxxx 


0830  - 1030  Technical  Session  6  -  Nonlinear  Systems  and  Adaptive  Control 

- Thayer Hall, Room 348

Chairperson:  William  Jackson,  U.S.  Army  Tank-Automotive  Command, 
Warren,  Michigan 

SMOOTH  TIME-PERIODIC  FEEDBACK  SOLUTIONS  FOR  NONHOLONOMIC 
MOTION  PLANNING 

L.  Gurvits  and  Z.X.  Li,  Courant  Institute  of  Mathematical  Sciences, 
New  York  University,  New  York,  New  York 

CLASSIFICATION  OF  FINITE  DIMENSIONAL  ESTIMATION  ALGEBRAS: 
LOW  DIMENSIONAL  CASE 

Stephen  Yau,  Chi-Wah  Leung,  University  of  Illinois  at  Chicago, 
Chicago,  Illinois 

ADAPTIVE  GUNNER  MODELING  FOR  APPLICATION  TO  ARMORED 
VEHICLE  TECHNOLOGY 

John  Groff  and  Peter  Pazio,  U.S.  Army  Ballistic  Research  Laboratory, 
Aberdeen  Proving  Ground,  Maryland 


Thursday  (continued) 


MODELING  AND  CONTROL  ISSUES  OF  A  TWIN  LIFT  HELICOPTER  SYSTEM 
J.V.R.  Prasad,  Georgia  Institute  of  Technology,  Atlanta,  Georgia 

HYBRID  OPTIMAL  CONTROL  OF  TURRET-GUN  SYSTEM 

L.S. Shieh and J.L. Zhang, University of Houston, Houston, Texas,
and  Norman  Coleman,  U.S.  Army  Armament,  Research  and 
Development  Center,  Picatinny  Arsenal,  New  Jersey 


1030  - 1100  Break 

1100  - 1200  General  Session  V  -  Thayer  Hall,  Room  342 

Chairperson:  Frank  A.  Giordano,  U.S.  Military  Academy,  West  Point, 
New  York 

WAVELET  ANALYSIS  AND  ITS  APPLICATIONS 

Charles  K.  Chui,  Texas  A&M  University,  College  Station,  Texas 


1200  -  1330  Lunch 

1330 - 1530 Special Session 3 - Scalability in High Performance Computing

-  Thayer  Hall,  Room  342 

Chairperson:  Kenneth  D.  Clark,  U.S.  Army  Research  Office, 

Research  Triangle  Park,  North  Carolina 

SCALABILITY  ISSUES  IN  INTERCONNECTION  NETWORKS 

Frank  T.  Leighton,  Massachusetts  Institute  of  Technology,  Cambridge, 
Massachusetts 

SCALABLE  PARALLELISM  FOR  PDE  COMPUTATIONS 

John  R.  Rice,  Purdue  University,  West  Lafayette,  Indiana 

ANALYZING  SCALABILITY  OF  PARALLEL  ALGORITHMS  AND 
ARCHITECTURES 

Vipin  Kumar,  University  of  Minnesota,  Minneapolis,  Minnesota 


xxxxxxxxxxxxxxxxxxxx 
Thursday (continued)


1330  - 1530  Technical  Session  7  -  Dynamical  Systems  and  Differential 

Equations  II  -  Thayer  Hall,  Room  348 

Chairperson:  J.S.  Robertson,  U.S.  Military  Academy,  West  Point, 

New  York 

FRONT-PROPAGATION  IN  REACTION-DIFFUSION  EQUATIONS 

Halil  Mete  Soner,  Carnegie  Mellon  University,  Pittsburgh,  Pennsylvania 

NUMERICAL  TREATMENT  OF  RANDOM  DIFFERENTIAL  EQUATIONS 

G.S.  Ladde,  University  of  Texas  at  Arlington,  Arlington,  Texas,  and 
S.  Sathananthan,  Jarvis  Christian  College,  Hawkins,  Texas 

DIAGONALIZATION  AND  STABILITY  OF  TWO  TIME-SCALE  SINGULARLY 
PERTURBED  LINEAR  INTEGRO-DIFFERENTIAL  EQUATIONS 

G.S.  Ladde,  University  of  Texas  at  Arlington,  Arlington,  Texas,  and 
S.  Sathananthan,  Jarvis  Christian  College,  Hawkins,  Texas 

1)  DYNAMICAL  SYSTEMS  IN  ASYMMETRIC  SPACE  AND  TIME 

2)  SLOW  AND  ULTRAFAST  WAVE  PROPAGATION 

Richard  A.  Weiss,  U.S.  Army  Engineer  Waterways  Experiment  Station, 
Vicksburg,  Mississippi 

CONSIDERATION  OF  PHASE  TRANSFORMATIONS  OF  SHEAR  BANDS  IN  A 
DYNAMICALLY  LOADED  STEEL  BLOCK 

Z.G.  Zhu  and  R.C.  Batra,  University  of  Missouri-Rolla,  Rolla,  Missouri 


1530-1600  Break 

1600 - 1700 General Session VI - Thayer Hall, Room 342

Chairperson:  Julian  Wu,  U.S.  Army  Research  Office, 
Research  Triangle  Park,  North  Carolina 

VARIATIONAL  METHODS  FOR  THE  MATERIALS  SCIENCES 

David  Kinderlehrer,  Carnegie  Mellon  University,  Pittsburgh, 
Pennsylvania 


xxxxxxxxxxxxxxxxxxxx 
0800 - 1200 Registration - Thayer Hall, Room 342

0830  - 1030  Special  Session  4  -  Robust  Control  and  Nonlinear  Systems 

-  Thayer  Hall,  Room  342 

Chairperson:  Norman  Coleman,  U.S.  Army  Armament,  Research, 

Development  and  Engineering  Center,  Picatinny  Arsenal, 
New  Jersey 

NON-COLLOCATED  MOTION  CONTROL  OF  A  FLEXIBLE  BEAM  BASED  ON 
A  DELAYED  ADAPTIVE  INVERSE  METHOD 

David  S.  Wang,  Guo-Ben  Yang,  and  Max  Donath,  University  of 
Minnesota,  Minneapolis,  Minnesota,  and  Mike  Mattice  and  Norman 
Coleman,  U.S.  Army  Armament,  Research,  Development  and 
Engineering  Center,  Picatinny  Arsenal,  New  Jersey 

CONTROL  DESIGN  FOR  FLEXIBLE  STRUCTURES 

Karashad  Khorrami,  Polytechnic  University,  Brooklyn,  New  York 

A  NONLINEAR  FEEDBACK  CONTROL  STRATEGY  BASED  ON  TRAJECTORY 
PATTERNS 

Dale  Enns,  Honeywell  Systems  Research  Center,  Minneapolis, 
Minnesota,  and  Allen  Tannenbaum,  University  of  Minnesota, 
Minneapolis,  Minnesota 

ADVANCED  NONLINEAR  CONTROL  THEORY  FOR  GUN  CONTROL/ 
STABILIZATION 

Jay  Rastegar  and  Folkert  Tangerman,  SUNY  at  Stony  Brook, 

Stony  Brook,  New  York 

xxxxxxxxxxxxxxxxxxxx 


0830 - 1030 Technical Session 8 - Computer Modeling and Scientific Visualization

- Thayer Hall, Room 348

Chairperson:  Herbert  Cohen,  U.S.  Army  Material  Systems  Analysis 
Activity,  Aberdeen  Proving  Ground,  Maryland 

VISUALIZATION  OF  DYNAMIC  SOIL-STRUCTURE  INTERACTION  ANALYSIS 
Robert L. Hall, Julia A. Baca, and Donald Nelson, U.S. Army Engineer
Waterways  Experiment  Station,  Vicksburg,  Mississippi 
Friday (continued)


ADVANCED  COMPUTER  MODELING  OF  METEOROLOGICAL  EFFECTS 
UPON ARTILLERY PROJECTILE FLIGHT

Abol  J.  Blanco,  U.S.  Army  Atmospheric  Sciences  Laboratory, 

White  Sands  Missile  Range,  New  Mexico 

SCIENTIFIC  VISUALIZATION  OF  FLOW  FIELDS 

Charles  S.  Jones  and  Julia  A.  Baca,  U.S.  Army  Engineer  Waterways 
Experiment  Station,  Vicksburg,  Mississippi 

THE GENERALIZED BALANCED TERNARY (GBT) PROPOSED FOR
APPLICATION  SCIENTIFIC  VISUALIZATION 

Robert  E.  DeKinder,  U.S.  Army  Atmospheric  Sciences  Laboratory, 
White  Sands  Missile  Range,  New  Mexico,  and  John  R.  Bams, 
Computer  Sciences  Corporation,  White  Sands  Missile  Range  Support 
Office,  Las  Cruces,  New  Mexico 

FLOW  COMPUTATION  ABOUT  ARMY  PROJECTILES  AND  MISSILES  USING 
MULTI-ZONE  PARALLEL  COMPUTER  ARCHITECTURES 

Nisheeth  Patel,  Jerry  Clarke,  and  Monte  Coleman,  U.S.  Army  Ballistic 
Research  Laboratory,  Aberdeen  Proving  Ground,  Maryland 

APPLICATION  OF  FINITE  ELEMENT,  GRID  OPERATION,  AND  SCIENTIFIC 
VISUALIZATION  TECHNIQUES 

Fred  Tracy,  U.S.  Army  Engineer  Waterways  Experiment  Station, 
Vicksburg,  Mississippi 


1030-1100  Break 

1100 - 1200 General Session VII - Thayer Hall, Room 342

Chairperson:  Jagdish  Chandra,  U.S.  Army  Research  Office, 
Research  Triangle  Park,  North  Carolina 

VISUALIZATION  IN  COMPUTATIONAL  FLUID  DYNAMICS 

Paul  Woodward,  University  of  Minnesota,  Minneapolis,  Minnesota 

1200-1215  ADJOURNMENT 
TOWARD A NEW METHOD OF DECODING ALGEBRAIC CODES

USING GRÖBNER BASES


A.  Brinton  Cooper,  III 
U.S.  Army  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland  21005-5066 
USA 

Abstract 

A binary BCH error control code is a vector subspace of binary n-tuples. Algebraically,
the code is generated by a polynomial having binary coefficients and roots in
$GF(2^m)$. It is decoded by computing a set of syndrome equations which are multivariate
polynomials over $GF(2^m)$ and which exhibit a certain symmetry. If the number of
transmission errors in a received word does not exceed a bound t for the code, the roots of the
syndromes are the locations, in the received word, of those errors. These multivariate
polynomials are taken as the basis for an ideal in the ring of polynomials in t variables
over $GF(2^m)$. A celebrated algorithm by Buchberger produces a reduced Gröbner basis
of that ideal. It turns out that, since the common roots of all the polynomials in the
ideal are a set of isolated points, this reduced Gröbner basis is in triangular form, and
the univariate polynomial in that basis is the well-known BCH error locator polynomial,
the roots of which specify the error locations. Decoding is algorithmically complete when
this polynomial is known.


1  Introduction 


Modern algebraic techniques have been used to design and decode codes for error control as
far back as the presentation of Hamming’s codes [1]. In the early 1960s, the binary BCH¹
codes [3-6] were discovered independently by Bose and Chaudhuri and by Hocquenghem.
The BCH codes and their descendants are popular for several reasons, including their regular
algebraic structure, which permits easy encoding using simple shift registers, and the existence
of codes for a wide range of block lengths and error correction capabilities.

However, the asymptotic performance of BCH codes is not “good” [6] in that the
error probability after decoding and the information rate of the code are not simultaneously
bounded away from zero with increasing block length. Nevertheless, the BCH codes and
their derivatives are widely used because they are easy to generate, well understood, and
useful in the control of transmission errors over noisy channels. Decoders, however, are
complex, and work continues to find simpler and more powerful decoders.

¹ Bose-Chaudhuri-Hocquenghem. McEliece [2] presents an interesting history of the naming of these codes.

This  work  applies  recent  results  from  the  algebra  of  multivariate  polynomials  to  the 
direct  solution  of  the  syndrome  equations  of  binary  BCH  codes.  In  this  problem,  up  to  t 
nonlinear  polynomial  equations  must  be  solved  for  the  locations  of  the  errors. 

Following a review of the basic theory of linear block codes, Section 2 presents the
polynomial model of cyclic codes and shows how a BCH code is specified solely by a set of
roots of its generator polynomial. Section 3 defines the essential problem for decoding BCH
codes. Section 4 casts the problem into ideals in the polynomial ring $GF(2^m)[X_1, \ldots, X_t]$,
such ideals being defined by their roots. Modern methods are used to solve these equations
directly.

Examples  are  included. 


2  Linear  Block  Codes 

2.1  Introduction 

A common method for controlling errors in information transmitted over noisy channels is
the use of linear block codes (LBC)². Algebraically, an LBC is a subspace of a vector space of
n-tuples over a finite field and, therefore, has a basis which spans the code. The dimension k
of the LBC is smaller than n, the number of elements in the n-tuple. This gives rise to the
existence of n - k redundant symbols in each codeword. This redundancy provides distance
between pairs of codewords. The sense in which we define “nearness” is Hamming distance.

Definition: The Hamming distance $d_H$ between two n-tuples is the number of places
in which they differ.
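
The definition translates directly into a few lines of code. The following is a minimal sketch; the function name `hamming_distance` and the two sample 7-tuples are illustrative choices, not part of the original paper.

```python
def hamming_distance(u, v):
    """Number of positions in which two equal-length n-tuples differ."""
    if len(u) != len(v):
        raise ValueError("n-tuples must have the same length")
    return sum(a != b for a, b in zip(u, v))


# Two binary 7-tuples that differ in the third, sixth and seventh places.
u = (1, 0, 1, 1, 0, 0, 1)
v = (1, 0, 0, 1, 0, 1, 0)
distance = hamming_distance(u, v)
print(distance)  # prints 3
```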

Channel  noise  often  reduces  the  distance  between  two  received  words.  Sufficient  redun¬ 
dancy,  however,  can  provide  enough  inter-codeword  distance  to  protect  against  specified 
levels  of  channel  noise. 


² For a thorough coverage of this topic, the reader is referred to any of several excellent texts [2-9].
2.2 Polynomials and Cyclic Codes

Using  powers  of  an  indeterminate  x  as  placeholders  permits  writing  a  polynomial  model 
of  the  LBC.  This  is  more  than  formalism,  however,  as  it  permits  code  construction  and 
decoding  based  upon  the  roots  of  certain  polynomials. 

Information is carried in the (binary) coefficients of

\[ i(x) = i_0 + i_1 x + \cdots + i_{k-1} x^{k-1}, \qquad i_j \in GF(2), \quad j = 0, 1, \ldots, k-1. \tag{1} \]

Codeword polynomials are generated by multiplying $i(x)$ by a generator polynomial $g(x)$ of
degree n - k:

\[ g(x) = g_0 + g_1 x + \cdots + g_{n-k} x^{n-k}, \qquad g_j \in GF(2), \quad j = 0, 1, \ldots, n-k \tag{2} \]

Coefficients of the resulting polynomial $v(x)$ represent the binary symbols in the codeword:

\[ v(x) = i(x) g(x) = v_0 + v_1 x + \cdots + v_{n-1} x^{n-1}, \qquad v_j \in GF(2), \quad j = 0, 1, \ldots, n-1 \tag{3} \]

A code is said to be cyclic if every cyclic shift of every codeword is also a codeword.
Algebraically, it is true that a code is cyclic whenever $g(x) \mid x^n - 1$, and it follows that a cyclic
code is an ideal³ in the ring of polynomials modulo $x^n - 1$. More important, the codeword
length n is the smallest integer for which $g(x) \mid x^n - 1$.
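
Equations (1)-(3) and the cyclic property can be exercised on a small code. The sketch below is illustrative only: it assumes the generator $g(x) = 1 + x + x^3$ of a length-7 code (not an example from the paper), stores a polynomial as a Python integer whose bit j is the coefficient of $x^j$, and uses hypothetical helper names.

```python
def gf2_mul(a, b):
    """Multiply two polynomials over GF(2); bit j of an int is the coefficient of x^j."""
    product = 0
    while b:
        if b & 1:
            product ^= a  # addition in GF(2) is XOR
        a <<= 1
        b >>= 1
    return product


def gf2_mod(a, m):
    """Remainder of a(x) divided by m(x) over GF(2)."""
    deg_m = m.bit_length() - 1
    while a.bit_length() - 1 >= deg_m:
        a ^= m << (a.bit_length() - 1 - deg_m)
    return a


def encode(info, g):
    """Codeword polynomial v(x) = i(x) g(x), as in equation (3)."""
    return gf2_mul(info, g)


def cyclic_shift(v, n):
    """Multiply v(x) by x modulo x^n - 1 (identical to x^n + 1 over GF(2))."""
    return gf2_mod(v << 1, (1 << n) | 1)


N, K = 7, 4
G = 0b1011  # g(x) = 1 + x + x^3, an assumed example generator

divides = gf2_mod((1 << N) | 1, G) == 0           # does g(x) divide x^7 - 1?
codewords = {encode(i, G) for i in range(1 << K)}  # all 2^k codewords
closed = all(cyclic_shift(v, N) in codewords for v in codewords)
print(divides, len(codewords), closed)
```

Because this $g(x)$ divides $x^7 - 1$, every cyclic shift of a codeword lands back in the set of 16 codewords.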


2.3  BCH  Codes 

The BCH codes provide a convenient paradigm for several families of powerful LBCs
including Reed-Solomon [2-9] and Goppa [2] codes. A binary, primitive BCH code is a cyclic code
of length $n = 2^m - 1$. Its generator polynomial numbers among its roots $2t$ consecutive
powers⁴ of a primitive element $\alpha$ of the locator field $GF(2^m)$. With correct decoding, this
code can correct up to t channel errors in every codeword.⁵

Example: Let m = 4 and t = 2. Then n = 15 and the roots of $g(x)$ include
$\alpha$, $\alpha^2$, $\alpha^3$, and $\alpha^4$. Because the conjugates of every root must also be roots and
$\alpha^{15} = 1$, these must also be roots of $g(x)$: $\{\alpha^8, \alpha^6, \alpha^{12}, \alpha^9\}$.
Hence, the degree of $g(x)$ is n - k = 8 so that the dimension k of the code is 7 (i.e., the code
has $2^7 = 128$ codewords). The code is capable of correcting at least t = 2 errors in every
codeword, and the code rate k/n is 0.47 information bits per binary symbol transmitted.

³ A formal definition of ideal is given later.

⁴ The powers $\alpha^0, \alpha^1, \ldots$ of a primitive element of $GF(2^m)$ are the distinct nonzero elements
of that field.

⁵ In order that the codewords be binary, it is necessary, for every root $\beta^i$ of $g(x)$, that all conjugates
$\{\beta^{2i}, \beta^{4i}, \ldots\}$ be roots of $g(x)$ as well.
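
The root count in this example can be reproduced by closing the exponents 1, ..., 2t under doubling modulo n, which is how conjugation acts on powers of $\alpha$. This is a small sketch with hypothetical function names; it only counts roots and does not construct $g(x)$ itself.

```python
def conjugate_exponents(s, n):
    """Exponents of all conjugates of alpha^s: s, 2s, 4s, ... reduced modulo n."""
    exponents = []
    e = s % n
    while e not in exponents:
        exponents.append(e)
        e = (2 * e) % n
    return exponents


def bch_parameters(m, t):
    """Return (n, k, sorted root exponents) for a binary primitive BCH code."""
    n = 2**m - 1
    roots = set()
    for s in range(1, 2 * t + 1):
        roots.update(conjugate_exponents(s, n))
    k = n - len(roots)  # deg g(x) = number of distinct roots = n - k
    return n, k, sorted(roots)


n, k, roots = bch_parameters(m=4, t=2)
print(n, k, roots)         # 15 7 [1, 2, 3, 4, 6, 8, 9, 12]
print(round(k / n, 2))     # 0.47
```

Calling `bch_parameters(4, 3)` under the same assumptions gives k = 5, the dimension quoted for the three-error-correcting example in Section 4.3.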


3  BCH  Decoding 


Of  course  correcting  t  errors  in  a  codeword  of  length  n  requires  a  decoding  procedure  that  can 
achieve  this  error  correcting  potential.  A  trivial  but  completely  correct  decoding  technique 
is  to  construct  a  table  of  every  received  binary  n-tuple  and  the  codeword  into  which  it  must 
be  decoded.  When  symbol  errors  are  independent,  the  rule  is  to  decode  a  received  n-tuple 
into the nearest codeword.⁶

However,  such  table  lookup  decoding  is  feasible  only  for  rather  small  codes,  so  we  continue 
to  be  interested  in  algorithmic,  algebraic  decoders  which  are  much  faster  and  demand  much 
less storage. Let r(x) represent the received vector when a t-error correcting BCH codeword
v(x)  is  transmitted  over  a  channel  corrupted  by  additive  noise: 

r(x)  =  v(x)  +  e(x).  (4) 

e(x)  is  the  error  polynomial:  ej  =  1  if  an  error  occurred  in  the  jth  position  and  0  otherwise. 
The  paradigm  for  many  useful  decoders  of  this  code  is  a  four-step  decoding  procedure: 

•  calculate syndromes, functions of the coefficients of r(x);

•  calculate  coefficients  of  the  error  locator  polynomial; 

•  solve  the  error  locator  polynomial  for  the  locations  of  the  errors;  and 

•  (for  nonbinary  codes)  calculate  the  error  values. 

3.1  The  Syndromes 

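Consider the channel output r(x) as given by (4). The jth syndrome value is:

\[ S_j = r(\alpha^j) = v(\alpha^j) + e(\alpha^j) = e(\alpha^j), \qquad j = 1, \ldots, 2t \tag{5} \]

The term $v(\alpha^j)$ vanishes because $v(x)$ is a multiple of $g(x)$ and each $\alpha^j$, $j = 1, \ldots, 2t$,
is a root of $g(x)$. Writing only those coefficients $e_j$ which are not zero leads to the following
form of the $2t$ syndrome equations:

\[
\begin{aligned}
e_{i_1} \alpha^{i_1} + e_{i_2} \alpha^{i_2} + \cdots + e_{i_t} \alpha^{i_t} &= S_1 \\
e_{i_1} \alpha^{2 i_1} + e_{i_2} \alpha^{2 i_2} + \cdots + e_{i_t} \alpha^{2 i_t} &= S_2 \\
&\;\;\vdots \\
e_{i_1} \alpha^{2t i_1} + e_{i_2} \alpha^{2t i_2} + \cdots + e_{i_t} \alpha^{2t i_t} &= S_{2t}
\end{aligned}
\tag{6}
\]

⁶ Because this is a minimum distance decoding technique, no other decoder can correct more errors on a
memoryless channel.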


Note the following:

(a) In (6), if $\alpha^{i_j}$ for any j is known, then the location $i_j$ of the corresponding error also
is known. It is convenient, therefore, to write $X_j = \alpha^{i_j}$. The values of the $X_j$ are called the
error locators of the received word.

(b) In any field $GF(2^m)$ of characteristic two, $(a + b)^2 = a^2 + b^2$. Therefore, in (6) every
syndrome computed from even powers of $\alpha$ is an even power of some syndrome computed
from odd powers of $\alpha$; e.g., $S_2 = S_1^2$. These are redundant and do not contribute to solving
for the error locators.

(c) In (6), $e_{i_j} = 1$, $j = 1, \ldots, t$. The syndromes $\{S_j,\ j = 1, \ldots, 2t\}$ are known
(computed) elements of $GF(2^m)$ and can be expressed as powers of $\alpha$; i.e., $S_\sigma = \alpha^{j_\sigma}$.

Considering (a), (b), and (c) with (6) gives a system of t polynomial equations, the
solutions to which are the error locators of the received word.

\[
\begin{aligned}
S_1 = \alpha^{j_1} &= X_1 + X_2 + \cdots + X_t \\
S_3 = \alpha^{j_3} &= X_1^3 + X_2^3 + \cdots + X_t^3 \\
&\;\;\vdots \\
S_{2t-1} = \alpha^{j_{2t-1}} &= X_1^{2t-1} + X_2^{2t-1} + \cdots + X_t^{2t-1}
\end{aligned}
\tag{7}
\]
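
Observations (a)-(c) can be checked numerically. The sketch below is an illustration under stated assumptions: it builds $GF(2^4)$ from the primitive polynomial $x^4 + x + 1$ (the paper does not name one), places errors at the arbitrarily chosen positions 3 and 10, and computes $S_j = e(\alpha^j)$ directly from the error positions rather than from a received word.

```python
M = 4
N = 2**M - 1
PRIM_POLY = 0b10011  # x^4 + x + 1, an assumed primitive polynomial for GF(16)

# EXP[i] holds alpha^i as a 4-bit integer; LOG inverts that mapping.
EXP = []
x = 1
for _ in range(N):
    EXP.append(x)
    x <<= 1
    if x >> M:
        x ^= PRIM_POLY
LOG = {value: power for power, value in enumerate(EXP)}


def gf_mul(a, b):
    """Multiply two elements of GF(16) using the log/antilog tables."""
    if a == 0 or b == 0:
        return 0
    return EXP[(LOG[a] + LOG[b]) % N]


def syndromes(error_positions, t):
    """S_j = e(alpha^j), j = 1..2t, where e(x) has ones at error_positions."""
    result = []
    for j in range(1, 2 * t + 1):
        s = 0
        for i in error_positions:
            s ^= EXP[(i * j) % N]  # field addition is XOR
        result.append(s)
    return result


S = syndromes([3, 10], t=2)      # S[0] is S_1, S[1] is S_2, and so on
X1, X2 = EXP[3], EXP[10]         # error locators X_j = alpha^{i_j}

first_power_sum = (S[0] == X1 ^ X2)          # S_1 = X_1 + X_2, as in (7)
s2_redundant = (S[1] == gf_mul(S[0], S[0]))  # S_2 = S_1^2, as in (b)
s4_redundant = (S[3] == gf_mul(S[1], S[1]))  # S_4 = S_2^2
print(first_power_sum, s2_redundant, s4_redundant)
```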


3.2  The  Error  Locator  Polynomial 

Derivation  of  (7),  a  set  of  power-sum  symmetric  functions,  assumed  that  no  more  than  t 
errors occurred in a block of length n. The error locator polynomial is derived from these
functions. 




Definition: The error locator polynomial $\sigma(x)$ is the (univariate) polynomial all the
roots of which indicate the locations of errors in a received word.

\[ \sigma(x) = \prod_{i=1}^{t} (x - X_i) = x^t + \sigma_1 x^{t-1} + \sigma_2 x^{t-2} + \cdots + \sigma_t \tag{8} \]

Decoding is complete when the roots of $\sigma(x)$ are found and the necessary corrections made
to r(x). The Chien search [8] is a method for doing this without explicitly solving $\sigma(x)$. This
method uses a digital circuit which evaluates $\sigma(x)$ at each member $\alpha^j$ of $GF(2^m)$ and sets a
correction bit to unity if $\sigma(x)$ is satisfied. The received polynomial r(x) is clocked through
the circuit and the correction bit is added modulo 2 at the appropriate location. Whenever
a root of $\sigma(x)$ is found, therefore, the appropriate received symbol is complemented. The
Chien search will be required in implementing the direct solution methods discussed below.
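
A software analogue of the search just described can be sketched as follows. It is not the digital circuit of [8]; it simply evaluates $\sigma(x)$ at every $\alpha^j$ and complements the received symbol wherever the result is zero. The field construction (primitive polynomial $x^4 + x + 1$), the error positions 3 and 10, and the use of the all-zero codeword are assumptions made for illustration.

```python
M = 4
N = 2**M - 1
PRIM_POLY = 0b10011  # x^4 + x + 1, an assumed primitive polynomial for GF(16)

EXP = []
x = 1
for _ in range(N):
    EXP.append(x)
    x <<= 1
    if x >> M:
        x ^= PRIM_POLY
LOG = {value: power for power, value in enumerate(EXP)}


def gf_mul(a, b):
    """Multiply two elements of GF(16)."""
    if a == 0 or b == 0:
        return 0
    return EXP[(LOG[a] + LOG[b]) % N]


def poly_eval(coeffs, point):
    """Horner evaluation over GF(16); coeffs run from the highest power down."""
    acc = 0
    for c in coeffs:
        acc = gf_mul(acc, point) ^ c
    return acc


def chien_search(sigma):
    """Exponents j for which sigma(alpha^j) = 0, i.e. the error positions."""
    return [j for j in range(N) if poly_eval(sigma, EXP[j]) == 0]


# sigma(x) = (x - X1)(x - X2) = x^2 + (X1 + X2) x + X1 X2 in characteristic two.
X1, X2 = EXP[3], EXP[10]
sigma = [1, X1 ^ X2, gf_mul(X1, X2)]

received = [0] * N            # all-zero codeword ...
received[3] ^= 1              # ... with errors injected at positions 3 and 10
received[10] ^= 1

positions = chien_search(sigma)
corrected = list(received)
for j in positions:
    corrected[j] ^= 1         # complement the symbol at each located error
print(positions)              # [3, 10]
```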


4  Direct  Solution  Techniques 

The objective is to find a solution set to (7):

\[
\begin{aligned}
\alpha^{j_1} &= X_1 + X_2 + \cdots + X_t \\
\alpha^{j_3} &= X_1^3 + X_2^3 + \cdots + X_t^3 \\
&\;\;\vdots \\
\alpha^{j_{2t-1}} &= X_1^{2t-1} + X_2^{2t-1} + \cdots + X_t^{2t-1}
\end{aligned}
\tag{9}
\]

where $\alpha$ is a primitive element in $GF(2^m)$. Because the number of errors in a received word
does not exceed t, (9) is a system F of t independent equations with at most t solutions. Hence
F is a system of t polynomials in t unknowns and has one unique solution, $\beta = (\beta_1, \ldots, \beta_t)$⁷.


4.1  Rings  and  Ideals 

Direct solution techniques of (9) exploit the rich algebraic structure of the ring $R = K[X] =
K[X_1, X_2, \ldots, X_t]$ of polynomials in t variables over $K = GF(2^m)$ [11]. A subset $\mathcal{I}$ of a ring
is called an ideal if it is a subgroup of the additive group of the ring and if, for every $i \in \mathcal{I}$
and every $r \in R$, both $ir$ and $ri$ belong to $\mathcal{I}$. Hilbert’s Basis Theorem [10] requires that
every ideal in $K[X]$ have a finite basis.

⁷ Actually, the rigorously correct statement is that all zeros of the system are “equivalent” and “mapped
on one another by an isomorphism which leaves fixed the elements of the ground field...” [10]

Consider F to be a subset of the ring $K[X]$. The set $I(F)$ spanned by members of F
(where coefficients are taken from $K[X]$) is an ideal in $K[X]$:

\[ I(F) \triangleq (F) \subset K[X]. \tag{10} \]


The common zeros of the polynomials of F are said to form an algebraic manifold [10],
which is “defined by” those polynomials. All points of the manifold satisfy every member
of $I(F)$. Direct solution techniques require searching $I(F)$ for another set G of polynomials
which are simpler to solve than those in F. Hence, new methods for finding bases of ideals
in $K[X]$ bear on the decoding problem.


4.2  A  Basis  for  the  Ideal 

The objective is to find for $I(F)$ a basis G which is “easily” solved for the underlying roots.

The  basis  G  is  obtained  from  the  defining  polynomial  set  F  by  applying  transformations 
which  do  not  eliminate  any  roots  of  the  system. 

Example: Suppose set F is:

\[
\begin{aligned}
f_1 &: \; X_1 + X_2 + \alpha^j = 0 \\
f_2 &: \; X_1^3 + X_2^3 + \alpha^k = 0,
\end{aligned}
\tag{11}
\]

and suppose that it is known that this system has the solution $(\beta_1, \beta_2) \in GF(2^m)^2$. Then

\[ y(X) = a_1(X) f_1(X) + a_2(X) f_2(X) \tag{12} \]

is satisfied by $(\beta_1, \beta_2)$ as well⁸.

Suppose $a_2(X) = 1$ and

\[ a_1(X) = X_1^2 + X_1 (X_2 + \alpha^j) + (X_2 + \alpha^j)^2. \tag{13} \]

Then,

\[ y(X) = X_2^2 \alpha^j + X_2 \alpha^{2j} + \alpha^{3j} + \alpha^k, \tag{14} \]

and the system has been reduced from two equations (a cubic and a linear) to a single,
univariate, second degree equation having the same solution. We say that the cubic $f_2$
has been reduced modulo F to $y(X)$.

⁸ Of course, if $a_1(X)$ and $a_2(X)$ have a common factor, $y(X)$ may have an additional root that does not
satisfy $f_1$ or $f_2$, but this case is of no interest.
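
The reduction in this example can be followed step by step. The short derivation below is a worked check rather than part of the original text; the abbreviation $w = X_2 + \alpha^j$ is introduced here only for convenience, and every step uses the fact that $2 = 0$ and $3 = 1$ in a field of characteristic two.

```latex
\begin{aligned}
f_1 &= X_1 + w, \qquad a_1 = X_1^2 + X_1 w + w^2, \qquad w = X_2 + \alpha^j, \\
a_1 f_1 &= X_1^3 + 2 X_1^2 w + 2 X_1 w^2 + w^3 = X_1^3 + w^3, \\
w^3 &= X_2^3 + X_2^2 \alpha^j + X_2 \alpha^{2j} + \alpha^{3j}, \\
y &= a_1 f_1 + f_2
   = \left( X_1^3 + w^3 \right) + \left( X_1^3 + X_2^3 + \alpha^k \right)
   = X_2^2 \alpha^j + X_2 \alpha^{2j} + \alpha^{3j} + \alpha^k .
\end{aligned}
```

The cubic terms in both variables cancel, leaving a quadratic in $X_2$ alone.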


The algorithm for deriving the desired ideal basis G is based upon such reduction
operations. It produces a reduced Gröbner basis [12] of the ideal spanned by F. A reduced
Gröbner basis G is a set of polynomials:

•  which  is  a  basis  of  the  ideal; 

•  each  member  of  which  has  coefficient  of  highest  order  term  =  1; 

•  no  element  of  which  can  be  reduced  modulo  G. 


It is known [12] that a reduced Gröbner basis for $I(F)$ can be written in triangularized
form:

\[
\begin{aligned}
g_1 &= g_1(X_1) \\
g_2 &= g_2(X_1, X_2) \\
&\;\;\vdots \\
g_t &= g_t(X_1, X_2, \ldots, X_t)
\end{aligned}
\tag{15}
\]


This suggests a recursive root finding technique. However, the univariate member $g_1$ of the
set is, in fact, the BCH error locator polynomial [13].⁹

⁹ At worst, $g_1$ is isomorphic to the error locator polynomial. As shown in [13], however, the isomorphism
is trivial.


4.3 Gröbner Bases as a Basis for Decoding

It would be redundant to include here the general form of Buchberger’s algorithm for finding
the Gröbner basis of an ideal $I(F)$. The interested reader should refer to the literature, of
which [12] is the most comprehensive source. The method is illustrated in this example:

Example: This is the general form of the problem. Take K to be $GF(2^4)$, and t = 3.
Then the resulting $g(x)$ generates a 3-error correcting code with block length $n = 2^4 - 1 = 15$,
dimension k = 5, and 32 codewords. In general, the decoder produces these non-redundant
syndromes:

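\[
\begin{aligned}
X_1 + X_2 + X_3 + \alpha^i &= 0 \\
X_1^3 + X_2^3 + X_3^3 + \alpha^j &= 0 \\
X_1^5 + X_2^5 + X_3^5 + \alpha^k &= 0.
\end{aligned}
\tag{16}
\]

Define three intermediate polynomials,

\[
\begin{aligned}
p_1(X) &= \sum_{\ell=0}^{2} X_3^{\ell} \, (X_2 + X_1 + \alpha^i)^{2-\ell} \\
p_2(X) &= \sum_{\ell=0}^{4} X_3^{\ell} \, (X_2 + X_1 + \alpha^i)^{4-\ell} \\
p_3(X) &= X_2^2 + X_2 X_1 + X_2 \alpha^i + X_1^2 + X_1 \alpha^i + \alpha^{2i},
\end{aligned}
\tag{17}
\]

and from these produce three “coefficient” polynomials:

\[
\begin{aligned}
a_1(X) &= p_1 p_3 (X_1 + \alpha^i) + p_2 (X_1 + \alpha^i) + p_1 (\alpha^j + \alpha^{3i}) \\
a_2(X) &= p_3 (X_1 + \alpha^i) + \alpha^j + \alpha^{3i} \\
a_3(X) &= X_1 + \alpha^i.
\end{aligned}
\tag{18}
\]

Substitute the $p_i$ into the $a_i$ to get (terms of $a_1$ grouped by powers of $X_3$)

\[
\begin{aligned}
a_1(X) ={} & X_3^4 \left( X_1 + \alpha^i \right)
  + X_3^3 \left( X_1^2 + X_1 X_2 + \alpha^i X_2 + \alpha^{2i} \right) \\
& + X_3^2 \left( X_1^2 X_2 + \alpha^i X_1^2 + \alpha^{2i} X_1 + \alpha^{2i} X_2 + \alpha^{3i} + \alpha^j \right) \\
& + X_3 \left( X_1^3 X_2 + X_1^2 X_2^2 + \alpha^i X_1^3 + \alpha^{2i} X_2^2 + \alpha^j X_1 + \alpha^j X_2 + \alpha^{i+j} + \alpha^{4i} \right) \\
& + X_1^4 X_2 + X_1^2 X_2^3 + \alpha^i X_1^4 + \alpha^i X_1^2 X_2^2 + \alpha^{2i} X_1^3 + \alpha^{2i} X_1 X_2^2 + \alpha^{2i} X_2^3 \\
& + \alpha^{3i} X_2^2 + \alpha^j X_1^2 + \alpha^j X_2^2 + \alpha^{4i} X_1 + \alpha^{4i} X_2 + \alpha^{j+2i} + \alpha^{5i} \\
a_2(X) ={} & X_1 X_2^2 + \alpha^i X_2^2 + X_1^2 X_2 + \alpha^{2i} X_2 + X_1^3 + \alpha^j.
\end{aligned}
\]

This yields a univariate polynomial which we recognize as the error locator polynomial:

\[
\begin{aligned}
\sigma(X_1) &= \sum_{\nu=1}^{3} a_\nu(X) f_\nu(X) \\
&= X_1^3 (\alpha^j + \alpha^{3i}) + X_1^2 (\alpha^{i+j} + \alpha^{4i}) + X_1 (\alpha^k + \alpha^{2i+j}) \\
&\quad + \alpha^{i+k} + \alpha^{2j} + \alpha^{3i+j} + \alpha^{6i},
\end{aligned}
\tag{19}
\]

where $f_\nu(X)$ denotes the left-hand side of the $\nu$th equation of (16).

Finding $\sigma(X_1)$ solves the decoding problem.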
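
The combination in (19) can be spot-checked numerically. The sketch below is a brute-force check, not Buchberger's algorithm: it evaluates $a_1 f_1 + a_2 f_2 + a_3 f_3$ from (16)-(18) at every point of $GF(2^4)^3$ and compares it with the right-hand side of (19) as a polynomial in $X_1$. The primitive polynomial $x^4 + x + 1$, the sample syndrome values, and the error positions 2, 7 and 11 are assumptions chosen for illustration.

```python
from itertools import product

M = 4
N = 2**M - 1
PRIM_POLY = 0b10011  # x^4 + x + 1, an assumed primitive polynomial for GF(16)

EXP = []
x = 1
for _ in range(N):
    EXP.append(x)
    x <<= 1
    if x >> M:
        x ^= PRIM_POLY
LOG = {value: power for power, value in enumerate(EXP)}


def mul(a, b):
    """Multiply two elements of GF(16)."""
    return 0 if a == 0 or b == 0 else EXP[(LOG[a] + LOG[b]) % N]


def power(a, e):
    """a**e in GF(16) by repeated multiplication (a**0 is 1)."""
    result = 1
    for _ in range(e):
        result = mul(result, a)
    return result


def sigma(x1, c, d, e):
    """Right-hand side of (19) with c = alpha^i, d = alpha^j, e = alpha^k."""
    c3 = power(c, 3)
    return (mul(d ^ c3, power(x1, 3)) ^ mul(mul(c, d) ^ power(c, 4), power(x1, 2))
            ^ mul(e ^ mul(power(c, 2), d), x1)
            ^ mul(c, e) ^ power(d, 2) ^ mul(c3, d) ^ power(c, 6))


def combination(x1, x2, x3, c, d, e):
    """a_1 f_1 + a_2 f_2 + a_3 f_3 built from (16), (17) and (18)."""
    f1 = x1 ^ x2 ^ x3 ^ c
    f2 = power(x1, 3) ^ power(x2, 3) ^ power(x3, 3) ^ d
    f3 = power(x1, 5) ^ power(x2, 5) ^ power(x3, 5) ^ e
    w, u, s = x1 ^ x2 ^ c, x1 ^ c, d ^ power(c, 3)
    p1 = p2 = 0
    for l in range(3):
        p1 ^= mul(power(x3, l), power(w, 2 - l))
    for l in range(5):
        p2 ^= mul(power(x3, l), power(w, 4 - l))
    p3 = (power(x2, 2) ^ mul(x2, x1) ^ mul(x2, c)
          ^ power(x1, 2) ^ mul(x1, c) ^ power(c, 2))
    a1 = mul(mul(p1, p3), u) ^ mul(p2, u) ^ mul(p1, s)
    a2 = mul(p3, u) ^ s
    a3 = u
    return mul(a1, f1) ^ mul(a2, f2) ^ mul(a3, f3)


# Identity check at all 16^3 points for one arbitrary choice of syndromes.
c, d, e = EXP[1], EXP[5], EXP[9]
identity_holds = all(combination(x1, x2, x3, c, d, e) == sigma(x1, c, d, e)
                     for x1, x2, x3 in product(range(16), repeat=3))

# Decoding check: syndromes of errors at positions 2, 7, 11 give sigma whose
# roots alpha^j are exactly those positions.
errors = [2, 7, 11]
s1 = s3 = s5 = 0
for i in errors:
    s1 ^= EXP[i % N]
    s3 ^= EXP[(3 * i) % N]
    s5 ^= EXP[(5 * i) % N]
located = [j for j in range(N) if sigma(EXP[j], s1, s3, s5) == 0]
print(identity_holds, located)
```

Note that the leading coefficient of (19) is $\alpha^j + \alpha^{3i}$ rather than 1, so the roots, not the coefficients, are what coincide with the monic form (8).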
5 Conclusion


Mathematically,  we  have  shown  a  decoder  that  computes  a  set  of  syndrome  values  which  are 
functions  of  the  roots  of  the  code’s  generator  polynomial  and  of  the  error  locations.  These 
syndromes  are  the  constant  terms  of  a  system  of  nonlinear  polynomials.  We  have  presented 
a  method  for  extracting  from  that  system  the  error  locator  polynomial,  which  is  satisfied  by 
the error locations expressed as elements of $GF(2^m)$. The coefficients of the error locator
polynomial  are  functions  of  the  syndrome  values  only.  Thus,  the  decoder  need  do  only  two 
things:  compute  syndromes  and  coefficients. 

Work  is  ongoing  to  generalize  this  method  and  to  extend  it  to  Reed-Solomon  and  Goppa 
codes. 


References 


[1]  Hamming, R.W., “Error detecting and error correcting codes,” B.S.T.J. 29 (April 1950),
147-160.

[2]  McEliece, R.J., “The Theory of Information and Coding,” in Encyclopedia of Mathematics
and its Applications, Volume 3, Cambridge University Press, Cambridge, 1977.

[3]  Peterson, W.W. & E.J. Weldon, Jr., Error-Correcting Codes, MIT Press, Cambridge,
1972.

[4]  Lin, S. & D.J. Costello, Jr., Error Control Coding, Prentice-Hall, Englewood Cliffs, 1983.

[5]  Blahut, R.E., Theory and Practice of Error Control Codes, Addison-Wesley, Reading,
1983.

[6]  MacWilliams, F.J. & N.J.A. Sloane, The Theory of Error Correcting Codes,
North-Holland, Amsterdam, 1977.

[7]  Berlekamp, E.R., Algebraic Coding Theory, McGraw-Hill, New York, 1968.

[8]  Michelson, A.M. & A.H. Levesque, Error-Control Techniques for Digital Communication,
Wiley, New York, 1985.

[9]  Pless, V., Introduction to the Theory of Error-Correcting Codes, Wiley-Interscience, New
York, 1982.

[10]  van der Waerden, B.L., Modern Algebra - Volume II, Frederick Ungar, New York, 1950.

[11]  van der Waerden, B.L., Modern Algebra - Volume I, Frederick Ungar, New York, 1953.

[12]  Buchberger, B., “An Algorithmic Method in Polynomial Ideal Theory,” in
Multidimensional Systems Theory, N.K. Bose, ed., Mathematics and Its Applications, D. Reidel,
Boston, 1985.

[13]  Cooper, A.B., III, “Finding BCH error locator polynomials in one step,” Electronics
Letters 27 (24 October 1991), 2090-2091.



An Algorithm for the Computation of
Gröbner Bases

W. W. Adams, A. Boyle, P. Loustaunau


Abstract 

Let $R$ be a Noetherian integral domain which is graded by an
ordered group $\Gamma$ and let $\mathbf{x}$ be a set of $n$ variables with a term order.
In this paper we present a new algorithm for computing Gröbner bases
in the ring $R[\mathbf{x}]$. This algorithm is based on the authors' earlier paper
[2]. In the case where $R = k[\mathbf{y}]$ is graded by a term order, this
gives a new algorithm for computing Gröbner bases in $k[\mathbf{y},\mathbf{x}]$. This
algorithm requires the computation of many Gröbner bases, but in
fewer variables than the usual Buchberger algorithm.


1  Introduction 

Let $\mathbf{y}$ and $\mathbf{x}$ be sets of variables, each with a term order and an elimination
order between them. Let $k$ denote a Noetherian commutative ring. In several
places in the literature (e.g. [1], [2], [3], [5], [9], [10]), the problem of lifting
Gröbner bases from the ring $k[\mathbf{y}]$ to the ring $k[\mathbf{y},\mathbf{x}]$ has been investigated.
This entails understanding the difference between a Gröbner basis in $(k[\mathbf{y}])[\mathbf{x}]$
and a Gröbner basis in $k[\mathbf{y},\mathbf{x}]$. We refer to this as the transitivity question.
This problem was examined in [1] mainly for the case when $\mathbf{x} = x$ consisted
of a single variable. There it was shown that certain subsets of the leading
coefficients with respect to $x$ must form Gröbner bases in $k[\mathbf{y}]$ in order to go
from a Gröbner basis in $(k[\mathbf{y}])[x]$ to a Gröbner basis in $k[\mathbf{y},x]$. The results in
[1] were generalized in [2] to the case where $\mathbf{x}$ is more than one variable. The
key concept was the so-called saturated sets of polynomials as introduced
by Möller in [7] (where they were called maximal sets). In this paper we
provide a preliminary discussion of how these ideas can be used to give a
new algorithm for computing Gröbner bases.

In [2] we found that graded rings are a natural setting for the transitivity
question. So, let $R$ be a Noetherian integral domain and assume that $R$
is graded by an ordered group $\Gamma$. The concept of Gröbner basis can be
extended to such graded rings $R$ (see, for example, [2], [6]). If $\mathbf{x}$ denotes a set
of $n$ variables, the ring $R[\mathbf{x}]$ can be graded both by $\Gamma \times \mathbb{Z}^n$ and by $\{0\} \times \mathbb{Z}^n$.
Then the transitivity question in this setting becomes: when is a Gröbner
basis in $R[\mathbf{x}]$, graded by $\{0\} \times \mathbb{Z}^n$, also a Gröbner basis in $R[\mathbf{x}]$, graded by
$\Gamma \times \mathbb{Z}^n$? The solution is stated in terms of certain sets of leading coefficients,
corresponding to the so-called saturated subsets, being Gröbner bases in $R$.

In Section 2 we will give the definitions for the generalization of the
concept of Gröbner bases to graded rings and state the usual characterizations
of Gröbner bases in this context. In Section 3 we will give some computability
conditions on a graded ring so that we can give the usual Buchberger
algorithm for computing Gröbner bases in such rings. In Section 4 we recall
from [2] the results on transitivity that allow us to give our new algorithm.
This algorithm is presented in Section 5, and an example is given of computing
a Gröbner basis by this method.


2  Graded  Rings 

In this section we briefly review the definitions for the theory of Gröbner
bases in a general graded ring. For more details see [2].

Let $\Gamma$ be an additive abelian group which is totally ordered with respect
to an order, denoted by "<", and where we assume that the order respects
the group law. The latter means that for all $\gamma, \delta, \eta \in \Gamma$, we have that $\gamma < \delta$
implies that $\gamma + \eta < \delta + \eta$. We assume that $R$ is a Noetherian integral domain
graded by $\Gamma$. Thus $R = \bigoplus_{\gamma \in \Gamma} R_\gamma$, where each $R_\gamma$ is an additive abelian group
and $R_\gamma R_\delta \subseteq R_{\gamma+\delta}$ for all $\gamma, \delta \in \Gamma$. Let $\Gamma_0 = \{\gamma \in \Gamma \mid R_\gamma \neq 0\}$. Since $0 \in \Gamma_0$
($1 = 1^2 \in R_0$), we see that $\Gamma_0$ is, in fact, a submonoid of $\Gamma$. We will assume
that $\Gamma_0$ generates $\Gamma$. We will also assume that $\Gamma_0$ is well ordered. We note
that this is equivalent to $0 \le \gamma$ for all $\gamma \in \Gamma_0$.

Now for each $a \in R$ ($a \neq 0$) we may write $a = \sum_{\gamma \le \gamma_0} a_\gamma$, with $a_\gamma \in R_\gamma$
and $a_{\gamma_0} \neq 0$. We define $\mathrm{lt}(a) = a_{\gamma_0}$ and $v(a) = \gamma_0$. Set $\mathrm{lt}(0) = 0$ and $v(0) = 0$.
We call $\mathrm{lt}(a)$ the leading term of $a$ and $v(a)$ the value of $a$. Since $R$ is an
integral domain, we have, for all $a, b \in R$, $\mathrm{lt}(ab) = \mathrm{lt}(a)\,\mathrm{lt}(b)$, and if $ab \neq 0$
then $v(ab) = v(a) + v(b)$. For a subset $F \subseteq R$, we set $\mathrm{Lt}(F) = \langle \mathrm{lt}(a) \mid a \in F \rangle$.
(Here the symbol $\langle \cdots \rangle$ denotes the ideal generated by $\cdots$.) Clearly $\mathrm{Lt}(F)$ is
a homogeneous ideal.

Definition 2.1 Let $I$ be an ideal in $R$ and let $F$ be a subset of $I$. We call
$F$ a Gröbner basis for $I$ provided that $F$ is finite and $\mathrm{Lt}(F) = \mathrm{Lt}(I)$.

We will say that a subset $F$ of $R$ is a Gröbner basis provided that $F$ is
a Gröbner basis of the ideal $\langle F \rangle$ that it generates.

In our context we also have reduction. Let $F = \{f_1, \dots, f_s\}$ be a finite
subset of $R$. For $f, g \in R$, we say that $f$ reduces to $g$ modulo $F$, denoted
$f \xrightarrow{F} g$, provided that $f - g = \sum_{i=1}^{s} a_i f_i$, where $\mathrm{lt}(f - g) = \mathrm{lt}(f)$ and
$v(\mathrm{lt}(f - g)) = \max_i v(\mathrm{lt}(a_i)\,\mathrm{lt}(f_i))$. We let $\xrightarrow{F}_{+}$ denote the transitive, reflexive
closure of $\xrightarrow{F}$. We say that $f$ is reduced provided there is no $g$ such that
$f \xrightarrow{F} g$.

The following theorem provides in our context the usual equivalent
conditions for a set to be a Gröbner basis. It parallels exactly the corresponding
Theorem 1 in [7] and is proved in exactly the same way. (See [2].)


Theorem 2.2 Let $F = \{f_1, \dots, f_s\}$ be a finite subset of $R$. Then the
following statements are equivalent:

1. $F$ is a Gröbner basis.

2. For every $a \in \langle F \rangle$ we can write

$$a = \sum_{i=1}^{s} r_i f_i \quad\text{where}\quad v(a) = \max_{1 \le i \le s} v(r_i f_i) \tag{1}$$

and $r_i \in R$.

3. Let $B$ be any finite basis of the syzygy module consisting of all sequences
$(b_1, \dots, b_s)$ where $b_i \in R$ such that $\sum_{i=1}^{s} b_i\,\mathrm{lt}(f_i) = 0$, and where we may
assume that for all $(b_1, \dots, b_s) \in B$ we have that there is a $\gamma \in \Gamma$
satisfying $v(b_i\,\mathrm{lt}(f_i)) = \gamma$ for all $i$ such that $b_i \neq 0$. Then for any
sequence $(b_1, \dots, b_s) \in B$ we have that

$$a = \sum_{i=1}^{s} b_i f_i$$

has a representation as in Equation 1 above.

4. $a \xrightarrow{F}_{+} 0$ for every $a \in \langle F \rangle$.

5. For $B$ as in 3 and for every $(b_1, \dots, b_s) \in B$ we have

$$\sum_{i=1}^{s} b_i f_i \xrightarrow{F}_{+} 0.$$

3  Gröbner Bases in Graded Rings

In order to compute Gröbner bases in a graded ring $R$ we need to make some
computability assumptions about $R$. We first assume that the grading is
effective. By that we mean that given any $r \in R$ we can effectively
determine $\mathrm{lt}(r)$. It then follows, since $\Gamma_0$ is well ordered, that we can effectively
decompose $r$ into its homogeneous components. We further assume that we
can do effective computations in homogeneous ideals. That is, we can answer
the membership and representation questions for homogeneous ideals. We
finally assume that we can determine a basis of the syzygy module of a tuple
of homogeneous elements of $R$ (we can then determine a homogeneous basis
because of our assumption that the grading is effective). (Note that if $R$ is a
polynomial ring over a field $k = R_0$, and $\Gamma$ is given by a term ordering, then
these conditions are equivalent to the usual assumptions that one can solve
linear equations inside $k$.)

Using these assumptions, it is easy to see that the process of reduction
described in the last section is also effective. This observation and Theorem
2.2 allow us to give an algorithm for the computation of Gröbner bases in $R$,
which is basically the usual Buchberger algorithm over rings (cf. [4] or [7]).
ALGORITHM: GradedGB($\{r_1, \dots, r_s\}$, $\Gamma$)
INPUT: $r_1, \dots, r_s$ in $R$.

OUTPUT: A Gröbner basis for $\langle r_1, \dots, r_s \rangle$.

$G := \{r_1, \dots, r_s\}$

$F := \emptyset$

While $F \neq G$ do
    $F := G$

    Compute a homogeneous basis $B_1, \dots, B_t$
    of the syzygies of $(\mathrm{lt}(r) \mid r \in F)$

    For $i = 1$ to $t$ do

        Set $B_i = (b_{i1}, \dots, b_{i\ell})$, where $F = \{r_1, \dots, r_\ell\}$

        $\sum_{j=1}^{\ell} b_{ij} r_j \xrightarrow{F}_{+} r$, where $r$ is reduced
        If $r \neq 0$ then $G := G \cup \{r\}$

We note that in this algorithm, the new elements obtained in the Gröbner
basis are effectively expressible in terms of the input polynomials. We will
need this for our application in Section 5. We also note that given the
hypotheses above on $R$, we are able to compute Gröbner bases for any ideal
in $R$, and hence we are able to answer the ideal membership question and
the syzygy question for arbitrary ideals in $R$.

As an easy example, we consider $R = k[x, y]$ for a field $k$ and $\Gamma = \mathbb{Z}$, where
the grading is given by total degree. Let $f_1 = x^2y + xy^2 + y$, $f_2 = y^3 + x + y$.
A homogeneous basis for the syzygy module of $(\mathrm{lt}(f_1), \mathrm{lt}(f_2))$ is given by
$(-y^2, x^2 + xy)$. We compute $-y^2 f_1 + (x^2 + xy) f_2 = -y^3 + x^3 + 2x^2y + xy^2$.
Since the latter polynomial cannot be reduced by $f_1$ and $f_2$, we denote it by
$f_3$ and add it to the basis. A homogeneous basis for the syzygy module of
$(\mathrm{lt}(f_1), \mathrm{lt}(f_2), \mathrm{lt}(f_3))$ is given by $(x + y, -y, -y)$. Since $(x + y) f_1 + (-y) f_2 + (-y) f_3 =
0$, the while loop ends and we have that $\{f_1, f_2, f_3\}$ is a Gröbner basis for
the ideal $\langle f_1, f_2 \rangle$.
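
The arithmetic in this example can be checked mechanically. The following is a minimal sketch using a hand-rolled dictionary representation of polynomials in $x, y$ with integer coefficients; the helper names (`poly`, `add`, `mul`, `leading_form`, `combine`) are illustrative and not part of the paper. The second syzygy is taken here as $(x + y, -y, -y)$, the sign choice for which the leading forms cancel.

```python
from collections import defaultdict


def poly(*terms):
    """Polynomial in x, y as {(deg_x, deg_y): coeff} from (coeff, deg_x, deg_y) triples."""
    p = defaultdict(int)
    for c, i, j in terms:
        p[(i, j)] += c
    return {m: c for m, c in p.items() if c != 0}


def add(p, q):
    r = defaultdict(int, p)
    for m, c in q.items():
        r[m] += c
    return {m: c for m, c in r.items() if c != 0}


def mul(p, q):
    r = defaultdict(int)
    for (i1, j1), c1 in p.items():
        for (i2, j2), c2 in q.items():
            r[(i1 + i2, j1 + j2)] += c1 * c2
    return {m: c for m, c in r.items() if c != 0}


def leading_form(p):
    """Leading term for the total-degree grading: the top-degree homogeneous part."""
    top = max(i + j for i, j in p)
    return {m: c for m, c in p.items() if sum(m) == top}


def combine(coeffs, polys):
    """Return sum of coeffs[i] * polys[i]."""
    total = {}
    for b, f in zip(coeffs, polys):
        total = add(total, mul(b, f))
    return total


f1 = poly((1, 2, 1), (1, 1, 2), (1, 0, 1))             # x^2 y + x y^2 + y
f2 = poly((1, 0, 3), (1, 1, 0), (1, 0, 1))             # y^3 + x + y

syz1 = [poly((-1, 0, 2)), poly((1, 2, 0), (1, 1, 1))]  # (-y^2, x^2 + x y)
f3 = combine(syz1, [f1, f2])                           # new basis element

syz2 = [poly((1, 1, 0), (1, 0, 1)),                    # x + y
        poly((-1, 0, 1)),                              # -y
        poly((-1, 0, 1))]                              # -y

leading = [leading_form(f) for f in (f1, f2, f3)]
```

Here `combine(syz1, leading[:2])` and `combine(syz2, leading)` confirm that both tuples are syzygies of the leading terms, and `combine(syz2, [f1, f2, f3])` is the combination that the while loop tests.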

4  Transitivity for Gröbner Bases

Let $\mathbf{x} = \{x_1, \dots, x_n\}$ be variables. Assume that we have a term order "<" on
$\mathbb{Z}^n$. (We note that this is just an order on the group $\mathbb{Z}^n$ in the sense given in
Section 2.) Given any integral domain $R$ there is the natural $\mathbb{Z}^n$ grading on
$R[\mathbf{x}]$ whose non-zero homogeneous summands are indexed precisely by $\mathbb{N}^n$.
Now assume that $R$ is, in fact, a $\Gamma$-graded ring as above. Set $\Lambda = \Gamma \times \mathbb{Z}^n$.
Then we can define a $\Lambda$-grading on $R[\mathbf{x}]$ where, for $\gamma \in \Gamma$ and $\nu \in \mathbb{Z}^n$ we
have

$$R[\mathbf{x}]_{(\gamma,\nu)} = R_\gamma\, \mathbf{x}^\nu$$

provided that $\nu \in \mathbb{N}^n$ and is $\{0\}$ otherwise. (By $\mathbf{x}^\nu$ we mean $x_1^{\nu_1} \cdots x_n^{\nu_n}$,
where $\nu = (\nu_1, \dots, \nu_n)$.) We see that $\Lambda_0 = \Gamma_0 \times \mathbb{N}^n$. We will define an order
on $\Lambda$ as follows.

Definition 4.1 The elimination order on $\Lambda$ is defined as $(\gamma_1, \nu_1) < (\gamma_2, \nu_2)$
if and only if $\nu_1 < \nu_2$, or $\nu_1 = \nu_2$ and $\gamma_1 < \gamma_2$.
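
As an illustration of Definition 4.1, the sketch below compares terms of a polynomial in $\mathbb{Q}[z,t][x,y]$. It assumes that both component orders are lexicographic, so that Python's built-in tuple comparison can stand in for them; the term encoding `(coeff, gamma, nu)` and the function names are hypothetical.

```python
def elimination_key(term):
    """Sort key for the elimination order: compare nu first, then gamma."""
    coeff, gamma, nu = term
    return (nu, gamma)


def leading_term_lambda(terms):
    """Leading term with respect to Lambda = Gamma x Z^n (a single monomial)."""
    return max(terms, key=elimination_key)


def leading_term_x(terms):
    """Leading term with respect to {0} x Z^n: all terms sharing the top x-exponent."""
    top_nu = max(nu for _, _, nu in terms)
    return [t for t in terms if t[2] == top_nu]


# f = x t - x z^2 + y z, with gamma = (deg z, deg t) and nu = (deg x, deg y),
# lex with z > t on gamma and lex with x > y on nu (assumed orders).
f = [
    (1, (0, 1), (1, 0)),    #  x t
    (-1, (2, 0), (1, 0)),   # -x z^2
    (1, (1, 0), (0, 1)),    #  y z
]

lt_lambda = leading_term_lambda(f)
lt_x = leading_term_x(f)
```

With these orders `lt_lambda` is the single term $-xz^2$, while `lt_x` collects $xt - xz^2 = x(t - z^2)$: the coarser grading keeps a whole coefficient in $R$, and the elimination order then picks the leading term of that coefficient.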

This generalizes the concept of elimination order that occurs in the
literature, for example, in the computer algebra system Macaulay, and also in [1]
and [2]. It is easily seen that Definition 4.1 makes $\Lambda$ into an ordered group
satisfying the conditions assumed above.

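Now given $a \in R$ we use the notation $\mathrm{lt}_\Gamma(a)$ and $v_\Gamma(a)$ to specify the
leading term and value of $a$ as defined in Section 2. If $f \in R[\mathbf{x}]$ we
denote the same concepts with respect to $\Lambda$ by $\mathrm{lt}_\Lambda(f)$ and $v_\Lambda(f)$ respectively.
Write $f = a\mathbf{x}^\nu +$ lower terms in $\mathbf{x}$, where $a \in R$ and $a \neq 0$. Then set
$\mathrm{lt}_x(f) = a\mathbf{x}^\nu$, $\mathrm{lp}_x(f) = \mathbf{x}^\nu$, $\mathrm{lc}_x(f) = a$ and $v_x(f) = \nu$. Of course, $\mathrm{lt}_x$ and $v_x$ are
the leading term and value concepts in $R[\mathbf{x}]$ with respect to the group $\{0\} \times \mathbb{Z}^n$.
Also, $\mathrm{lp}_x(f)$ is called the leading power product of $f$ and $\mathrm{lc}_x(f)$ is called the
leading coefficient of $f$. We note that $\mathrm{lt}_\Lambda(f) = \mathrm{lt}_\Lambda(\mathrm{lt}_x(f)) = \mathrm{lt}_\Gamma(\mathrm{lc}_x(f))\,\mathrm{lp}_x(f)$
and $v_\Lambda(f) = (v_\Gamma(\mathrm{lc}_x(f)), v_x(f))$. We also define $\mathrm{Lt}_\Gamma$, $\mathrm{Lt}_x$ and $\mathrm{Lt}_\Lambda$ as in
Section 2. The first will give homogeneous ideals in $R$ and the latter two will
give ideals in $R[\mathbf{x}]$, homogeneous with respect to $\{0\} \times \mathbb{Z}^n$ and $\Lambda$ respectively.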

In order to state our algorithm we must relate Gröbner bases in $R[\mathbf{x}]$ with
respect to $\Lambda = \Gamma \times \mathbb{Z}^n$ to Gröbner bases in $R[\mathbf{x}]$ with respect to $\{0\} \times \mathbb{Z}^n$ and
Gröbner bases in $R$ with respect to $\Gamma$ of certain subsets of leading coefficients.
We now define these subsets of $R$. Let $F = \{f_1, \dots, f_s\}$ be a set of polynomials
in $R[\mathbf{x}]$. We adopt the notation that

$$f_i = a_i X_i + \text{lower terms in the } \mathbf{x} \text{ variables},$$

where $X_i$ is a power product in the $\mathbf{x}$ variables, and $a_i \in R$. That is, $\mathrm{lc}_x(f_i) =
a_i$ and $\mathrm{lp}_x(f_i) = X_i$. We will continue using this notation throughout the
paper.

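For each subset $S$ of $\{1, \dots, s\}$ we define

•  $D_S = \mathrm{lcm}_{i \in S} X_i = \mathrm{lcm}_{i \in S}\{\mathrm{lp}_x(f_i)\}$,

•  $F_S = \{f_i \mid X_i \text{ divides } D_S\}$, and

•  $G_S = \{a_i \mid f_i \in F_S\}$.

Also let $S' = \{i \mid f_i \in F_S\}$. We say that $S$ is saturated if $S' = S$.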
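
The definition of a saturated subset translates directly into a small enumeration. The sketch below is illustrative only: power products are encoded as exponent tuples, the function names are hypothetical, and the sample input $x$, $xy$, $y^2$ is chosen for illustration.

```python
from itertools import combinations


def lcm(exponent_tuples):
    """Least common multiple of power products given as exponent tuples."""
    return tuple(max(column) for column in zip(*exponent_tuples))


def divides(a, b):
    """True when the power product with exponents a divides the one with exponents b."""
    return all(x <= y for x, y in zip(a, b))


def saturated_subsets(lead_pps):
    """Return [(S, D_S)] for every saturated S, with 1-based indices as in the text."""
    n = len(lead_pps)
    found = []
    for size in range(1, n + 1):
        for S in combinations(range(1, n + 1), size):
            D = lcm([lead_pps[i - 1] for i in S])
            # S' = indices of all f_i whose leading power product divides D_S
            S_prime = tuple(i for i in range(1, n + 1) if divides(lead_pps[i - 1], D))
            if S_prime == S:
                found.append((S, D))
    return found


# Leading power products x, x*y, y^2 as exponents in (x, y).
example = saturated_subsets([(1, 0), (1, 1), (0, 2)])
```

For this input the enumeration returns $\{1\}$, $\{3\}$, $\{1,2\}$ and $\{1,2,3\}$. A singleton that is saturated has a one-element $G_S$, which is already a Gröbner basis of the ideal it generates, so only the larger sets require further computation. Enumerating all $2^s - 1$ subsets is the naive approach and is only practical for small $s$.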

Theorem 4.2 $F$ is a Gröbner basis in $R[\mathbf{x}]$ with respect to $\Lambda = \Gamma \times \mathbb{Z}^n$ if
and only if

1. $F$ is a Gröbner basis in $R[\mathbf{x}]$ with respect to $\{0\} \times \mathbb{Z}^n$ and

2. For all saturated subsets $S$ of $\{1, \dots, s\}$, $G_S$ is a Gröbner basis in $R$
with respect to $\Gamma$.

The proof is given in [2].

The next theorem is the basis of the algorithm we will give in Section
5. It is easily deduced from Theorem 4.2 (see [2]). It shows that if we can
compute Gröbner bases in $R[\mathbf{x}]$ with respect to $\{0\} \times \mathbb{Z}^n$, and in $R$ with
respect to $\Gamma$, then we can compute Gröbner bases in $R[\mathbf{x}]$ with respect to
$\Gamma \times \mathbb{Z}^n$.

Theorem 4.3 Let $R$ be a $\Gamma$-graded ring, and $F = \{f_1, \dots, f_s\}$ be a Gröbner
basis in $R[\mathbf{x}]$ with respect to $\{0\} \times \mathbb{Z}^n$. For each saturated subset $S$ of
$\{1, \dots, s\}$ let $\{a_{S,1}, \dots, a_{S,t_S}\}$ be a Gröbner basis of $\langle G_S \rangle$ in $R$ with respect
to $\Gamma$. Write

$$a_{S,i} = \sum_{j \in S} b_{S,i,j}\, a_j$$

and define

$$f_{S,i} = \sum_{j \in S} b_{S,i,j}\, \frac{D_S}{X_j}\, f_j$$

for all $i = 1, \dots, t_S$. Then

$$\bigcup_{S\ \text{saturated}} \{f_{S,1}, \dots, f_{S,t_S}\}$$

is a Gröbner basis with respect to $\Lambda$.

The above theorem generalizes a result of Möller in [7]. There $\Gamma = 0$, and
$R$ is a PID and the polynomials $f_{S,i}$ are called T-polynomials.
5  A New Algorithm for Computing a Gröbner
Basis in $R[\mathbf{x}]$

Now based on Theorem 4.3 we give our new algorithm for computing a
Gröbner basis for an ideal in $R[\mathbf{x}]$ with respect to $\Gamma \times \mathbb{Z}^n$, which differs
from the usual one as given in Section 3. The algorithm in Section 3 will be
used to compute Gröbner bases in $R[\mathbf{x}]$ with respect to $\{0\} \times \mathbb{Z}^n$ and to
compute Gröbner bases in $R$ with respect to $\Gamma$. (We note that the computability
assumptions on $R$ made in Section 3 are inherited by $R[\mathbf{x}]$ for either of the
gradings.)

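INPUT: $g_1, \dots, g_t \in R[\mathbf{x}]$

OUTPUT: A Gröbner basis of $\langle g_1, \dots, g_t \rangle$ with respect to $\Gamma \times \mathbb{Z}^n$.

$F :=$ GradedGB($\{g_1, \dots, g_t\}$, $\{0\} \times \mathbb{Z}^n$)

Set $F = \{f_1, \dots, f_s\}$ and set $\mathrm{lp}_x(f_i) = X_i$ and $\mathrm{lc}_x(f_i) = a_i$.

For each saturated subset $S$ of $\{1, \dots, s\}$ do
    $G'_S :=$ GradedGB($G_S$, $\Gamma$)
    For each $a^* \in G'_S$ do

        Write $a^* = \sum_{i \in S} b_i a_i$

        Set $f^* = \sum_{i \in S} b_i \frac{D_S}{X_i} f_i$

        $F := F \cup \{f^*\}$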

If we specialize to the case where $R = k[\mathbf{y}]$ and the grading on $k[\mathbf{y}]$ is
given by a term order, then the algorithm above gives a new algorithm for
computing a Gröbner basis with respect to an elimination order in the
polynomial ring $k[\mathbf{y},\mathbf{x}]$. So we can compute a Gröbner basis for a polynomial
ring in $m + n$ variables by computing many Gröbner bases in $m$ variables
and one in $n$ variables. The algorithm for computing the one basis in $n$
variables relies on being able to compute syzygies over $k[\mathbf{y}]$, which may be done
by computing Gröbner bases in $m$ variables. (Alternatively, we could use
Möller's method of inductively computing the syzygies, cf. [7].) We could,
of course, apply this same idea to compute the Gröbner bases in $m$
variables, obtaining a recursive procedure. Since we are always computing with
a smaller number of variables, there is the hope that time could be saved in
this manner. However, an initial investigation with a naive implementation
in Maple was not too encouraging.
We now give a simple example of computing a Gröbner basis using our
algorithm. We consider the ring $\mathbb{Q}[x, y, z, t]$, where $\mathbb{Q}$ is the field of rational
numbers, with a lexicographic term ordering with $x > y > z > t$. Let
$f_1 = xt - xz^2 + yz$, and $f_2 = xyzt^2 + xy + y^2$. We use our algorithm
to compute a Gröbner basis for the ideal $I = \langle f_1, f_2 \rangle$. Let $R = \mathbb{Q}[z, t]$,
where $\Gamma = \mathbb{Z}^2$ with the ordering defined by lex with $z > t$. We adjoin the
variables $x$ and $y$ with the lex ordering with $x > y$. The first step in our
algorithm is to compute a Gröbner basis of $I$ with respect to $\{0\} \times \mathbb{Z}^2$ using
the algorithm of Section 3. We first need to compute a basis of the syzygy
module of $\mathrm{lt}(f_1) = x(t - z^2)$, $\mathrm{lt}(f_2) = xy(zt^2 + 1)$. This is trivially seen to be
$(y(zt^2 + 1), -(t - z^2))$. The corresponding S-polynomial is: $y(zt^2 + 1) f_1 - (t -
z^2) f_2 = y^2(z^2t^2 + z^2 + z - t)$. This polynomial is not reducible, so we call it $f_3$.
Now we need to compute the syzygies for $\mathrm{lt}(f_1), \mathrm{lt}(f_2), \mathrm{lt}(f_3) = f_3$, but it is
easily seen that the corresponding S-polynomials reduce to 0. The next step
in our algorithm is to compute the saturated sets where $\mathrm{lp}(f_1) = x$, $\mathrm{lp}(f_2) =
xy$, $\mathrm{lp}(f_3) = y^2$, which gives $S_1 = \{1, 2\}$ for $D_{S_1} = xy$ and $S_2 = \{1, 2, 3\}$ for
$D_{S_2} = xy^2$. $\langle G_{S_1} \rangle = \langle t - z^2, zt^2 + 1 \rangle$ has Gröbner basis $\{z + t^3, t^5 - 1\}$. Since
$z + t^3 = t^2(t - z^2) + z(zt^2 + 1)$, and $t^5 - 1 = t^4(t - z^2) + (zt^2 - 1)(zt^2 + 1)$, we
need to add the T-polynomials $f_4 = yt^2 f_1 + z f_2 = xyz + xyt^3 + y^2zt^2 + y^2z$
and $f_5 = t^4 y f_1 + (zt^2 - 1) f_2 = xyt^5 - xy + y^2zt^4 + y^2zt^2 - y^2$. Finally we
note that $\langle G_{S_2} \rangle = \langle t - z^2, zt^2 + 1, z^2t^2 + z^2 + z - t \rangle = \langle G_{S_1} \rangle$, so no new
T-polynomials are needed. Therefore $\{f_1, f_2, f_3, f_4, f_5\}$ is the desired Gröbner
basis of $I$. We note that if $f_2$ is eliminated the remaining polynomials form
a reduced Gröbner basis of $I$.
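
The polynomial identities used in this example can be checked with a few lines of exact integer arithmetic. The sketch below uses a hand-rolled dictionary representation with exponents ordered as $(x, y, z, t)$; the helper names `mono`, `add`, `mul` and `neg` are illustrative, and the sketch verifies only the stated identities, not the Gröbner basis property itself.

```python
from collections import defaultdict


def mono(c, x=0, y=0, z=0, t=0):
    """Single term c * x^x * y^y * z^z * t^t as a one-entry polynomial."""
    return {(x, y, z, t): c}


def add(*polys):
    r = defaultdict(int)
    for p in polys:
        for m, c in p.items():
            r[m] += c
    return {m: c for m, c in r.items() if c != 0}


def mul(p, q, *rest):
    r = defaultdict(int)
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            r[tuple(a + b for a, b in zip(m1, m2))] += c1 * c2
    product = {m: c for m, c in r.items() if c != 0}
    return mul(product, *rest) if rest else product


def neg(p):
    return {m: -c for m, c in p.items()}


f1 = add(mono(1, x=1, t=1), mono(-1, x=1, z=2), mono(1, y=1, z=1))
f2 = add(mono(1, x=1, y=1, z=1, t=2), mono(1, x=1, y=1), mono(1, y=2))
a1 = add(mono(1, t=1), mono(-1, z=2))          # lc_x(f1) = t - z^2
a2 = add(mono(1, z=1, t=2), mono(1))           # lc_x(f2) = z t^2 + 1
b = add(mono(1, z=1, t=2), mono(-1))           # z t^2 - 1

# S-polynomial y (z t^2 + 1) f1 - (t - z^2) f2
f3 = add(mul(mono(1, y=1), a2, f1), neg(mul(a1, f2)))

# Representations of the Groebner basis elements of <a1, a2>
g1 = add(mul(mono(1, t=2), a1), mul(mono(1, z=1), a2))   # should be z + t^3
g2 = add(mul(mono(1, t=4), a1), mul(b, a2))              # should be t^5 - 1

# T-polynomials for S_1 = {1, 2}, where D/X_1 = y and D/X_2 = 1
f4 = add(mul(mono(1, y=1, t=2), f1), mul(mono(1, z=1), f2))
f5 = add(mul(mono(1, y=1, t=4), f1), mul(b, f2))
```

Each computed polynomial can then be compared term by term against the expression printed in the example.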

As a final comment, we note that there is another concept of Gröbner
basis, the so-called strong Gröbner basis (as opposed to the concept we used
so far in this paper, which is sometimes referred to as a weak Gröbner basis).
Namely, if $R$ is a Unique Factorization Domain, and $I$ is an ideal of $R$, then
$f_1, \dots, f_s$ is a strong Gröbner basis for $I$ if for every $f \in I$, there exists a $j$
such that $\mathrm{lt}(f_j)$ divides $\mathrm{lt}(f)$. Using material in [2] we could have developed
an algorithm for this case similar to the one above.
Abstract

We summarize our results on the analysis of the long time behavior of a
numerical approximation of a nonlinear evolution problem, which are given in detail in [7]
and [8]. The time step scheme is derived using finite elements and is called the
continuous time Galerkin (CTG) method. It is implicit, of arbitrary order, and closely
related to the implicit Runge-Kutta (IRK) methods which are derived from
Gauss-Legendre integration formulas. The main theorem of this short note states that
the approximate solution to the evolution problem converges to a discrete
steady-state solution. This behavior is qualitatively correct since the true solutions of the
evolution problem also tend to static solutions.

Presented at the Tenth Army Conference on Applied Mathematics and Computing, West
Point, New York, 16-19 June 1992.

1.  Nonlinear  evolution  problem  from  viscoelasticity: 

We consider a model equation for the one-dimensional motion of a viscoelastic bar which
may undergo phase changes. We search for $U = U(x,t)$, which is the displacement of a
particle at time $t$ having position $x$ in some reference configuration, and which satisfies

$$
\begin{aligned}
U_{tt} &= (\sigma(U_x) + U_{xt})_x && \text{in } (0,1) \times (0,\infty),\\
U(0,t) &= 0 && \text{for } t > 0,\\
(\sigma(U_x) + U_{xt})\big|_{x=1} &= 0 && \text{for } t > 0,\\
U(x,0) = U_0(x) \ &\text{and}\ U_t(x,0) = V_0(x) && \text{in } (0,1),
\end{aligned}
\tag{1}
$$

’Partially  supported  by  the  Army  Research  Office  thru  grant  JH5.55-MA. 
where

$$
\sigma(s) =
\begin{cases}
s + 1 & \text{if } s < -1/2,\\
-s & \text{if } -1/2 \le s \le 1/2,\\
s - 1 & \text{if } s > 1/2.
\end{cases}
$$

The stress $\sigma$ is nonmonotone, allowing for two phases.

Multiplying the equation by $U_t$, integrating in $x$ and $t$, and using the boundary
conditions with integration by parts gives the energy equation

$$\mathcal{E}(t_2) + \int_0^1 \int_{t_1}^{t_2} U_{xt}^2 \, dt \, dx = \mathcal{E}(t_1) \tag{2}$$

where

$$\mathcal{E}(t) = \int_0^1 \left( \tfrac{1}{2} U_t^2 + \psi(U_x) \right) dx$$

and $\psi$ is a double-well potential

$$\psi(s) = \int_0^s \sigma(r) \, dr + \frac{1}{4} \ge 0$$

and $0 < t_1 < t_2$. This energy relation is crucial to the analysis of the long-time behaviour
of $U$. The steady state equation is

$$(\sigma(U_x))_x = 0 \ \text{ on } (0,1), \qquad U(0) = \sigma(U_x(1)) = 0. \tag{3}$$

Functions $U$ that satisfy the boundary conditions and have a derivative $U_x$ that takes
on the values $\pm 1$ or $0$ are weak solutions of (3).
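
The shape of the stress and of the double-well potential can be checked numerically. The sketch below assumes the piecewise-linear $\sigma$ given above and the additive constant $1/4$ in $\psi$; the closed form in `psi` is derived here from that definition and is not quoted from the paper, and the function names are illustrative.

```python
def sigma(s):
    """Nonmonotone piecewise-linear stress."""
    if s < -0.5:
        return s + 1.0
    if s <= 0.5:
        return -s
    return s - 1.0


def psi(s):
    """Closed form of integral_0^s sigma(r) dr + 1/4 (derived under the stated assumptions)."""
    a = abs(s)
    if a <= 0.5:
        return 0.25 - 0.5 * s * s
    return 0.5 * (a - 1.0) ** 2


def psi_quadrature(s, n=20000):
    """Midpoint-rule approximation of the same integral, used as an independent check."""
    h = s / n
    return 0.25 + sum(sigma((i + 0.5) * h) for i in range(n)) * h


wells = [s for s in (-1.0, 0.0, 1.0) if sigma(s) == 0.0]   # zeros of the stress
grid = [-2.0 + 0.01 * i for i in range(401)]
minimum = min(psi(s) for s in grid)
```

The zeros of `sigma` at $-1$, $0$, $1$ are the admissible steady-state strains, and `psi` vanishes exactly at the two wells $s = \pm 1$, which is why the constant $1/4$ makes the potential nonnegative.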

Pego [12] summarizes previous results on (1) and similar equations, including studies
done on the asymptotic behavior of $U$ as $t \to \infty$. In the introduction he discusses
the results of Dafermos [5], Andrews and Ball [2] and his own. Briefly, they state that
problem  (1)  has  a  unique  global  weak  solution.  Ut  —  0,  (  —  and  er(  )  =  0. 

2.  Continuous  time  Galerkin  methods: 

In this section we introduce the CTG scheme for problem (1). For simplicity of
presentation we discretize space and time with uniform meshes,

$$0 = x_0 < x_1 < \dots < x_M = 1, \quad\text{and}\quad t_0 < t_1 < t_2 < \dots$$

where $I_n = (x_{n-1}, x_n)$, $h = x_n - x_{n-1}$, $J_n = (t_{n-1}, t_n)$, $k = t_n - t_{n-1}$, and
$Q_n = (0,1) \times J_n$.




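The finite element space for the variable $x$ is

$$S_h = \{\chi \in C([0,1]) : \chi|_{I_n} \in P_1(I_n),\ n = 1, \dots, M, \text{ with } \chi(0) = 0\}.$$

$P_q(J)$ is the set of all polynomials of degree $\le q$ on the interval $J$. For the variable $t$
we will use the space

$$V_k = \{\tau \in C([0,\infty)) : \tau|_{J_n} \in P_r(J_n),\ n = 1, 2, \dots\}$$

where $r$ is also a positive integer. Letting $V_{hk} = S_h \otimes V_k$, our method is as follows:

Find $(u, v) \in V_{hk} \times V_{hk}$ such that

$$
\begin{aligned}
((u_t - v, \chi))_n &= 0 && \forall \chi \in S_h \otimes P_{r-1}(J_n),\\
((v_t, \lambda))_n + ((u_{xt} + \sigma(u_x), \lambda_x))_n &= 0 && \forall \lambda \in S_h \otimes P_{r-1}(J_n),
\end{aligned}
\tag{4}
$$

where $n = 0, 1, 2, 3, \dots$, $u(\cdot,0) = u_0 \approx U_0$ and $v(\cdot,0) = v_0 \approx V_0$.
The inner product is

$$((v, w))_n = \int_{J_n} \int_0^1 v w \, dx \, dt.$$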

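Letting $\chi = v_t$ and $\lambda = u_t$ we obtain a discrete version of (2),

$$\mathcal{E}^n + \int_{J_n} \int_0^1 u_{xt}^2 \, dx \, dt = \mathcal{E}^{n-1} \tag{5}$$

where the sum of the kinetic and stored strain energy at time $t_n$ is

$$\mathcal{E}^n = \int_0^1 \left( \tfrac{1}{2} v(\cdot, t_n)^2 + \psi(u_x(\cdot, t_n)) \right) dx.$$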
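
The step from (4) to (5) can be spelled out as follows. This is a sketch of the standard argument, assuming the test functions $\chi = v_t$ and $\lambda = u_t$ (both lie in $S_h \otimes P_{r-1}(J_n)$, since $u$ and $v$ are piecewise polynomials of degree $r$ in $t$) and using $\psi' = \sigma$.

```latex
\begin{align*}
% First equation of (4) with \chi = v_t:
((u_t, v_t))_n &= ((v, v_t))_n
  = \tfrac{1}{2}\int_0^1 \bigl[ v(x,t_n)^2 - v(x,t_{n-1})^2 \bigr]\,dx, \\
% Second equation of (4) with \lambda = u_t:
((v_t, u_t))_n + ((u_{xt}, u_{xt}))_n + ((\sigma(u_x), u_{xt}))_n &= 0, \\
% Chain rule, since \psi' = \sigma:
((\sigma(u_x), u_{xt}))_n
  &= \int_0^1 \int_{J_n} \partial_t\, \psi(u_x)\,dt\,dx
  = \int_0^1 \bigl[ \psi(u_x(x,t_n)) - \psi(u_x(x,t_{n-1})) \bigr]\,dx.
\end{align*}
% Substituting the first and third lines into the second gives
% \mathcal{E}^n - \mathcal{E}^{n-1} + \int_{J_n}\int_0^1 u_{xt}^2\,dx\,dt = 0,
% which is (5).
```

Because the dissipation term is nonnegative, the discrete energy is nonincreasing in $n$, mirroring the continuous relation (2).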

CTG methods were first introduced in the context of ordinary differential equations
by Hulme [10]. They are closely related to IRK schemes based on Gauss-Legendre
quadrature rules (see [10]). In most applications we expect a $k^{2r}$ rate of convergence
for the IRK and CTG schemes. See Akrivis and Dougalis [1]; Baker, Dougalis and
Karakashian [4]; and McKinney [11] for more on the IRK schemes. See [9] for more
references and discussion of energy preserving schemes.

The  following  result  is  proved  in  [8]: 

THEOREM 1: Problem (4) has a unique solution for $k < 1$.

3.  Long time behavior:

The key to our analysis of $u(\cdot, t)$ as $t \to \infty$ is the following theorem, which was originally
stated and proved in Elliott [6].

Let

$$Z_h = \{z \in S_h : (\sigma(z_x), \phi_x) = 0 \ \ \forall \phi \in S_h\},$$

$u^n = u(\cdot, t_n)$, and $v^n = v(\cdot, t_n)$.

THEOREM  2:  Suppose 

|||(«™,u™)|||2  =  \KWlw  +  ||u™||i2(0il)  <  A',  (6) 

$$\lim_{n \to \infty} (\sigma(u^n_x), \phi_x) = 0 \quad \forall \phi \in S_h, \tag{7}$$

$$\lim_{n \to \infty} (v^n, \chi) = 0 \quad \forall \chi \in S_h, \tag{8}$$

||(*i  -  *2)*IIl2(0.i)  >  A  >  0  Z2  €  Zh.  (9) 

and  for  each  z  €  Zh, 

H|( it™  —  t?n)|||  <  /t|||(  un~i  -r.r"“l)|||  (10) 

where $\mu > 1$ is a constant. Then

$$v^n \to 0 \ \text{ as } n \to \infty, \tag{11}$$

$$u^n \to z \in Z_h \ \text{ as } n \to \infty. \tag{12}$$

A complete proof of this theorem is given in [8]. The verification of the hypotheses
(6)-(10), so that the desired results (11)-(12) hold, is also given in [8] and is summarized here.
The boundedness property (6) follows from the discrete energy relation (5). The fact
that $u_{xt} \to 0$ also follows from (5) since

$$\int_0^\infty \int_0^1 u_{xt}^2 \, dx \, dt \le \mathcal{E}^0.$$

From this one can prove (7) and (8). The space of steady state solutions can be identified
explicitly,

$$Z_h = \{z \in S_h : z_x|_{I_n} \in \{-1, 0, 1\}\}$$

and then (9) follows. Property (10) follows by an energy argument.
4.  Conclusions:


By using finite elements to discretize time for a nonlinear evolution problem from
viscoelasticity, a numerical method which preserves (discretely) the energy of the original
problem is obtained. With this property it is shown that the numerical approximations
tend to discrete steady states as $t \to \infty$, just as the true solutions to the evolution
problem tend to continuous steady states.
CONDENSATION  OF  THREE-DIMENSIONAL  FINITE  ELEMENTS
TO  SOLVE  PROBLEMS  OF  WAVE  PROPAGATION

ABSTRACT

The  solution  of  some  three-dimensional  (3-D)  wave  propagation  problems 
can  be  achieved  effectively  using  a  two-dimensional  (2-D)  finite  element 
formulation  not  involving  assumptions  of  plane  strain.  This  formulation  has 
been  adopted  into  a  new  computer  code  to  rapidly  solve  a  broader  class  of  wave 
propagation  problems  using  2-D  methods  without  loss  of  accuracy  thus  resulting 
in  appreciable  savings  in  computer  time. 

The formulation of the solution method involves two primary components:
the condensation of the dynamic stiffness matrices to produce an equivalent
2-D system and the representation of the distribution of loads in the
out-of-plane direction using a Fourier expansion. The 2-D mesh is solved in the
frequency and wave-number domain and then inverse Fourier transforms are
performed to obtain dynamic displacements at any location.

The purpose of this presentation is to describe the general
formulation of the solution method, present some results of validation studies
and parametric analyses, and make comparisons of computational effort on the
US Army CRAY Y-MP between the new 2-D code and a commercial 3-D solution
package.

INTRODUCTION 

This  paper  summarizes  a  method  to  estimate  the  variation  of 
displacements  in  space  and  time  produced  by  dynamic  loads  in  complex  isotropic 
media,  consisting  of  dipping,  discontinuous,  and/or  irregular  layers,  using  a 
numerical  approximation  method  (Sykora  and  Roesset,  1992).  The  distinguishing 
feature  of  this  work  is  that  the  formulation  allows  three-dimensional  (3-D) 
problems  to  be  solved  using  a  two-dimensional  (2-D)  numerical  model.  To 
implement  this  method,  the  stratigraphy  and  material  properties  of  the  model 
cannot  vary  in  a  horizontal  direction  (2-D  stratigraphy).  However,  the 
distribution  and  extent  of  loads  may  vary  in  both  horizontal  directions  (3-D 
load)  providing  for  the  analysis  of  synthetic  vibratory  sources  such  as  a 
Vibroseis  truck.  These  types  of  problems  cannot  be  solved  analytically  but 
normally  would  be  solved  using  a  laborious  3-D  numerical  approximation. 

The  finite  element  method  was  selected  for  computational  solution  to 
permit  discretization  of  geologic  systems  with  numerous  materials  of  arbitrary 
geometry. The formulation involves two primary components: the condensation
of 3-D dynamic stiffness matrices to equivalent 2-D matrices and the
representation of the distribution of loads in the out-of-plane direction
using a Fourier expansion. This strategy was explicitly proposed for
axisymmetric problems by Winnicki and Zienkiewicz (1979) and Lai and Booker
(1991) and for 3-D formulations by Runesson and Booker (1982, 1983) and Lin
and Tassoulas (1987). This strategy was used specifically for wave
propagation studies in horizontally layered pavement systems by Kang (1990)
and Hanazato et al. (1991). The 2-D system of equations is first solved in
the frequency and wave-number domain; inverse Fourier transforms are then
performed to obtain the solution as a function of out-of-plane distance and
time, if so desired.

One  objective  of  this  study  is  to  examine  the  potential  for  determining 
elastic  moduli  of  materials  in  complex  systems  of  soil,  rock,  and  structural 
materials  from  measured  motions.  The  Spectral-Analysis-of-Surface-Waves 
(SASW)  method  (Nazarian  and  Stokoe  1985a,  1985b)  is  an  existing  method  of 
field  measurement  and  mathematical  inversion  to  determine  the  moduli  of 
horizontally  layered  systems.  This  method  involves  the  use  of  signal 
processing  techniques  on  two  measured  vertical  components  of  motion  spaced  at 
equal  increments  from  the  vibratory  source.  A  similar  procedure  of 
determination  is  desired  for  more  complex  systems.  In  addition,  the  use  of 
artificial  neural  networks  holds  promise  to  improve  inversion  schemes  (Rix  and 
Leipski,  1991).  Therefore,  Rayleigh  wave  propagation  will  be  of  primary 
interest.  Rayleigh  waves  normally  contain  most  of  the  energy  of  wave 
propagation  for  the  near  surface  regime  and  Rayleigh  wave  energy  will 
attenuate  with  distance  at  a  much  lower  rate  than  body  waves.  The  response  at 
the  ground  surface  is  normally  of  greatest  interest  since  it  provides  the 
easiest access for measurements.

Assumptions  for  Two-Dimensional  Systems 

A  common  assumption  used  to  reduce  the  computational  effort  for  the 
engineering  analysis  of  stress  and  strain  in  boundary  value  problems  of 
interest  for  geotechnical  engineering  applications  is  that  of  plane  strain. 
Plane  strain  implies  that  the  displacements  in  the  direction  perpendicular  to 
a  two-dimensional  plane  are  equal  to  zero  (Love,  1944).  This  assumption 
reduces  the  analysis  of  a  problem  from  3-D  to  2-D.  Conditions  of  plane  strain 
require  2-D  geometry  and  boundary  conditions  and  loads  that  are  uniform  in  the 
direction  perpendicular  to  the  plane  under  consideration  (Timoshenko  and 
Goodier,  1970).  A  plane  wave  with  particle  motion  only  in  the  2-D  analysis 
plane  is  consistent  with  this  assumption. 

A  surface  load  distributed  over  a  finite  area  induces  stresses  that  vary 
in  three  principal  directions.  If  stresses  vary  in  a  direction  perpendicular 
to the analysis plane, displacements and strains in that direction will be non-zero. Therefore,
3-D  loads  are  inconsistent  with  plane  strain  assumptions.  This  study  deals 
with  the  analysis  of  "planar"  geosystems  which  proves  to  be  beneficial  from  a 
computational  standpoint.  The  primary  assumptions  are  that  the  geometry  and 
boundary  conditions  of  the  system  and  the  distribution  of  material  properties 
are  planar  (2-D)  but  the  loads  are  non-planar  (3-D).  This  set  of  conditions 
has  a  broader  range  of  applications  than  that  for  plane  strain  while 
circumventing expensive 3-D solution methods.

PREVIOUS STUDIES


The  evolution  of  the  state  of  knowledge  for  dynamic  loads  acting  on 
elastic  media,  particularly  that  involving  Rayleigh  waves,  was  reviewed  to 
provide  insight  into  which  problems  have  been  solved,  what  approaches  were 
used,  what  conclusions  have  been  reached,  and  which  studies  provide  a  proper 
basis  for  validation  of  the  present  formulation.  Despite  a  large  number  of 
papers  on  the  subject,  few  are  considered  to  be  useful  for  comparative 
purposes  with  this  study  because: 

a. Almost all studies consider plane wave propagation,

b. Many of the studies consider only R-wave energy (do not include
in-plane P-SV waves),

c. Experimental studies generally focused on "thin plate" tests which
have plane stress boundary conditions, which are generally
incompatible with assumptions for this study, and

d. Plane strain conditions are generally assumed for theoretical and
numerical studies.

The exact solutions used for validation of the computer program were published
by Kausel (1981) for point, disk, or ring loads acting on axi-symmetric
systems. These solutions were derived using discrete Green's functions
evaluated numerically and are excellent approximations.

Soil dynamics studies conducted in the 1950s and 1960s using finite
difference and finite element methods, and in the 1970s and 1980s using
Green's functions and boundary element models, generally assumed plane
harmonic waves and horizontally layered media extending to infinity. The
subsurface distribution of materials at most sites is neither simple nor
conducive to analytical closed-form solutions of wave propagation problems.
Sloping strata of finite length, an irregular ground surface, and/or
two-dimensional load distributions are prevalent. The present study describes a
procedure to analyze wave propagation in these more complex systems without
assuming plane strain conditions.

MATHEMATICAL  FORMULATION 

The mathematical formulation is based on simple principles of
elastodynamics, superposition, Fourier series expansion, and numerical
discretization and solution procedures using the finite element method. The set of
assumptions is intended to be small, to broaden the class of problems that can
be solved. The primary assumption required for the condensation method
described herein is that the geometry of the system and material properties
are planar (do not vary in some horizontal direction). A number of other
assumptions were used to derive the first generation computer code, vib3:

a. Media are isotropic,

b. Hysteretic behavior is represented by complex moduli relation,

c. Source produces vertical, steady-state excitation at one
frequency,

d. Base is rigid, and

e. Distribution of loads is symmetric about y-axis.

These assumptions are not necessary and some will be phased out in future
versions of the code. In addition, the computer code does not allow for
transmitting boundaries in the 2-D analysis plane. Rather, the domain must be
discretized to include enough area for the motions to attenuate sufficiently
before being reflected back to the area of interest.

Field  Equations 

Two  primary  sets  of  variables  adequately  describe  the  effect  of  forces 
acting  on  linear  systems  --  stresses  and  displacements.  These  variables  exist 
in  the  following  field  equations:  stress  equilibrium,  strain-displacement, 
and  constitutive  equations.  These  three  sets  of  equations  are  combined  in 
terms  of  displacements  to  derive  the  governing  equations  for  the  problem. 

Wave  propagation  involves  the  effects  of  inertia  and  deformation  of  the  media. 
The  effects  of  inertia  result  from  masses  being  accelerated.  The  derivations 
below  apply  to  isotropic  materials. 

Stress  equilibrium  equations.  The  summation  of  stresses  acting  on  a 
small  rectangular  parallelepiped  in  three-dimensional  Cartesian  space  and 
Newton's  second  law  of  motion  neglecting  body  forces  are  used  to  derive  the 
stress  equilibrium  equations.  The  equations  of  motion  using  indicial  notation 
and  the  soil  mechanics  convention  of  compressive  forces  as  positive  and 
accounting  for  the  symmetry  of  the  Cauchy  stress  tensor  are: 


$$\sigma_{ij,j} = -\rho\,\ddot{u}_i \tag{1}$$

where

$\sigma_{ij}$ - stress components [F/L^2]
$\rho$ - mass density [F-s^2/L^4]
$\ddot{(\;)}$ - $\partial^2/\partial t^2$ [1/s^2]

Strain-displacement  equations.  The  strain-displacement  (compatibility) 
equations  are  derived  from  small  strain  theory.  The  equations  for  a 
displacement field, $u_i$, are:

$$\varepsilon_{ij} = \tfrac{1}{2}\left(u_{i,j} + u_{j,i}\right) \tag{2}$$

and  are  often  referred  to  as  engineering  measures  of  strain. 

Constitutive  equations.  The  constitutive  equations  provide  the  means  to 
relate  stress  and  strain;  they  define  the  deformability  of  the  material. 
Individual  material  layers  are  assumed  to  be  homogeneous,  isotropic,  and 
visco-elastic.  To  begin  the  formulation  of  constitutive  relations,  consider 
the  simplest  case  of  linear  elasticity  proposed  by  Hooke.  For  homogeneous  and 
isotropic  conditions: 


$$\sigma_{ij} = \frac{2G\nu}{1-2\nu}\,\varepsilon_{kk}\,\delta_{ij} + 2G\,\varepsilon_{ij} \tag{3}$$

where

$G$ - shear modulus
$\nu$ - Poisson's ratio, and
$\delta_{ij}$ is the Kronecker delta: $\delta_{ij} = 0$ if $i \neq j$; $\delta_{ij} = 1$ if $i = j$

Soil is an inelastic material -- energy dissipates from friction as
waves travel through it. This phenomenon is called material damping and
mathematical models are used to approximate it in governing equations. One
form of damping, called hysteretic, is independent of the frequency of
excitation. This form of damping can be introduced into the formulation for
frequency-domain analyses through the Correspondence Principle (Wolf 1985).
This principle states that the elastic stiffness (in this case shear modulus)
is replaced by a complex stiffness to obtain the damped solution. The results
of this study are expected to be applied at distances greater than one
wavelength from the source (e.g., Nazarian and Stokoe, 1985a; Kang, 1990)
where shear strains from synthetic sources are small. The following
relationship is commonly used to model linear-hysteretic behavior for small
shear strains (and small values of damping):


$$G^* = G\,(1 + 2i\beta) \tag{4}$$

where

$G^*$ is the complex shear modulus
$\beta$ is the damping ratio [-]
$i = \sqrt{-1}$

The magnitude of damping is considered to be independent of strain (Hardin and
Drnevich 1972; Johnston, Toksoz, and Timur 1979; and Toksoz, Johnston, and
Timur 1979) for the levels of shear strains expected.

Equations  of  Equilibrium 

The  three  sets  of  field  equations  are  combined  to  obtain  the  governing 
equations.  A  stiffness  formulation  was  chosen,  that  is,  a  relation  in  terms 
of displacements (also referred to as the displacement approach). These equations
are  associated  with  Navier  and  can  be  derived  by  substituting  the  strain- 
displacement  equations  into  the  constitutive  equations,  then,  substituting  the 
resulting  equations  into  the  equilibrium  equations.  Assuming  that  the  body 
forces  are  zero  and  applying  Newton's  second  law,  the  result  is: 


$$\frac{G}{1-2\nu}\,u_{j,ji} + G\,u_{i,jj} = -\rho\,\ddot{u}_i \tag{5}$$


These  are  the  partial  differential  equations  that  govern  wave  propagation  in 
three-dimensional  Cartesian  space  for  homogeneous,  isotropic  materials  with  no 
body  forces.  The  partial  differential  equation  is  classified  as  hyperbolic 
leading  to  an  initial  value  problem. 


Finite  Element  Method  in  Three-Dimensional  Cartesian  Space 

The  finite  element  method  is  a  numerical  analysis  technique  used  to 
approximate  the  response  of  a  continuous  body  by  dividing  the  domain  of 
interest  into  a  discrete  number  of  subdomains.  Boundary  conditions  and 
external  forces  are  imposed  at  discrete  nodes  where  the  displacements  are 
calculated.  Results  can  be  obtained  at  any  point  in  the  body  through  the  use 
of  interpolation  functions.  In  general,  as  the  subdomains  become  smaller,  the 
solution  converges  to  that  of  the  continuum.  Many  textbooks  describing  the 
finite  element  method  are  available  with  different  sets  of  notation.  The 
notation below closely follows that used by Zienkiewicz and Taylor (1989).

There are two basic approaches to formulating a problem using the finite
element  method:  the  (direct)  displacement  method  and  the  variational  method. 
The  displacement  method  is  the  most  popular  and  most  easily  understood 
procedure  (Zienkiewicz  and  Taylor,  1989)  and  was  selected  for  this  study.  The 
displacement  method  can  be  easily  used  with  Fourier  superposition  analysis  in 
the  frequency  domain  for  the  solution  of  elastodynamic  problems. 

Displacement  method.  Displacements  are  specified  as  the  unknowns  for 
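the displacement method. Letting $\mathbf{u}$ represent the vector of displacements at
any point and $\mathbf{U}$ the vector of displacements at the nodes of a finite element:

$$\mathbf{u} = \mathbf{N}\,\mathbf{U} \tag{6}$$

where $\mathbf{N}$ is the matrix of interpolation functions. The strains at any point
can be represented as:

$$\boldsymbol{\varepsilon} = \mathbf{L}\,\mathbf{u} \tag{7}$$

where

$$\boldsymbol{\varepsilon}^T = \{\varepsilon_x \;\; \varepsilon_y \;\; \varepsilon_z \;\; \gamma_{xy} \;\; \gamma_{yz} \;\; \gamma_{zx}\} \tag{8}$$

and

$$\mathbf{L}^T = \begin{bmatrix}
\dfrac{\partial}{\partial x} & 0 & 0 & \dfrac{\partial}{\partial y} & 0 & \dfrac{\partial}{\partial z} \\
0 & \dfrac{\partial}{\partial y} & 0 & \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial z} & 0 \\
0 & 0 & \dfrac{\partial}{\partial z} & 0 & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial x}
\end{bmatrix} \tag{9}$$

Then,

$$\boldsymbol{\varepsilon} = \mathbf{L}\,\mathbf{u} = \mathbf{L}\,\mathbf{N}\,\mathbf{U} = \mathbf{B}\,\mathbf{U} \tag{10}$$

where $\mathbf{B}$ is a matrix containing the corresponding derivatives of the
interpolation functions.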

The  Correspondence  Principle  allows  the  constitutive  model  to  represent 
hysteretic  behavior  using  complex  moduli  for  solutions  in  the  frequency 
domain.  Superposition  is  valid  because  of  this  linear  representation.  A 
frequency domain solution implies that the excitation function must be
periodic. Calling $\mathbf{D}$ the complex constitutive matrix of the material:

$$\mathbf{D} = \frac{2G^*}{1-2\nu}
\begin{bmatrix}
1-\nu & \nu & \nu & 0 & 0 & 0 \\
\nu & 1-\nu & \nu & 0 & 0 & 0 \\
\nu & \nu & 1-\nu & 0 & 0 & 0 \\
0 & 0 & 0 & \dfrac{1-2\nu}{2} & 0 & 0 \\
0 & 0 & 0 & 0 & \dfrac{1-2\nu}{2} & 0 \\
0 & 0 & 0 & 0 & 0 & \dfrac{1-2\nu}{2}
\end{bmatrix} \tag{11}$$
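
As an illustration only, Equations 4 and 11 can be assembled numerically as in the sketch below. It is not taken from vib3; the function name `constitutive_matrix`, the sample values of G, nu, and beta, and the assumed strain ordering (x, y, z, xy, yz, zx with engineering shear strains) are choices made for this example.

```python
import numpy as np

def constitutive_matrix(G, nu, beta=0.0):
    """Complex isotropic constitutive matrix D (6x6) in the form of Equation 11.

    G    : real shear modulus
    nu   : Poisson's ratio
    beta : hysteretic damping ratio (Equation 4); beta = 0 gives the elastic case
    Assumed strain ordering: (x, y, z, xy, yz, zx), engineering shear strains.
    """
    G_star = G * (1.0 + 2.0j * beta)               # complex shear modulus, Eq. 4
    D = np.zeros((6, 6), dtype=complex)
    D[:3, :3] = nu                                  # coupling between normal strains
    D[np.arange(3), np.arange(3)] = 1.0 - nu        # normal diagonal terms
    D[np.arange(3, 6), np.arange(3, 6)] = (1.0 - 2.0 * nu) / 2.0  # shear terms
    return 2.0 * G_star / (1.0 - 2.0 * nu) * D

# Hypothetical material values, for illustration only
D = constitutive_matrix(G=1.0e6, nu=0.3, beta=0.02)
```

With this scaling the shear entries reduce to G*, so damping enters every term of D through the single complex modulus.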


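The stress vector at any point is:

$$\boldsymbol{\sigma} = \mathbf{D}\,\boldsymbol{\varepsilon} \tag{12}$$

with:

$$\boldsymbol{\sigma}^T = \{\sigma_x \;\; \sigma_y \;\; \sigma_z \;\; \tau_{xy} \;\; \tau_{yz} \;\; \tau_{zx}\} \tag{13}$$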


Applying the principle of virtual work and making use of the above
relations, the equations of motion become:

$$\mathbf{M}\,\ddot{\mathbf{U}} + \mathbf{K}\,\mathbf{U} = \mathbf{P} \tag{14}$$

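where $\mathbf{M}$ is the mass matrix defined by:

$$\mathbf{M} = \int_V \rho\,\mathbf{N}^T\,\mathbf{N}\;dV \tag{15}$$

where

$\rho$ - mass density

and $\mathbf{K}$ is the (static) stiffness matrix defined by:

$$\mathbf{K} = \int_V \mathbf{B}^T\,\mathbf{D}\,\mathbf{B}\;dV \tag{16}$$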

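The relationships for nodal acceleration, $\ddot{\mathbf{U}}$, and displacement, $\mathbf{U}$, are
derived by imposing the steady state condition. Considering the load vector:

$$\mathbf{P} = \bar{\mathbf{P}}\,e^{i\omega t} \tag{17}$$

where

$\bar{\mathbf{P}}$ - vector of amplitudes of nodal forces
$\omega$ - frequency of excitation (rad/sec)

Then the displacement vector, $\mathbf{U}$, can be written as:

$$\mathbf{U} = \bar{\mathbf{U}}\,e^{i\omega t} \tag{18}$$

where

$\bar{\mathbf{U}}$ - vector of amplitudes of nodal displacements

and the velocity and acceleration vectors are:

$$\dot{\mathbf{U}} = i\,\omega\,\bar{\mathbf{U}}\,e^{i\omega t} \tag{19}$$

$$\ddot{\mathbf{U}} = -\omega^2\,\bar{\mathbf{U}}\,e^{i\omega t} \tag{20}$$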

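By substituting Equations 18 and 20 into Equation 14 and canceling the
exponential term, the equations of motion are:

$$(\mathbf{K} - \omega^2\,\mathbf{M})\,\bar{\mathbf{U}} = \mathbf{S}\,\bar{\mathbf{U}} = \bar{\mathbf{P}} \tag{21}$$

where $\mathbf{S}$ is the dynamic stiffness matrix of the system defined by:

$$\mathbf{S} = \mathbf{K} - \omega^2\,\mathbf{M} \tag{22}$$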

The  dynamic  stiffness  matrix  is  complex  and  a  function  of  frequency. 

Equation  21  can  be  solved  using  matrix  operations  incorporated  in  various 
solution  algorithms  ("solvers"). 
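
A minimal sketch of the solution step in Equations 21 and 22 is given below, assuming NumPy's dense solver stands in for whatever solver the code actually uses. The two-degree-of-freedom spring-mass system and all numerical values are hypothetical and serve only to show the complex, frequency-dependent character of S.

```python
import numpy as np

def steady_state_amplitudes(K, M, P, omega):
    """Solve (K - omega^2 M) U = P for the complex nodal amplitudes (Eqs. 21-22)."""
    S = K - omega**2 * M              # dynamic stiffness matrix, Eq. 22
    return np.linalg.solve(S, P)

# Hypothetical 2-DOF spring-mass chain fixed at one end (illustration only)
k, mass, beta = 100.0, 1.0, 0.02
K = k * (1 + 2j * beta) * np.array([[2.0, -1.0], [-1.0, 1.0]])  # complex stiffness
M = mass * np.eye(2)
P = np.array([0.0, 1.0], dtype=complex)   # unit load amplitude on the free DOF
omega = 2.0 * np.pi * 3.0                  # 3 Hz expressed in rad/s

U = steady_state_amplitudes(K, M, P, omega)
```

The imaginary parts of U come entirely from the complex stiffness; with beta = 0 the amplitudes would be real.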

The  formulation  to  this  point  is  specific  to  steady-state,  frequency- 
domain  analyses  for  homogeneous  and  isotropic  materials.  The  formulation  is 
applicable  to  analyses  in  one,  two,  and  three  dimensions  and  any  element 
configuration.  Henceforth,  the  formulation  will  be  specific  to  the  remaining 
assumptions  and  requirements  of  this  study. 

3-D  finite  element.  A  3-D,  isoparametric,  finite  element  with  16  nodes 
was  chosen  to  implement  the  condensation  formulation  described  in  the  next 
section.  Each  node  has  three  degrees  of  freedom.  The  element  has  quadratic 
interpolation  in  the  analysis  plane  (x  and  z)  and  linear  interpolation  in  the 
out-of-plane direction (y). Equations 15 and 16 can now be stated in more
specific terms using the transformed space:

$$\mathbf{M} = \int_{-1}^{1}\int_{-1}^{1}\int_{-1}^{1} \rho\,\mathbf{N}^T\,\mathbf{N}\,|\mathbf{J}|\;d\xi\,d\eta\,d\zeta \tag{23}$$

$$\mathbf{K} = \int_{-1}^{1}\int_{-1}^{1}\int_{-1}^{1} \mathbf{B}^T\,\mathbf{D}\,\mathbf{B}\,|\mathbf{J}|\;d\xi\,d\eta\,d\zeta \tag{24}$$

where $|\mathbf{J}|$ is the determinant of the Jacobian matrix for the 3-D finite
element and $\xi$, $\eta$, and $\zeta$ represent coordinates in isoparametric space.
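
The integration in transformed space can be illustrated with a one-dimensional analogue of Equation 23. The sketch below is an assumption-laden example, not the 16-node element itself: it integrates the mass matrix of a 2-node linear element of length dy (the interpolation used in the out-of-plane direction) by Gauss-Legendre quadrature. The function name and input values are hypothetical.

```python
import numpy as np

def bar_mass_matrix(rho, dy, n_gauss=2):
    """Consistent mass matrix of a 2-node linear element of length dy,
    integrated over the parent coordinate zeta in [-1, 1]
    (a 1-D analogue of Equation 23)."""
    pts, wts = np.polynomial.legendre.leggauss(n_gauss)
    det_j = dy / 2.0                                   # Jacobian of y(zeta)
    M = np.zeros((2, 2))
    for zeta, w in zip(pts, wts):
        # 1x2 row of linear interpolation functions at this Gauss point
        N = np.array([[(1.0 - zeta) / 2.0, (1.0 + zeta) / 2.0]])
        M += w * rho * (N.T @ N) * det_j
    return M

M_bar = bar_mass_matrix(rho=2.0, dy=3.0)
```

Two Gauss points integrate the quadratic integrand exactly, so the result equals the closed form rho*dy/6 times [[2, 1], [1, 2]].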

Fourier superposition. Fourier superposition is a three-step solution
process for linear systems that involves a forward transformation into a
wavenumber domain, the calculation of a solution to Equation 21 at a number of
increments, and the determination of the total solution through an inverse
transformation of all incremental solutions. A time-temporal frequency
transform pair of a load function p is:

$$p(\omega) = \int_{-\infty}^{\infty} p(t)\,e^{-i\omega t}\,dt \tag{25}$$

$$p(t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} p(\omega)\,e^{i\omega t}\,d\omega \tag{26}$$

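Similarly, the distance-spatial frequency (wavenumber) transform pair for
expansion in the y-direction is:

$$p(m) = \int_{-\infty}^{\infty} p(y)\,e^{imy}\,dy \tag{27}$$

$$p(y) = \frac{1}{2\pi}\int_{-\infty}^{\infty} p(m)\,e^{-imy}\,dm \tag{28}$$

where

$m$ - wavenumber (spatial circular frequency) in y-direction

Fourier superposition applied in both the time and y-spatial domains leads to:

$$p(m,\omega) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p(y,t)\,e^{-i(\omega t - my)}\,dt\,dy \tag{29}$$

$$p(y,t) = \frac{1}{4\pi^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} p(m,\omega)\,e^{i(\omega t - my)}\,d\omega\,dm \tag{30}$$

The corresponding transformation equation for displacements is:

$$u(y,t) = \frac{1}{4\pi^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} u(m,\omega)\,e^{i(\omega t - my)}\,d\omega\,dm \tag{31}$$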
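
The sign convention of the wavenumber pair (Equations 27 and 28) can be checked numerically. The sketch below assumes a uniform strip load of unit resultant and half-width b, for which the forward transform has the closed form sin(mb)/(mb); the helper names, the value of b, and the truncation of the wavenumber range are all choices made for this illustration.

```python
import numpy as np

def trapezoid(f, x):
    """Trapezoidal rule along the last axis."""
    return np.sum(0.5 * (f[..., 1:] + f[..., :-1]) * np.diff(x), axis=-1)

def forward(p_y, y, m):
    """Equation 27: p(m) = integral of p(y) exp(+i m y) dy."""
    return trapezoid(p_y * np.exp(1j * np.outer(m, y)), y)

def inverse(p_m, m, y):
    """Equation 28: p(y) = (1 / 2 pi) integral of p(m) exp(-i m y) dm."""
    return trapezoid(p_m * np.exp(-1j * np.outer(y, m)), m) / (2.0 * np.pi)

b = 2.5                                    # assumed half-width of the load
y = np.linspace(-b, b, 2001)               # the load is zero outside [-b, b]
p_y = np.full_like(y, 1.0 / (2.0 * b))     # uniform load with unit resultant

# Forward transform at a few wavenumbers versus the closed form sin(mb)/(mb)
m_check = np.linspace(-2.0, 2.0, 41)
p_m_numeric = forward(p_y, y, m_check)
p_m_closed = np.sinc(m_check * b / np.pi)  # np.sinc(x) = sin(pi x) / (pi x)

# Inverse transform at the load centre over a truncated wavenumber range
m_wide = np.linspace(-400.0, 400.0, 80001)
p_center = inverse(np.sinc(m_wide * b / np.pi), m_wide, np.array([0.0]))[0]
```

Because the load is symmetric about y = 0, the transform is real; the inverse at the centre approaches 1/(2b) as the wavenumber range is widened.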

Making the load vector specific to steady-state vibrations with constant
amplitude, the time-temporal frequency transform pair reduces to:

$$p(\omega) = \bar{p} \tag{32}$$

$$p(t) = \bar{p}\,e^{i\omega t} \tag{33}$$

where $\bar{p}$ is used to represent amplitude, which allows Equations 29 and 30 to
be reduced to:

$$p(m)_\omega = \bar{p}\int_{-\infty}^{\infty} p(y)\,e^{imy}\,dy \tag{34}$$

$$p(y,t) = \frac{e^{i\omega t}}{2\pi}\int_{-\infty}^{\infty} p(m)_\omega\,e^{-imy}\,dm \tag{35}$$

for a specific $\omega$. The corresponding equations for displacements are:

$$u(m)_\omega = \bar{u}\int_{-\infty}^{\infty} u(y)\,e^{imy}\,dy \tag{36}$$

$$u(y,t) = \frac{e^{i\omega t}}{2\pi}\int_{-\infty}^{\infty} u(m)_\omega\,e^{-imy}\,dm \tag{37}$$

Element  condensation.  The  process  of  element  condensation  is  the  key 
aspect  of  the  reduction  of  computational  effort.  Element  condensation  refers 
to  the  process  of  reducing  the  number  of  degrees  of  freedom  by  relating  points 
adjacent  in  the  y-direction  using  the  functional  relationship  of  the  Fourier 
expansion.  The  dependent  degrees  of  freedom  are  then  eliminated  by  expressing 
them  in  terms  of  the  degrees  of  freedom  of  the  in-plane  nodes.  In  this  case, 
the  degrees  of  freedom  corresponding  to  the  nodes  outside  of  the  x-z  plane  are 
eliminated. Each node in the 2-D mesh maintains three degrees of freedom.

Consider  an  arbitrary  discretized  model  of  a  physical  system  that  meets 
the  requirement  of  uniform  geometry  and  material  properties  in  one  direction. 
The  coordinate  system  is  chosen  to  have  the  z-direction  positive  down  and  the 
other in-plane direction to be x. Consider three vertical planes separated by
a distance of $\Delta y$ at some arbitrary location along the geosystem. The 3-D
dynamic stiffness matrix for any element between the slices, such as that
shown in Figure 1a, is calculated using Equations 23, 24, and then 22.

The dynamic stiffness matrix for a single element, such as that shown in
Figure 1a, can be partitioned as:

$$\mathbf{S} = \begin{bmatrix} \mathbf{S}_{11} & \mathbf{S}_{12} \\ \mathbf{S}_{21} & \mathbf{S}_{22} \end{bmatrix} \tag{38}$$


where the subscripts "1" and "2" refer to the degrees of freedom on the
positive and negative face in the y-direction, respectively. The assemblage
of the dynamic equations for any two finite elements adjacent in the
y-direction, as shown in Figure 1b, can be reduced by canceling the
time-dependent exponential term on each side to:

$$\begin{bmatrix}
\mathbf{S}_{11} & \mathbf{S}_{12} & \mathbf{0} \\
\mathbf{S}_{21} & \mathbf{S}_{22}+\mathbf{S}_{11} & \mathbf{S}_{12} \\
\mathbf{0} & \mathbf{S}_{21} & \mathbf{S}_{22}
\end{bmatrix}
\begin{Bmatrix} \bar{\mathbf{U}}_b \\ \bar{\mathbf{U}}_a \\ \bar{\mathbf{U}}_c \end{Bmatrix}
=
\begin{Bmatrix} \bar{\mathbf{P}}_b \\ \bar{\mathbf{P}}_a \\ \bar{\mathbf{P}}_c \end{Bmatrix} \tag{39}$$

where

"I" denotes element in positive y-direction (from $\Delta y$ to 0)
"II" denotes element in negative y-direction (from 0 to $-\Delta y$)
"a" denotes the degrees of freedom on face a (i.e., at $y = 0$)
"b" denotes the degrees of freedom on face b (i.e., at $y = +\Delta y$)
"c" denotes the degrees of freedom on face c (i.e., at $y = -\Delta y$)

Figure 1. Condensation of finite elements adjacent in
out-of-plane (y) direction

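Using the Fourier expansion described earlier (Equations 34 and 36),
forces and displacements are expressed as:

$$\bar{\mathbf{P}}(m) = \int_{-\infty}^{\infty} \bar{\mathbf{P}}(y)\,e^{imy}\,dy \tag{40}$$

$$\bar{\mathbf{U}}(m) = \int_{-\infty}^{\infty} \bar{\mathbf{U}}(y)\,e^{imy}\,dy \tag{41}$$

where $\bar{\mathbf{P}}(m)$ and $\bar{\mathbf{U}}(m)$ are used to represent vectors of nodal forces and
displacements, respectively, in m space. Rewriting Equation 39 to incorporate
the Fourier expansion of loads:

$$\begin{bmatrix}
\mathbf{S}_{11} & \mathbf{S}_{12} & \mathbf{0} \\
\mathbf{S}_{21} & \mathbf{S}_{22}+\mathbf{S}_{11} & \mathbf{S}_{12} \\
\mathbf{0} & \mathbf{S}_{21} & \mathbf{S}_{22}
\end{bmatrix}
\begin{Bmatrix} \bar{\mathbf{U}}_b(m) \\ \bar{\mathbf{U}}_a(m) \\ \bar{\mathbf{U}}_c(m) \end{Bmatrix}
=
\begin{Bmatrix} \bar{\mathbf{P}}_b(m) \\ \bar{\mathbf{P}}_a(m) \\ \bar{\mathbf{P}}_c(m) \end{Bmatrix} \tag{42}$$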


In the transform (m) space, the displacements on the "b" and "c" faces are
related to the displacements on the "a" face at any instant in time by the
simple relationships:

$$\bar{\mathbf{U}}_b(m) = e^{-im\Delta y}\,\bar{\mathbf{U}}_a(m) \tag{43}$$

$$\bar{\mathbf{U}}_c(m) = e^{im\Delta y}\,\bar{\mathbf{U}}_a(m) \tag{44}$$

Defining:

$$\tilde{\mathbf{S}}(m) = \mathbf{S}_{21}\,e^{-im\Delta y} + (\mathbf{S}_{11} + \mathbf{S}_{22}) + \mathbf{S}_{12}\,e^{im\Delta y} \tag{45}$$

Equations 43 and 44 can be substituted into Equation 42 to get the system of
equations for the equivalent two-dimensional system shown in Figure 1c:

$$\tilde{\mathbf{S}}(m)\,\bar{\mathbf{U}}_a(m) = \bar{\mathbf{P}}_a(m) \tag{46}$$

This formulation, then, allows the 3-D finite element with a 2-D geometry to
be represented with an equivalent 2-D finite element. The representation of
surface loads is described below.
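
The condensation step can be sketched in a few lines, under the assumption that the four partitions of the element dynamic stiffness matrix are already available. In the example below the partitions are small random matrices standing in for real element blocks, and the values of m and dy are arbitrary; none of these come from the study. The check confirms that the condensed matrix reproduces the middle block row of the assembled three-face system once the face relationships are substituted.

```python
import numpy as np

def condensed_stiffness(S11, S12, S21, S22, m, dy):
    """Equivalent 2-D dynamic stiffness for wavenumber m (form of Equation 45)."""
    return (S21 * np.exp(-1j * m * dy)
            + (S11 + S22)
            + S12 * np.exp(1j * m * dy))

# Hypothetical 2x2 blocks standing in for the partitions of Equation 38
rng = np.random.default_rng(0)
n = 2
S11, S12, S22 = (rng.standard_normal((n, n)) for _ in range(3))
S21 = S12.T
m, dy = 0.05, 0.5                       # arbitrary wavenumber and slice spacing

S_tilde = condensed_stiffness(S11, S12, S21, S22, m, dy)

# Middle block row of the assembled system with the face relations substituted
U_a = rng.standard_normal(n) + 1j * rng.standard_normal(n)
U_b = np.exp(-1j * m * dy) * U_a        # face b at y = +dy
U_c = np.exp(1j * m * dy) * U_a         # face c at y = -dy
P_a = S21 @ U_b + (S22 + S11) @ U_a + S12 @ U_c
```

Only the degrees of freedom on face "a" remain as unknowns, which is the source of the reduction in computational effort.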

Surface  loads.  This  study  focuses  on  the  preparation  for  analysis  of 
waves  propagating  from  a  synthetic,  3-D  source.  Vibroseis  trucks  generally 
use a rectangular platen with plan dimensions on the order of 1 by 2 m (3 by
7 ft). At large distances from the source and with large wavelengths, this
area  approaches  a  point  source.  Therefore,  the  horizontal  distributions  of 
the  load  considered  for  this  study  were  a  point  load  and  a  rectangular  load  of 
various  sizes.  A  point  source  is  not  a  physical  reality  and  is  difficult  to 
replicate  with  finite  elements.  Kang  (1990)  used  a  point  load  and  circular 
load  as  these  were  appropriate  vibration  sources  for  pavement  systems. 

The formulation for equivalent nodal forces in the x-direction for point
and rectangular loads is described below. The formulation of equivalent
nodal forces for rectangular loads involves integration of the force
distribution in light of the interpolation function:

$$\bar{\mathbf{P}} = \int_{\Delta x} \mathbf{N}^T\,p\;dx \tag{47}$$

The distribution of forces applied to the platen is assumed to be uniform and
therefore the integration reduces to simple algebra. For example, for a
continuous, uniform load with a total magnitude of unity ($p \cdot \Delta x = 1$), the
equivalent nodal forces are 1/6 for the endpoints and 4/6 for the midpoint.
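
The 1/6, 4/6, 1/6 distribution can be reproduced by integrating the quadratic interpolation functions along one element edge. The sketch below assumes standard three-node Lagrange shape functions in the parent coordinate xi; the function name and the input values are illustrative only.

```python
import numpy as np

def equivalent_nodal_forces(p, dx, n_gauss=3):
    """Integrate N^T p over a 3-node quadratic edge of length dx (form of Equation 47)."""
    pts, wts = np.polynomial.legendre.leggauss(n_gauss)
    F = np.zeros(3)
    for xi, w in zip(pts, wts):
        N = np.array([0.5 * xi * (xi - 1.0),    # end node at xi = -1
                      1.0 - xi**2,              # mid-side node at xi = 0
                      0.5 * xi * (xi + 1.0)])   # end node at xi = +1
        F += w * N * p * (dx / 2.0)             # dx / 2 is the Jacobian
    return F

F = equivalent_nodal_forces(p=0.5, dx=2.0)      # p * dx = 1
```

The three forces sum to the total load, and the weights match Simpson's rule, as expected for a uniform load on a quadratic edge.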

Time-dependent displacements. The real-valued, time-dependent
displacements may be obtained from the calculated complex displacements, U.

If the forcing function is of the form $\sin\omega t$, then:

$$u_i = A_i\,\sin(\omega t) + B_i\,\cos(\omega t) \tag{48}$$

If the forcing function is of the form $\cos\omega t$, then:

$$u_i = A_i\,\cos(\omega t) - B_i\,\sin(\omega t) \tag{49}$$

where

$A_i$ - real part of complex displacement amplitude at node i
$B_i$ - imaginary part of complex displacement amplitude at node i

For the analysis of the vibrations produced by a Vibroseis, Equation 49 is
more appropriate. The phase angle of motion, $\phi$, is calculated by:

$$\phi = \tan^{-1}\left(\frac{B_i}{A_i}\right) \tag{50}$$
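
A short sketch of the recovery of real-valued motion from a complex amplitude follows. The amplitude, frequency, and function name are hypothetical. The phase is computed with `arctan2`, a quadrant-aware variant of the inverse tangent; this is a choice made for the example and not necessarily what the code does.

```python
import numpy as np

def real_displacement(U, omega, t, forcing="cos"):
    """Real-valued displacement history from a complex amplitude U = A + iB."""
    A, B = U.real, U.imag
    if forcing == "cos":
        return A * np.cos(omega * t) - B * np.sin(omega * t)   # cosine forcing
    return A * np.sin(omega * t) + B * np.cos(omega * t)       # sine forcing

U = 3.0 - 4.0j                          # hypothetical complex amplitude
omega = 2.0 * np.pi * 3.0               # 3 Hz in rad/s
t = np.linspace(0.0, 1.0 / 3.0, 7)      # one period of motion

u_cos = real_displacement(U, omega, t, "cos")
u_sin = real_displacement(U, omega, t, "sin")
phase = np.arctan2(U.imag, U.real)      # phase angle of motion
```

The cosine and sine cases are the real and imaginary parts, respectively, of U multiplied by exp(i omega t).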


VALIDATION  STUDIES 

Validation studies and parametric analyses were used to show that the
formulation and computer implementation are sound, accurate, and stable for
the limited problem class for which accurate solutions are available. The
findings of validation studies are not independent of the parametric
analyses because the definition of the problems for validation should conform
somewhat to the findings of parametric analyses. The results of the
validation studies are described below; the parametric analyses are described in
the next section.

The  best  form  of  validation  consists  of  comparing  the  results  between  a 
subject  program  and  exact  mathematical  relationships  for  several  different 
problems.  Comparisons  with  measured  data  or  prototype  testing  provide  a 
constructive  means  to  confirm  findings  when  conducted  under  certain  controlled 
conditions.  These  comparisons  are  not  appropriate  as  the  primary  means  of 
validation,  however.  Comparisons  with  other  numerical  approximations  are  even 
less appropriate for validation. Validation of the computer code developed
for this study was made through comparisons with analytical results for the
simplest class of planar geometry -- a horizontally layered system extending
to infinity. Green's function solutions formulated for axi-symmetric problems
by Kausel (1981) were used exclusively. Some minor differences in
displacement may exist between the Green's function solutions and the 2-D
approximations because the shape of the load is different -- disk loads were
used for the axi-symmetric problem and square loads were used for this study.
The  same  total  area  and  total  load  of  unity  were  used  to  minimize  these 
differences.  The  model  systems  used  to  validate  the  computer  code  are 
described  in  the  next  section. 

The  validation  studies  described  in  this  part  pertain  to  variations  in 
system  geometry,  material  properties,  and  aspects  of  finite  element  analysis. 
The  dynamic  vertical  displacements  are  of  primary  interest  because  they 
predominate in surface motions caused by vertical excitations. Moreover,
vertical vibrations are normally measured in non-destructive testing
techniques such as the SASW method. For purposes of this paper, the results
are presented in terms of the variations of real and imaginary components of
dynamic displacement in the y-direction (calculated at the node beneath the
centroid of the load and expanded out in the y-direction). Comparisons are
made at the free (ground) surface with distances normalized to the wavelength
of Rayleigh waves, $\lambda$, for Model 1. The displacements are oriented
positive-down to be consistent with the convention used in the formulation and
correspond to the top surface (z = 0).

Test  Models  and  Discretization  Schemes 

Four hypothetical models were created for validation studies and are
shown with unit-less dimensions in Figure 2. These models were designed to
represent ideal site conditions of horizontally layered soil overlying rock
and realistic material properties (considering units of ft-lb-sec) while
conforming to limitations of the analytical solutions. All models have the
same total height (1000 units) and are assumed to overlie a rigid material.
Model 1 is the simplest system -- a homogeneous medium overlying rock. The
range of material properties for this medium used in the following comparisons
is shown in Figure 2. The other three models consist of four homogeneous
layers overlying rock with different combinations of stiffness.

Figure 2. Test models used for validation studies

A domain with dimensions 1000 units high and 2500 units wide was chosen,
along with the material properties and frequency of excitation, to be large
enough to ignore the effects of reflections; it corresponds to about $3\lambda$ high by
$8\lambda$ wide. Three different finite element meshes were created to represent this
domain, using 4 by 10, 8 by 20, and 16 by 40 square elements. The size of
these elements corresponds to $0.8\lambda$, $0.4\lambda$, and $0.2\lambda$, respectively. A plane of
symmetry at the left boundary, defined by x = 0, was utilized to reduce the
degrees of freedom by nearly one-half. A frequency of excitation of 3 Hz,
system damping of 2 percent, and a radius of load of 5.64 (total area of 100)
were used to analyze all four models.
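
The mesh figures quoted above can be checked with simple arithmetic. The sketch below takes the Rayleigh wavelength as about 313 units, the value quoted for Model 1 at 3 Hz; the variable names are illustrative.

```python
import math

wavelength = 313.0                        # approximate Rayleigh wavelength, Model 1
height, width = 1000.0, 2500.0            # domain dimensions (unit-less)
meshes = [(4, 10), (8, 20), (16, 40)]     # (elements high, elements wide)

# Element edge length for each mesh and its ratio to the wavelength
sizes = [height / n_high for n_high, _ in meshes]
ratios = [size / wavelength for size in sizes]

# Domain size in wavelengths and radius of a disk with a total area of 100
domain_in_wavelengths = (height / wavelength, width / wavelength)
disk_radius = math.sqrt(100.0 / math.pi)
```

The element sizes come out at 250, 125, and 62.5 units, and the elements are square because the height-to-width element counts keep the same ratio as the domain.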

Analytical  Solutions 

The Green's function solutions formulated by Kausel (1981) were
calculated with the computer code PUNCH (Kausel 1989) using a personal
computer. The calculated solution approaches the exact solution as the number
of layers increases. Twenty-five layers were found to represent these models
adequately for further comparisons and validation.
The displacements calculated using PUNCH correspond to a disk load with radius
r and total load, P, of 1 ($= p\pi r^2$) or a point load with magnitude of unity.

Element  Performance  to  Static  Loads 

The  specialized  3-D  finite  element  was  evaluated  for  the  ability  to 
represent  static  response  to  various  loads.  This  evaluation  was  accomplished 
by  comparing  the  results  of  two  approaches  with  analytical  solutions.  One 
approach  was  to  place  the  algorithms  defining  the  element  stiffness  into  a 
static  finite  element  computer  code  and  examine  the  response  of  a  cantilever 
beam.  The  other  approach  used  vib3  with  a  point  load  acting  on  a  homogeneous 
body with the frequency equal to 0. Each of these is described below.

Static finite element code. A static finite element code was used to
evaluate the specialized finite element. This program evolved from an unnamed
finite element code used for instructional purposes at the University of Texas
at Austin. A cantilever beam was discretized with 2, 5, 10, 40, and 80
elements and subjected to tension, compression, and shear-induced bending
loads.  The  effect  of  element  shape  was  also  evaluated  by  considering  square, 
rectangular,  parallelogram,  and  trapezoidal  configurations.  Comparisons 
between  calculated  and  closed-form  solutions  for  displacements  and  stresses 
were  good  and  indicate  that  the  algorithms  defining  the  element  stiffness  are 
accurate  for  conditions  of  static  loading. 

Dynamic code. The static vertical displacements calculated using vib3
with Model 1 at the ground surface are shown in Figure 3 using a mesh that was
16 elements high by 40 elements wide. The comparisons with Green's function
solutions are excellent for the real part and very good for the imaginary part
at distances slightly removed from the point of load (greater than 100 units).
Comparisons are similar at all depths. The imaginary part should be zero at
all distances, but vib3 produces nonzero values at locations close to the
load. Less favorable comparisons near the point of loading are common
when a point load is modeled using the finite element method. These errors are
normally minimized through mesh refinement near the point of loading, but
accuracy close to the source is not of interest for this study.

Approximations  for  Dynamic  Loads 

The  computer  code  vib3  was  used  to  calculate  dynamic  displacements  for 
each  of  the  four  models  described  previously.  These  results  were  then 
compared  with  the  Green's  function  solutions  presented  in  the  previous 
section. All four models were discretized using the finest mesh. A square
load with plan dimensions of 5 by 5 centered about the origin with a total
load of 1 was applied at a frequency of 3 Hz. The wavelength for Rayleigh
waves, λ, is then about 313, and the dimension of the square elements is
62.5 units, or about 0.2λ. The material properties are listed in Figure 1.

The parameters defining the condensation and Fourier expansion for the
validation were selected based on the findings of Kang (1990). Values of Δy =
0.05λ and the number of Fourier discretization points, NM, equal to 256 were
fixed for the comparisons, and the finest finite element mesh was used unless
otherwise specified. This provided for a discretized extent (in the
y-direction) of about 13λ (±6.4λ), slightly less than the total extent discretized
in the x-direction (±8λ). Displacements at distances up to 5λ, or about 1500
units, are used for comparison because the amplitudes are rather small beyond
this distance.
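The discretization figures quoted above follow from simple arithmetic once the Rayleigh wavelength is fixed. The short sketch below reproduces them; the variable names are illustrative only and are not taken from vib3.

```python
# Validation-study discretization, expressed in Rayleigh wavelengths.
# All names are illustrative; the numeric values are those quoted in the text.
wavelength = 313.0      # Rayleigh wavelength at 3 Hz (units of length)
element_size = 62.5     # side of the square elements in the finest mesh
dy_ratio = 0.05         # spatial increment in y, as a fraction of the wavelength
nm = 256                # number of Fourier discretization points, NM

element_ratio = element_size / wavelength   # element size in wavelengths
ytot_ratio = nm * dy_ratio                  # total y-extent in wavelengths
half_extent = ytot_ratio / 2.0              # extent on each side of the load
compare_dist = 5.0 * wavelength             # largest distance used for comparison

print(f"element size = {element_ratio:.2f} wavelengths")
print(f"YTOT = {ytot_ratio:.1f} wavelengths (+/- {half_extent:.1f})")
print(f"5 wavelengths = {compare_dist:.0f} units")
```

The last figure, 1565 units, is what the text rounds to "about 1500 units".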

Model 1: Homogeneous system. The results for Model 1 at z = 0 are shown
in Figure 4 and compared with the Green's function solutions. The variation
of the real part of the complex displacements compares well with the Green's
function solution. The variation of the imaginary part closely follows the
Green's function solutions. Both parts of the calculated solutions compare
more favorably at distances less than 3λ, about half the distance expanded in
the y-direction. The comparisons are also slightly better at depth than
at the ground surface.

The effects of varying Poisson's ratio, ν, and damping ratio on the
displacements for Model 1 were also examined. Comparisons between the
calculated and the Green's function solutions for variations in ν indicate
that the 2-D approximation provides a reasonable means of representing
different ν. The best comparison is for ν = 0.40 and the poorest comparison
is for ν = 0.49. Generally, the imaginary part compares well but the real
part varies somewhat. The 2-D approximation provides an accurate means of
representing damping. The accuracy of calculated displacements improves
somewhat as the damping ratio increases. The results for 5 percent damping
compare much better with the Green's function solution than the results for 2
percent. Both 2 and 5 percent damping levels are used for comparisons
hereafter.

Figure 3. Vertical displacements at z = 0 for static point load and comparison with Green's function solution

Figure 4. Vertical displacements for Model 1 with Green's function solution

Other models: Stiffness varying with depth. The results for Models 2, 3
and  4  are  shown  in  Figures  5  through  7  and  compared  with  the  Green's  function 
solutions.  Total  distances  are  used  rather  than  normalized  distances  since 
considerable  dispersion  is  expected.  The  results  for  vertical  displacements 
are  nearly  equivalent  to  the  Green's  function  solutions  except  at  the  first 
peak  in  the  real  part  for  all  three  cases. 

PARAMETRIC  ANALYSES 

Parametric  analyses  were  conducted  to  assess  the  sensitivity  of  the 
formulation  and  computer  code  vib3  to  anticipated  ranges  of  system  variables. 
Calculations  were  made  using  Model  1  and  the  finest  mesh  except  in  the  case  of 
examining  sensitivity  to  mesh  size.  Green's  function  solutions  calculated 
using  PUNCH  (Kausel  1989)  are  used  for  comparison. 

Effect of Δy

The effect of the spatial increment of discretization in the
y-direction was evaluated by comparing the results using three values of Δy
between 0.05λ and 0.20λ (0.05λ was used for the validation study). The number of FFT
points, NM, was also varied to keep the total discretized distance in the
y-direction, YTOT, constant. This distance is defined by:

\[ \mathrm{YTOT} = NM \cdot \Delta y \tag{51} \]

Keeping YTOT constant serves to isolate the effects of Δy. The variation of
vertical displacements for the different values of Δy is compared in
Figure 8.
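To illustrate how NM must change when Δy changes and YTOT is held fixed, the sketch below applies Equation 51 directly. The function name and the middle value Δy = 0.10λ are assumptions made for illustration; the text states only that three values between 0.05λ and 0.20λ were used.

```python
# Choose NM so that YTOT = NM * dy stays fixed while dy changes (Equation 51).
# Assumed for illustration: dy values of 0.05, 0.10 and 0.20 wavelengths and
# the validation extent YTOT = 12.8 wavelengths.
def points_for_extent(ytot, dy):
    """Return the number of FFT points NM giving total extent ytot at spacing dy."""
    return round(ytot / dy)

ytot = 12.8  # total discretized distance in y, in wavelengths
for dy in (0.05, 0.10, 0.20):
    nm = points_for_extent(ytot, dy)
    print(f"dy = {dy:.2f} wavelengths -> NM = {nm}, YTOT = {nm * dy:.1f}")
```

Under these assumptions each doubling of Δy halves NM, which keeps the resulting point counts at powers of two, convenient for an FFT.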

Figure 7. Vertical displacements for Model 4 with Green's function solution

Figure 8. Comparison of vertical displacements with Green's function solution showing effect of Δy

The large differences among the relationships presented in Figure 8 indicate
that Δy has a significant effect on the ability of vib3 to accurately
calculate dynamic displacements. Comparisons between the calculated
displacements and Green's function solutions are favorable when Δy ≤ 0.10λ,
although some improvement is noticeable when Δy is decreased to 0.05λ. These
results are consistent with those of Kang (1990), who recommended that Δy ≤ 0.10λ.

The results for Δy ≥ 0.20λ are considered to be too inaccurate. The
parametric analysis of Δy with respect to λ indirectly addresses the effect of
frequency of excitation on the results. For a homogeneous system (with
constant stiffness), λ is inversely proportional to frequency. So, the
spatial increment Δy can also be put in terms of frequency:

\[ \Delta y \le \frac{V}{10 f} \tag{52} \]

where

V = phase velocity
f = frequency (Hz)

The phase velocity can be taken equal to the Rayleigh wave velocity as a first
approximation. Relationships similar to Equation 52 have been observed in
other types of discretized solutions for dynamic loading.
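Equation 52 can be evaluated directly for a given phase velocity and frequency. The helper below is a minimal sketch; its name and the `points_per_wavelength` parameter are illustrative, and the velocity is inferred from the wavelength of 313 at 3 Hz quoted earlier.

```python
def max_dy(phase_velocity, frequency, points_per_wavelength=10):
    """Largest y-increment allowed by Equation 52: dy <= V / (10 f)."""
    if frequency <= 0:
        raise ValueError("frequency must be positive")
    return phase_velocity / (points_per_wavelength * frequency)

# Illustrative values: a Rayleigh wavelength of 313 at 3 Hz implies V = 939.
v_rayleigh = 313.0 * 3.0
print(max_dy(v_rayleigh, 3.0))   # one tenth of the wavelength
```

Because V / f is the wavelength, the bound is simply one tenth of a wavelength; doubling the frequency halves the admissible Δy.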

Effect of Extent of Fourier Expansion

The comparisons for the effect of Δy were made using a constant value of
YTOT. The effect of varying YTOT was examined next. The total distance was
varied at three values between 3.2λ and 12.8λ (corresponding to ±1.6λ and
±6.4λ, respectively) by keeping Δy constant at 0.05λ and varying NM between
128 and 512. The variation of vertical displacements for the three values of
YTOT is compared in Figure 9. The results for the vertical displacements are
very good for the case of YTOT = ±12.8λ. The results for YTOT = ±6.4λ are
also good, especially for distances less than 3λ, and the results for
YTOT = ±3.2λ are considered to be too inaccurate. A threshold of 10λ is
likely to be appropriate.

Effect of Element Size

The effect of varying the size of the finite elements on the solution
was determined using three different meshes. The values
of Δx (= Δz) corresponding to these three meshes are 0.20λ, 0.40λ, and 0.80λ.
The results for the three meshes are compared in Figure 10.

The variation of vertical displacements compares well with the Green's
function solutions except for the coarsest mesh (4 by 10 elements). The
results for the coarsest mesh are unacceptable. The finest mesh produces peak
values of displacement slightly greater than those of the Green's function
solution and the original mesh.

Effect of Width of Load

The load width in the x- and y-directions, XLOAD, ranges from a point
load to ±0.064λ (80 by 80 in total plan dimensions at 3 Hz). For all
practical purposes and at these distances and depths, these loads are
essentially point loads. The results for the different load
widths are compared in Figure 11. Little noticeable effect is evident as XLOAD
is varied over the specified range. A threshold of load width ≤ ±0.10λ appears to
be reasonable to maintain good accuracy. The small difference between the
results for the point load and the smallest square load is somewhat
surprising. Kang (1990) noticed a larger difference, and researchers have
recognized the difficulty in calculating an accurate distribution of
displacements from a point load using the finite element procedure without a
refined mesh in the vicinity of the load.

Computational  Effort 

The amount of time necessary to run the program with different system
parameters was reviewed. The two parameters considered to have the greatest
effect are the number of FFT points, NM, and the number of degrees of freedom,
dof. (Recall that the equations are solved for only half of the NM points and the
results mirrored prior to the inverse Fourier transform.) Comparisons of user
CPU (central processing unit) times versus NM are shown in Figure 12. The
three finite element meshes described earlier were used to provide a range in
degrees of freedom. The solution times are of the same order as NM (linear
relationship) for a fixed number of dof. The slopes of these lines range from
1.3 to 26. Comparisons of user CPU times versus dof for various NM are shown
in Figure 13. The relationship is slightly nonlinear for a fixed NM; the
exponent of dof is about 1.12 and increases slightly as NM increases.


Figure 11. Comparison of vertical displacements showing
effect of varying load width (f = 3.0 Hz, Δy/λ = 0.05, damping ratio = 0.02,
NM = 256, λ = 313, YTOT/λ = ±6.40; horizontal axis: horizontal distance in λ)

The amount of time saved in using the present formulation over a
conventional 3-D finite element formulation was estimated by solving the
problem for Model 1 using the commercial software package ABAQUS. Two planes
of symmetry were used such that only a 3-D quarter space was required to be
discretized. A total discretized space of 8λ by 3λ in plan by 3λ deep was
used and the element size was equal to that used in the 8 by 20 mesh
(Δx = Δy = Δz = 0.40λ). A 3-D isoparametric element with 20 nodes (quadratic
interpolation functions in all three directions) was selected as being
comparable. The extent and accuracy of discretization in the y-direction is
roughly equivalent to NM = 64 and Δy = 0.10λ, which were used with vib3. Free
end conditions were used for non-symmetric boundaries. The calculated results
were not of particular interest. The user CPU time required by ABAQUS to
solve for dynamic displacements was about 2820 sec compared to 370 sec using
vib3. A comparison of times is shown in Figure 12. Moreover, almost 8 Mwords
of memory were required to solve the problem using ABAQUS whereas about
3.6 Mwords were used by vib3.
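As a rough check on these figures (a worked ratio using only the values quoted above, with the ABAQUS memory taken as 8 Mwords):

\[ \frac{2820\ \text{sec}}{370\ \text{sec}} \approx 7.6, \qquad \frac{8\ \text{Mwords}}{3.6\ \text{Mwords}} \approx 2.2, \]

that is, vib3 needed roughly one-eighth of the CPU time and somewhat less than half of the memory used by ABAQUS for this problem.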

SUMMARY  AND  CONCLUSIONS 

A  method  to  calculate  dynamic  displacements  in  2-D  geosystems  produced 
by  a  harmonic  point  or  rectangular  load  has  been  formulated  and  implemented  in 
a  2-D  finite  element  computer  code  that  functions  on  the  US  Army  CRAY 
supercomputer  at  WES.  The  formulation  involves  creating  a  3-D  dynamic 
stiffness  matrix  and  then  condensing  the  components  into  an  equivalent  2-D 
dynamic  stiffness  matrix.  The  out-of-plane  loads  are  represented  by  a  Fourier 
expansion  and  applied  as  nodal  forces.  The  solution  to  the  system  of 
equations  is  made  for  each  spatial  wavenumber  and  then  the  inverse  Fourier 
transform  produces  the  dynamic  displacements.  The  computer  code  has  been 
validated with analytical solutions for the case of axisymmetric geosystems
subjected to static and dynamic loads. Parametric studies were performed to
determine how the accuracy of the calculated displacements is affected by the
various input parameters. All comparisons indicate that this method is a
viable alternative to more time-consuming 3-D numerical solution methods.

Validation  studies  were  performed  for  cases  of  static  and 
dynamic  loads  generally  using  reasonable  values  of  system  parameters.  The 
effects  of  static  loads  were  examined  in  terms  of  displacement  and  stress 
field  for  cantilever  beams  in  tension,  compression,  and  torsion  using  the 
specialized  16-node,  3-D,  finite  element  incorporated  into  a  static  3-D  finite 
element computer code. Calculated values were compared with closed-form
elastic  solutions.  The  displacements  produced  by  static  and  dynamic  point  and 
square  loads  were  examined  for  cases  of  a  homogeneous  medium  and  three 
combinations  of  four-layered  media  using  vib3  and  compared  with  Green's 
function  solutions  proposed  by  Kausel  (1981). 

The analysis of parameters necessary to the program indicates that once
threshold values are met, the formulation is stable to variations in
parameters defining the discretization, condensation, and Fourier expansion of
the problem. These thresholds are: Δy ≤ 0.05λ, YTOT ≥ ±10λ, Δx = Δz ≤ 0.30λ
(for quadratic interpolation), and XLOAD ≤ ±0.10λ. Additional improvements
may be realized by using even smaller values of Δx, Δy, and Δz. Displacements
can be calculated about 8 times faster using the new formulation when compared
to the 3-D finite element code ABAQUS.

ABSTRACT. The Stroh formalism for two-dimensional deformations of
anisotropic elastic bodies is generalized so that it can be easily applied to boundary
conditions more general than the prescription of traction, displacement or slip
boundary conditions. A simple modification is all it takes to encompass all eight different
types of boundary conditions. The final solution to a given problem, which looks very
similar to that of the unmodified version, is applicable to any one of the eight boundary
conditions. By relaxing the definition of $\mathbf{I}_u$ and $\mathbf{I}_\phi$ given in the paper, the number of
different types of boundary conditions is increased to cover more than eight types. It is
worth mentioning that the modifications required on the Stroh formalism are very minor.
Yet the results are applicable to a rather wide range of boundary conditions.

EXTENDED SUMMARY. The present work is motivated by, and is an extension
of, the work presented in [1] in which the boundary conditions at $x_2 = \pm 1$ of a semi-infinite
strip of anisotropic elastic material can be any one of the following eight conditions:

$\sigma_{23} = 0.$

Equations (1a), (1h) and (1c) are, respectively, the conditions for a traction-free, rigid,
and slip boundary. According to the Stroh formalism [2, 3] the stresses are related to
the stress function $\phi_i$ by

\[ \sigma_{1i} = -\phi_{i,2}, \qquad \sigma_{2i} = \phi_{i,1}, \tag{2} \]

where the comma stands for differentiation with respect to $x_1$ or $x_2$. Hence

\[ \phi_i(x_1, x_2) = \int^{x_1} \sigma_{2i}(\xi, x_2)\, d\xi \]

and the boundary conditions for $\sigma_{2i} = 0$ can be replaced by $\phi_i = 0$. In the matrix notation $\mathbf{u}$
and $\boldsymbol{\phi}$, we may write (1a)-(1h) in one equation as

\[ \mathbf{I}_u \mathbf{u} + \mathbf{I}_\phi \boldsymbol{\phi} = \mathbf{0}, \tag{3} \]

\[ \mathbf{I}_u + \mathbf{I}_\phi = \mathbf{I}. \tag{4} \]

In the above $\mathbf{I}$ is the $3 \times 3$ unit matrix and $\mathbf{I}_u$, $\mathbf{I}_\phi$ are $3 \times 3$ diagonal matrices whose diagonal
elements are either one or zero. The special cases (1a) and (1h) correspond, respectively, to
$\mathbf{I}_u = \mathbf{0}$, $\mathbf{I}_\phi = \mathbf{I}$ and $\mathbf{I}_u = \mathbf{I}$, $\mathbf{I}_\phi = \mathbf{0}$. Equation (3) encompasses all eight conditions in (1).

We will generalize the above derivation in two ways. First, the right-hand side of
(3) is replaced by something which is prescribed. Thus one may prescribe nonzero
tractions or displacements at the boundary. Secondly, we replace (4) by

\[ \mathbf{I}_u \mathbf{I}_u^T + \mathbf{I}_\phi \mathbf{I}_\phi^T = \mathbf{I}, \tag{5} \]

\[ \mathbf{I}_u \mathbf{I}_\phi^T = \mathbf{0} = \mathbf{I}_\phi \mathbf{I}_u^T, \tag{6} \]

where $\mathbf{I}_u$, $\mathbf{I}_\phi$ are now subject to (5) and (6) but otherwise arbitrary, and the superscript $T$
denotes the transpose. The $\mathbf{I}_u$, $\mathbf{I}_\phi$ defined earlier satisfy (5) and (6) but the new $\mathbf{I}_u$, $\mathbf{I}_\phi$
defined by (5) and (6) admit a wider class of boundary conditions. The explicit
expressions of the new $\mathbf{I}_u$, $\mathbf{I}_\phi$ and the physical meanings of some of the $\mathbf{I}_u$, $\mathbf{I}_\phi$ will be
presented.  The  modification  required  on  the  Stroh  formalism  to  encompass  the  general 
boundary  conditions  is  rather  minimal.  The  final  solution,  which  resembles  the  original 
unmodified  solution,  applies  to  a  wide  range  of  boundary  conditions.  One  advantage  of  the 
generalized  formalism  is  that,  when  a  different  boundary  condition  is  desired,  there  is  no 
need  to  re-formulate  the  problem. 
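The selector matrices can be checked numerically. The sketch below assumes the two relaxed conditions read I_u I_u^T + I_phi I_phi^T = I and I_u I_phi^T = 0 = I_phi I_u^T. It enumerates the eight diagonal pairs and then tests one non-diagonal pair built by multiplying a diagonal pair by a rotation. That rotated pair is a hypothetical example chosen for illustration, not an expression taken from the paper, and all helper names are assumptions.

```python
from itertools import product
from math import cos, sin

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]

def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(3)] for i in range(3)]

def diag(d):
    return [[float(d[i]) if i == j else 0.0 for j in range(3)] for i in range(3)]

def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) <= tol for i in range(3) for j in range(3))

IDENTITY = diag((1, 1, 1))
ZERO = diag((0, 0, 0))

def satisfies_relaxed_conditions(i_u, i_phi):
    """Check I_u I_u^T + I_phi I_phi^T = I and I_u I_phi^T = 0 = I_phi I_u^T."""
    first = close(add(matmul(i_u, transpose(i_u)), matmul(i_phi, transpose(i_phi))), IDENTITY)
    second = (close(matmul(i_u, transpose(i_phi)), ZERO)
              and close(matmul(i_phi, transpose(i_u)), ZERO))
    return first and second

# The eight original cases: diagonal 0/1 matrices with I_u + I_phi = I.
diagonal_pairs = [(diag(d), diag(tuple(1 - x for x in d)))
                  for d in product((0, 1), repeat=3)]
print(len(diagonal_pairs),
      all(satisfies_relaxed_conditions(u, p) for u, p in diagonal_pairs))

# A hypothetical non-diagonal pair: multiply a diagonal pair by a rotation Q.
t = 0.3
q = [[cos(t), -sin(t), 0.0], [sin(t), cos(t), 0.0], [0.0, 0.0, 1.0]]
i_u, i_phi = matmul(diag((1, 0, 0)), q), matmul(diag((0, 1, 1)), q)
print(satisfies_relaxed_conditions(i_u, i_phi), close(add(i_u, i_phi), IDENTITY))
```

In this sketch the rotated pair passes the relaxed conditions although its sum is no longer the unit matrix, which is one way to see that the relaxed definition admits more than the eight diagonal cases.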

Abstract

An  algorithm  is  proposed  for  the  solution  of  non-convex  variational  problems.  In  order 
to  avoid  representing  highly  oscillatory  functions  on  a  mesh,  an  associated  Young  measure, 
which  characterizes  such  oscillations,  is  also  approximated.  Sample  calculations  demonstrate 
the  viability  of  this  approach. 

keywords:  Calculus  of  Variations,  Young  Measures. 


1  Introduction 

A recent development in continuum mechanics is the introduction of continuum energy
functionals modeling nonlinear effects of crystal thermoelasticity [2, 6, 7, 8]. Among other things
these  functionals  can  be  used  to  study  displacive  phase  transformations  and  shape  memory 
effects  [9]. 

A  characteristic  feature  of  the  energy  functionals  is  their  multiple  well  structure.  Typically,  each 
well  represents  a  potential  equilibrium  state  of  the  crystal,  and  at  a  transformation  temperature 
more  than  one  well  is  accessible  to  the  crystal  as  a  stable  configuration. 

The  variational  approach  to  finding  an  overall  equilibrium  state  for  the  crystal  requires  that  the 
energy  functional  be  minimized  in  some  suitable  sense.  In  attempting  such  minimizations,  one 
frequently  encounters  minimizing  sequences  of  rapidly  oscillating  functions.  These  oscillations 
are  usually  a  mathematical  precursor  to  the  formation  of  microstructure.  This  microstructure 
is  characterized  mathematically  by  probability  distributions  which,  in  principle,  can  be  found 
by  taking  certain  averages  of  the  oscillatory  functions. 

In computational practice, the minimizing sequences are often constructed using a finite mesh,
for example by finite elements. The oscillations referred to above then show up as grid scale
oscillations of the (generally nonunique) minimizer. As the mesh is refined, the oscillations
persist, becoming more and more rapid while remaining of finite amplitude, e.g. [4]. Usually, one
wishes to know the values of macroscopic quantities associated with the deformation. These
are computed in two different ways. Essentially, linear functions of the deformation can be
obtained as the limits of the same linear functions of the minimizing sequence. On the other
hand, nonlinear functions of the deformation (including energy) in general have to be computed
as expected values of the probability distribution mentioned in the previous paragraph.

* This work was supported by the Army Research Office and National Science Foundation through the Center
for Nonlinear Analysis.

† Supported by Air Force Grant AFOSR F49620-92-3-0133.

In  some  situations,  the  probabilities  that  are  needed  for  computing  nonlinear  functions  of  the 
deformation  are  known  a  priori.  In  this  paper  we  are  interested  in  the  opposite  case.  Although 
in  principle  it  must  be  possible  to  compute  the  probabilities  from  the  oscillatory  minimizing 
sequence,  in  practice  this  could  be  very  difficult  if  there  were  a  relatively  large  number  of  wells. 
Also,  it  is  easy  to  imagine  that  a  rather  fine  mesh  would  be  necessary  to  accumulate  enough 
data  to  permit  the  evaluation  of  stable  averages.  We  refer  to  this  as  the  “microscopic”  approach. 

An  alternative  method  is  to  compute  with  the  probabilities  as  dependent  variables.  However, 
this  is  feasible  only  if  we  have  some  information  about  the  limiting  probability  distributions 
which  can  occur.  It  turns  out  that  frequently  there  is  enough  prior  information  to  permit  the 
computation  to  be  done  in  that  way.  Our  main  goal  is  to  investigate  this  alternative  approach, 
which  we  will  call  the  “macroscopic”  methodology.  Its  potential  advantage  is  that  since  the 
probabilities are smoothly varying quantities, a relatively coarse mesh can be used to approximate
them. In this way we can avoid the need to deal with the oscillations explicitly. Nonlinear
functions  of  the  deformation  (and  linear  functions  as  a  special  case)  may  be  approximated  using 
the  computed  probability  distributions. 

In  the  sections  which  follow,  we  state  our  algorithm  and  explain  the  ideas  behind  it.  Then  we 
present  the  results  of  some  model  computations.  The  work  reported  is  of  a  preliminary  nature. 
So  far,  we  do  not  have  sufficient  experience  with  the  algorithm  to  make  rational  comparisons 
with  other  approaches.  It  is  hoped  to  address  these  issues  in  a  future  report. 


2  The  macroscopic  formulation 

We will begin with a simplified presentation of some background information in variational
methods which is sufficient for understanding the principles behind the formulation of the
algorithm. More detailed accounts can be found in [5, 12, 13].

We consider variational integrals

\[ J(u) := \int_\Omega F(x, u, \nabla u)\, dx, \qquad u \in W_0^{1,p}(\Omega)^m, \tag{1} \]

where $\Omega \subset \mathbb{R}^n$ is a bounded domain and $F(\cdot, \cdot, \cdot)$ is continuous. Inhomogeneous boundary
conditions can easily be accommodated if necessary. The case of most interest is when $F(x, u, \cdot)$
is not convex with respect to its last variable. The multiple well property of stored elastic
energy functions causes this lack of convexity. In this case, the infimum of $J(\cdot)$ usually cannot
be reached in $W_0^{1,p}(\Omega)^m$ and it is necessary to admit generalized solutions.

The standard example to illustrate this is

\[ J(u) := \int_0^1 \left[ (u'^2 - 1)^2 + u^2 \right] dx, \qquad u \in W_0^{1,4}(0, 1). \]

The sequence $\{u_k\}$, whose first three members are illustrated in Figure 1b below, gives

\[ J(u_k) = \int_0^1 u_k^2\, dx \to 0 = \inf_{W_0^{1,4}} J(u), \]

so that $\{u_k\}$ is a minimizing sequence. Also clear is that $\lim_{k \to \infty} u_k = u = 0$. On the other
hand, $u_k' \not\to 0$ in any ordinary sense, and so for this $\{u_k\}$, $\inf_{W_0^{1,4}} J(u)$ is not attained. It is the
existence of the two wells at $\pm 1$, illustrated in Figure 1a, which is responsible for the oscillatory
behavior of the sequence $\{u_k'\}$.

Figure 1a. The two-well function $F(p) = (p^2 - 1)^2$
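The behavior of such a minimizing sequence can be checked numerically. The sketch below assumes that $u_k$ is the sawtooth with slopes $\pm 1$ and $k$ teeth on $[0, 1]$ (the usual choice for this example; the exact sequence drawn in Figure 1b is not recoverable here). For that sequence the gradient term vanishes and the energy reduces to $\int_0^1 u_k^2\, dx = 1/(12k^2)$. Function names are illustrative.

```python
import math

def u_k(x, k):
    """Sawtooth with slopes +/-1 and k teeth on [0, 1]; u_k(0) = u_k(1) = 0."""
    return abs(x - round(x * k) / k)

def du_k(x, k):
    """Slope of u_k away from the kinks: +1 on rising parts, -1 on falling parts."""
    s = x * k - math.floor(x * k)
    return 1.0 if s < 0.5 else -1.0

def energy(k, cells=20000):
    """Midpoint-rule approximation of J(u) = int_0^1 (u'^2 - 1)^2 + u^2 dx."""
    h = 1.0 / cells
    total = 0.0
    for i in range(cells):
        x = (i + 0.5) * h
        total += ((du_k(x, k) ** 2 - 1.0) ** 2 + u_k(x, k) ** 2) * h
    return total

for k in (1, 2, 4, 8):
    # Second column: numerical energy; third column: closed form 1 / (12 k^2).
    print(k, energy(k), 1.0 / (12.0 * k * k))
```

The energies decrease toward zero as $k$ grows, while the slopes stay at $\pm 1$ and never approach the slope of the limit $u = 0$.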


The behavior of the sequence $\{u_k'\}$ strongly suggests that its “values” at any point $0 < x_0 < 1$
may be described by the probability distribution $u'(x_0) = \pm 1$ with probability 1/2. In fact there
is a general result from which this can be inferred: for any bounded sequence in $W^{1,p}(\Omega)^m$,
$\| u_k \|_{1,p} \le M$, $\{u_k\}$ contains a subsequence $\{u_{k_j}\}$ such that $u_{k_j} \rightharpoonup u \in L^p(\Omega)^m$. Additionally a
subsequence of $\{u_{k_j}\}$ exists (denoted the same way) with the property that for any continuous
$g$ which is reasonably behaved at infinity, and for each $x \in \Omega$, there is a probability measure $\nu_x$
such that

\[ g(\nabla u_{k_j}) \rightharpoonup G \in L^p(\Omega)^m, \tag{2} \]

where

\[ G(x) = \int_{\mathbb{R}^n} g(y)\, d\nu_x(y) \tag{3} \]

for almost all $x \in \Omega$. A useful version of this result, due to Kinderlehrer and Pedregal [10],
states that if the sequence $\{u_k\}$ is a minimizing sequence for a variational problem having
non-negative integrand with $p$-growth, then $g$ may also have $p$-growth.

A family of probability measures $\{\nu_x\}$ obtained in this way is called a family of gradient Young
measures [11]. Young measures also exist which are not gradient measures. They are derived
in a similar way from bounded sequences in $L^p(\Omega)^m$.


There is a very useful characterization of Young measures $\nu_x$ due to Ball [1]. We will state
this for gradient Young measures: let $\nu_{x,\delta}^{k}$ denote the probability distribution of the values of
$\nabla u_k(z)$ as $z$ is chosen uniformly at random from $B(x, \delta)$, the open ball with radius $\delta$ and center
$x \in \Omega$. Then

\[ \lim_{\delta \to 0} \lim_{k \to \infty} \left| \int_{\mathbb{R}^n} g(y)\, d\nu_x(y) - \int_{\mathbb{R}^n} g(y)\, d\nu_{x,\delta}^{k}(y) \right| = 0. \]

This result reveals how it is the minimizing sequence that determines the probability distribution
$\nu_x$ and provides a way to approximate it.
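This characterization can be tried out on the one-dimensional sawtooth example. The sketch below assumes the $k$-tooth sawtooth with slopes $\pm 1$ and replaces random sampling of $z$ by a uniform grid of midpoints in $(x_0 - \delta, x_0 + \delta)$; the function names and sample sizes are illustrative choices.

```python
import math

def slope(z, k):
    """Derivative of the k-tooth sawtooth (slopes +/-1) at a point z in (0, 1)."""
    s = z * k - math.floor(z * k)
    return 1.0 if s < 0.5 else -1.0

def local_distribution(x0, delta, k, samples=10000):
    """Fractions of points z in (x0 - delta, x0 + delta) with u_k'(z) = +1 and -1.

    A uniform grid of midpoints stands in for z chosen uniformly at random.
    """
    width = 2.0 * delta / samples
    plus = sum(
        1 for i in range(samples)
        if slope(x0 - delta + (i + 0.5) * width, k) > 0
    )
    return plus / samples, 1.0 - plus / samples

print(local_distribution(0.3, 0.05, 1))      # coarse oscillation: a single slope
print(local_distribution(0.3, 0.05, 1000))   # fine oscillation: close to (1/2, 1/2)
```

For fixed $\delta$ the two fractions approach 1/2 each once the oscillation is much finer than the ball, which matches the order of the limits: $k \to \infty$ first, then $\delta \to 0$.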

The result (2)-(3) does not give any information about the structure of the measure $\nu_x$, and
in particular whether it is discrete. General results on this do not appear to be available.
Nevertheless, there is a large class of problems where it is expected on physical grounds that $\nu_x$
is indeed discrete. This class includes most, if not all, of the continuum functionals used so far
to model crystal energy. Since we want to make essential use of discreteness, we will introduce
it as a hypothesis. Specifically, we will assume that

\[ \nu_x = \sum_{l=1}^{L} \lambda_l(x)\, \delta_{A_l(x)}, \tag{4} \]

\[ \sum_{l=1}^{L} \lambda_l(x) = 1, \qquad 0 \le \lambda_l \le 1, \tag{5} \]

where $\delta_{A_l(x)}$ denotes a Dirac mass with pole at $A_l(x)$ and $A_l(\cdot)$ varies measurably with $x$.

References  [3,  2,  8]  contain  examples  satisfying  the  discreteness  hypothesis. 

Choosing $g$ in (2) to be $F(x, u(x), \cdot)$ and denoting by $\{u_{k_j}\}$ a minimizing sequence bounded in
$W^{1,p}(\Omega)^m$, we have

\[ \lim_{j \to \infty} \int_\Omega F(x, u_{k_j}, \nabla u_{k_j})\, dx = \int_\Omega \langle \nu_x, F(x, u(x), \cdot) \rangle\, dx, \]

where $\langle \nu_x, \cdot \rangle$ denotes the action on the right side of (3). Additionally, choosing $g$ in (2)-(3) to
be the identity mapping shows that

\[ \nabla u = \sum_{l=1}^{L} \lambda_l(x) A_l(x). \]

These results motivate the following generalized variational problem: minimize

\[ I(u) := \int_\Omega \langle \nu_x, F(x, u(x), \cdot) \rangle\, dx, \qquad u \in W_0^{1,p}(\Omega)^m, \tag{6} \]

subject to

\[ \nabla u(x) = \sum_{l=1}^{L} \lambda_l(x) A_l(x) \]

over suitable $A_l \in L^p(\Omega)^{mn}$ and $\lambda_l$, $l = 1, 2, \ldots, L$. Solutions to this problem are
regarded as generalized solutions to (1). Notice that classical solutions to (1) may be recovered
from the generalized formulation by taking, say, $\lambda_1 = 1$.

The variables in the generalized formulation are, in principle, slowly varying or macroscopic.


3  Numerical  Algorithm 

In this section we consider discretizations of the generalized problem. Basically, we use continuous
piecewise linear approximations for $u$, and piecewise constant approximations for the $A_l$
and $\lambda_l$, $1 \le l \le L$. However, it is important to note that the $A_l$ cannot always be arbitrarily
chosen, since the combination on the right of (4) must be a gradient Young measure. We present
a general way to handle this issue.

3.1  Computing  the  Constraints 

The algorithm presented above involves several constraints, namely,

\[ \nabla u = \sum_{l=1}^{L} \lambda_l A_l, \qquad \sum_{l=1}^{L} \lambda_l = 1, \qquad \text{and} \qquad 0 \le \lambda_l \le 1, \quad 1 \le l \le L \]

(recall that the discrete $u$ is piecewise linear, so its gradient is piecewise constant, as are
the discrete $A_l$ and $\lambda_l$). In addition to these obvious constraints, when $u$ is vector valued
further constraints on the representation of the gradient are required to guarantee that
$\nu = \sum_{l=1}^{L} \lambda_l \delta_{A_l}$ is a gradient Young measure. The constraints on $\{\lambda_l\}_{l=1}^{L}$ are convex and trivially
accommodated; however, the constraints associated with the gradient are not convex. Moreover,
since imposing constraints can be computationally taxing, it is imperative to resolve them in
an efficient manner. Below we outline an algorithm that effectively eliminates the constraints
on $\nabla u$ analytically.

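We begin by considering the case with $L = 2$, i.e.

\[ \nabla u = \lambda A_0 + (1 - \lambda) A_1. \]

Letting $b = A_1 - A_0$, we may write

\[ A_0 = \nabla u - (1 - \lambda) b, \qquad \text{and} \qquad A_1 = \nabla u + \lambda b. \]

In this situation,

\begin{align*}
\langle F(x, u, \cdot), \nu \rangle &= \lambda F[x, u, A_0] + (1 - \lambda) F[x, u, A_1] \\
&= \lambda F[x, u, \nabla u - (1 - \lambda) b] + (1 - \lambda) F[x, u, \nabla u + \lambda b].
\end{align*}

In the scalar case, $b \in \mathbb{R}^n$ can be selected arbitrarily; however, when $u$ is vector valued,
$\nabla u \in \mathbb{R}^{m \times n}$, and it is necessary and sufficient that $b = A_1 - A_0$ be a rank one matrix,
$A_1 - A_0 = a \otimes n$, in order to obtain a gradient Young measure ($a \in \mathbb{R}^m$, $n \in \mathbb{R}^n$ may be chosen
freely), i.e.

\[ A_0 = \nabla u - (1 - \lambda)\, a \otimes n, \qquad A_1 = \nabla u + \lambda\, a \otimes n, \]

\begin{align*}
\langle F(x, u, \cdot), \nu \rangle &= \lambda F[x, u, A_0] + (1 - \lambda) F[x, u, A_1] \\
&= \lambda F[x, u, \nabla u - (1 - \lambda)\, a \otimes n] + (1 - \lambda) F[x, u, \nabla u + \lambda\, a \otimes n].
\end{align*}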

To obtain a representation of the gradient for arbitrarily large $L$, we repeat the construction as follows. Given $A_0$ and $A_1$ as above, write

$$A_0 = \lambda_0 A_{00} + (1 - \lambda_0) A_{01}, \qquad A_1 = \lambda_1 A_{10} + (1 - \lambda_1) A_{11},$$

where

$$A_{01} - A_{00} = b_0 \qquad \text{and} \qquad A_{11} - A_{10} = b_1$$

if $u$ is scalar valued, and

$$A_{01} - A_{00} = a_0 \otimes n_0 \qquad \text{and} \qquad A_{11} - A_{10} = a_1 \otimes n_1$$

if $u$ is vector valued. The representation for the gradient then becomes

$$\nabla u = \lambda \lambda_0 A_{00} + \lambda (1 - \lambda_0) A_{01} + (1 - \lambda) \lambda_1 A_{10} + (1 - \lambda)(1 - \lambda_1) A_{11}.$$

The quantities $A_{00}$ etc. are determined from $b$ (or $a$ and $n$), $\lambda$, $b_0$ (or $a_0$ and $n_0$), $\lambda_0$, etc.; for example,

$$A_{01} = \nabla u - (1 - \lambda) b + \lambda_0 b_0$$

in the scalar case, and

$$A_{01} = \nabla u - (1 - \lambda)\, a \otimes n + \lambda_0\, a_0 \otimes n_0$$

in the vector case.

By repeating this process $N$ times, we obtain admissible Young measures consisting of $2^N$ Dirac masses. This construction is conveniently represented with a binary tree as shown in Figure 2. Each matrix occurring in the representation of the measure corresponds to a leaf on the tree, and is uniquely identified by a binary word of length $N$.
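The recursive construction can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`outer`, `axpy`, `laminate`), the dictionary layout keyed by binary words, and the sample numbers are hypothetical and not taken from the paper. Each node with matrix $A$ and parameters $(\lambda, a, n)$ is split into children $A - (1-\lambda)\, a \otimes n$ (weight $\lambda$) and $A + \lambda\, a \otimes n$ (weight $1-\lambda$), so the weighted mean of the leaves stays equal to $\nabla u$.

```python
def outer(a, n):
    """Rank-one matrix a (x) n stored as a list of rows."""
    return [[ai * nj for nj in n] for ai in a]


def axpy(alpha, X, Y):
    """Return alpha*X + Y for matrices stored as lists of rows."""
    return [[alpha * x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def laminate(grad_u, params, depth):
    """Leaves of the binary tree as {binary word: (weight, matrix)}.

    params maps every internal node (a binary word, '' for the root) to a
    triple (lam, a, n): the volume fraction and the two rank-one factors.
    """
    nodes = {"": (1.0, grad_u)}
    for _ in range(depth):
        children = {}
        for word, (weight, A) in nodes.items():
            lam, a, n = params[word]
            B = outer(a, n)
            # Child 0 carries weight lam, child 1 carries weight 1 - lam.
            children[word + "0"] = (weight * lam, axpy(-(1.0 - lam), B, A))
            children[word + "1"] = (weight * (1.0 - lam), axpy(lam, B, A))
        nodes = children
    return nodes


# Hypothetical data: a 2x2 gradient and parameters for a tree of depth 2.
grad_u = [[0.3, -0.1], [0.2, 0.5]]
params = {
    "": (0.25, [1.0, 2.0], [0.0, 1.0]),
    "0": (0.60, [0.5, -1.0], [1.0, 1.0]),
    "1": (0.10, [2.0, 0.0], [1.0, -1.0]),
}
leaves = laminate(grad_u, params, depth=2)
```

With `depth=2` the dictionary `leaves` has the four keys `"00"`, `"01"`, `"10"`, `"11"`, matching the binary words used to label the leaves.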

[Figure 2. Binary tree representation of the microstructure. The root is $A = \nabla u$; a typical pair of leaves is $A_{100}$ and $A_{101} = A_{10} + \lambda_{10}\, a_{10} \otimes n_{10}$.]

This representation of the gradient has the following desirable properties.

• If $F$ is convex in its last variable, then trivially the minimum of $\langle F(x, u, \cdot), \nu \rangle$ is attained with $b = b_0 = \cdots = 0$, $A_0 = A_1 = \cdots = \nabla u$, and in this situation the problem reduces to the classical algorithm for approximating the solution of elliptic problems using piecewise linear functions.

• Given a guess for the minimizing function $u$, minimizing with respect to the piecewise constant functions $b$, $\lambda$, $b_0$ etc. can be done in parallel over each element, suggesting the overhead associated with calculating a Young measure can be minimized by taking advantage of modern computer architectures.


4  Numerical  Results 

4.1  Computational  Considerations 

To obtain a solution of the discrete problem, simple relaxation was used in conjunction with the “numerical tricks” discussed below. The idea behind relaxation is to freeze all but one unknown, $\xi$ (a nodal value of $u$, or a $\lambda$ value for an element, etc.), and to make one Newton iteration for the Euler equation $\partial I/\partial \xi = 0$, i.e. $\xi^{n+1} = \xi^n - (\partial I/\partial \xi)/(\partial^2 I/\partial \xi^2)$. The following embellishments were required for a practical algorithm.

• Clearly it is necessary to restrict $\lambda$ to lie in $[0, 1]$. Moreover, since the algorithm degenerates when $\lambda = 0$, $\lambda = 1$, or $b = 0$, $\lambda$ was required to satisfy $\epsilon \le \lambda \le 1 - \epsilon$ for some $\epsilon > 0$ (typically $\epsilon = 10^{-6}$ or $10^{-7}$). Additionally, terms of the form $\epsilon(\lambda - 1/2)^2$ were added to the integrand to give a preferred value of $\lambda = 1/2$ when $b = 0$.

• Except for the Dirichlet data, initial values of $u = 0$, $b = 0$, and $\lambda = 1/2$ were chosen. It was observed that initially oscillations in $u$ might develop before a suitable microstructure was found (this corresponds to computing a minimizing sequence directly). In order to suppress these oscillations while the microstructure developed, an “artificial viscosity” of the form $\mu(\Delta u)^2$ was added to the integrand. In all instances, $\mu$ was set to zero for the later iterations.

• Since relaxation is a local algorithm, it is prone to “getting stuck” in local minima. It was frequently observed that microstructure would be present in one element, but not in an adjacent element. To remedy this problem, the micro-variables ($\lambda$, $b$ etc.) were substituted for those in adjacent elements. If this lowered the energy, the modified microstructure was accepted. This non-local move was very effective for avoiding local minima.

This modified relaxation algorithm was found to be very effective for the computation of global minima. As with classical relaxation for the solution of elliptic problems, convergence was slow, especially in the later iterates.
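A single relaxation update of the kind described above might look as follows. This is a minimal sketch, not the authors' implementation: the function name `newton_relax`, the use of central differences for the two derivatives, and the fallback to a small descent step when the second derivative is not positive are all assumptions made for illustration. The optional bounds play the role of the restriction $\epsilon \le \lambda \le 1 - \epsilon$.

```python
def newton_relax(energy, xi, step=1e-3, lower=None, upper=None):
    """One Newton update for dI/dxi = 0 with all other unknowns frozen.

    energy(xi) returns the discrete energy as a function of the single
    unfrozen unknown. Derivatives are approximated by central differences
    (an assumption of this sketch).
    """
    e_minus, e_0, e_plus = energy(xi - step), energy(xi), energy(xi + step)
    d1 = (e_plus - e_minus) / (2.0 * step)
    d2 = (e_plus - 2.0 * e_0 + e_minus) / step**2
    if d2 <= 0.0:
        # Not locally convex: take a small step in the descent direction.
        new = xi - step * (1.0 if d1 > 0.0 else -1.0)
    else:
        new = xi - d1 / d2
    # Optional clamping, e.g. eps <= lambda <= 1 - eps.
    if lower is not None:
        new = max(lower, new)
    if upper is not None:
        new = min(upper, new)
    return new
```

For a volume fraction one would call, for instance, `newton_relax(energy_of_lambda, lam, lower=1e-6, upper=1 - 1e-6)`, where `energy_of_lambda` is a hypothetical closure that evaluates the discrete energy with every other unknown held fixed.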

4.2  One  Dimensional  Examples 

We consider one dimensional examples of the form

$$I(u) = \int_0^1 F(u') + (u - f)^2, \qquad u(0) = u_0, \quad u(1) = u_1,$$

where $F(p) = (p^2 - 1)^2$ (see Figure 1) is the classical double well potential, and $f : [0, 1] \to R$ is specified. For one dimensional problems, it suffices to consider only one level of the binary tree, i.e.

$$u' = \lambda A_0 + (1 - \lambda) A_1, \qquad A_0 = u' - (1 - \lambda) b, \qquad A_1 = u' + \lambda b.$$
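To make the one-level representation concrete, the following sketch evaluates the two-mass energy $\lambda F(A_0) + (1-\lambda) F(A_1)$ for the double well and shows that it vanishes once both masses sit on the wells, whereas $F(u')$ itself does not. The helper names are hypothetical. Note that which well is labelled $A_0$ fixes the sign of $b$ and whether the volume fraction reads $(1 - u')/2$ or $(1 + u')/2$.

```python
def double_well(p):
    """F(p) = (p^2 - 1)^2, with wells at p = -1 and p = +1."""
    return (p * p - 1.0) ** 2


def two_mass_energy(du, lam, b, F=double_well):
    """<F, nu> for nu = lam*delta_{A0} + (1 - lam)*delta_{A1}."""
    A0 = du - (1.0 - lam) * b
    A1 = du + lam * b
    return lam * F(A0) + (1.0 - lam) * F(A1)


def masses_on_wells(du, w0=-1.0, w1=1.0):
    """(lam, b) placing A0 on w0 and A1 on w1 while keeping the mean du."""
    b = w1 - w0
    lam = (w1 - du) / b
    return lam, b


du = 0.3                               # a slope strictly between the wells
lam, b = masses_on_wells(du)           # A0 = -1, A1 = +1
relaxed = two_mass_energy(du, lam, b)  # both masses on wells
unrelaxed = double_well(du)            # single Dirac mass at du
```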


4.2.1 Example 1 (Non-Homogeneous Young’s Problem)

Setting $f(x) = (x - 1/2)^2$, the generalized solution of the variational problem is $u = f$, $\lambda = \frac{1}{2}(1 + f')$, $b = 2$. For problems of this type (i.e. $-1 < f' < 1$), it is possible to show that the discrete solutions $\{u_h\}_{h>0}$ converge to $u$ in $W^{1,p}(0, 1)$ at the optimal rate of $h$. Similarly, $\lambda_h \to \lambda$ in $L^p(0, 1)$ at optimal rate $h$. This is exhibited in Figure 3 where the $L^2(0, 1)$ and $H^1(0, 1)$ errors for $u_h$ are tabulated. The solution obtained with a 16 element mesh is shown in Figure 4a.

[Figure 4. One Dimensional Examples (16 Elements): (a) Example 1, (b) Example 2. Black = well at -1, White = well at +1.]

                 Example 1                                   Example 2
No. Elements     ||u - u_h||_L2(0,1)   ||u - u_h||_H1(0,1)   ||u - u_h||_L2(0,1)   ||u - u_h||_H1(0,1)
4                0.010423              0.144338              0.002352              0.029793
8                0.002604              0.072121              0.000582              0.015258
16               0.000654              0.036115              0.000145              0.007673
32               0.000176              0.018454              0.000037              0.003867
||u||            0.111803              0.577350              0.205711

Figure 3: Error Norms for One Dimensional Examples
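The rates implied by the tabulated errors can be estimated from successive ratios under mesh halving. The short script below is an illustrative check only; it assumes the four columns of Figure 3 are, in order, the $L^2$ and $H^1$ errors for Example 1 followed by the $L^2$ and $H^1$ errors for Example 2, and the names `ERRORS` and `observed_orders` are hypothetical.

```python
import math

# Errors transcribed from Figure 3 (rows: 4, 8, 16, 32 elements).
# The assignment of columns to norms is an assumption of this sketch.
ERRORS = {
    "Example 1, L2": [0.010423, 0.002604, 0.000654, 0.000176],
    "Example 1, H1": [0.144338, 0.072121, 0.036115, 0.018454],
    "Example 2, L2": [0.002352, 0.000582, 0.000145, 0.000037],
    "Example 2, H1": [0.029793, 0.015258, 0.007673, 0.003867],
}


def observed_orders(errors):
    """log2 of successive error ratios when the mesh size is halved."""
    return [math.log2(coarse / fine) for coarse, fine in zip(errors, errors[1:])]


if __name__ == "__main__":
    for label, errors in ERRORS.items():
        print(label, [round(p, 2) for p in observed_orders(errors)])
```

Under this reading, the $L^2$ columns decay at roughly second order and the $H^1$ columns at roughly first order in $h$.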


4.2.2  Example  2 

We consider a second less trivial example involving a “broken” extremal$^1$. The nonhomogeneous term is

$$f(x) = -\tfrac{3}{128}(x - 1/2)^5 - \tfrac{1}{3}(x - 1/2)^3,$$

and the solution, given by

$$u(x) = \begin{cases} f(x), & 0 \le x \le 1/2, \\ \tfrac{1}{24}(x - 1/2)^3 + (x - 1/2), & 1/2 \le x \le 1, \end{cases}$$

has microstructure in $(0, 1/2)$ and is “elliptic” on $(1/2, 1)$. On $(0, 1/2)$, $\lambda = \frac{1}{2}(1 + u')$ and $b = 2$. Note that the derivative of $u$ jumps from zero to one at $x = 1/2$. Figure 3 exhibits the optimal rates of convergence observed for $\{u_h\}_{h>0}$ in $L^2(0, 1)$ and $H^1(0, 1)$. The solution for a 16 element mesh is shown in Figure 4b.
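As a sanity check on the “elliptic” branch, one can verify that the stated $u$ satisfies the Euler equation of $I(u) = \int F(u') + (u - f)^2$ on $(1/2, 1)$, namely $(F'(u'))' = 2(u - f)$ with $F''(p) = 12p^2 - 4$. The sketch below does this with hand-coded derivatives; the function names are hypothetical, and the formulas for $f$ and $u$ are the ones read from the text above.

```python
def f(x):
    """Nonhomogeneous term of Example 2."""
    t = x - 0.5
    return -3.0 / 128.0 * t**5 - t**3 / 3.0


def df(x):
    """Derivative of f, used to inspect the slope at x = 1/2 from the left."""
    t = x - 0.5
    return -15.0 / 128.0 * t**4 - t**2


def u_right(x):
    """Stated solution on (1/2, 1) together with u' and u''."""
    t = x - 0.5
    return t**3 / 24.0 + t, t**2 / 8.0 + 1.0, t / 4.0


def euler_residual(x):
    """(F'(u'))' - 2(u - f) for F(p) = (p^2 - 1)^2, i.e. F''(p) = 12p^2 - 4."""
    u, du, d2u = u_right(x)
    return (12.0 * du**2 - 4.0) * d2u - 2.0 * (u - f(x))
```

Evaluating `df(0.5)` and `u_right(0.5)[1]` reproduces the jump of the derivative from zero to one at $x = 1/2$ noted in the text.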


4.3  Two  Dimensional  Example 


We consider examples of the form

$$I(u) = \int_\Omega |\nabla u - w_1|^2\, |\nabla u - w_2|^2,$$

where $w_1, w_2 \in R^2$ are the locations of the energy wells. We chose $\Omega = (0, 1)^2$ to be the unit square, and impose Dirichlet boundary conditions on $u$. Triangular meshes are constructed by dividing the region into similar squares, and dividing them in two along the diagonal with slope $-1$. We consider examples where the slope of $u$ lies on the line joining $w_1$ and $w_2$, so that the microstructure can be represented by a gray scale with $w_1$ colored black and $w_2$ colored white.

We present two examples, one being obtained from the other by a rotation of 90°. This illustrates what happens when the mesh is most favorably and least favorably aligned with the contours of the exact solution, $u$. In Figure 5a, a solution with $w_1 = (-1, -1)$ and $w_2 = (1, 1)$ is shown having exact solution

$$u(x, y) = \frac{2}{e^2 - 1}\, e^{x + y} - \frac{e^2 + 1}{e^2 - 1}\,(x + y),$$

$$\nabla u = \lambda w_1 + (1 - \lambda) w_2, \qquad \lambda(x, y) = (e^2 - e^{x + y})/(e^2 - 1).$$

$^1$This solution was suggested by Luc Tartar.


[Figure 5. Two Dimensional Example, 8x8 Mesh. (b) Black = (1,-1) well, White = (-1,1) well.]


The solution corresponding to a 90° rotation is shown in Figure 5b. Here $w_1 = (1, -1)$, $w_2 = (-1, 1)$ with exact solution

$$u(x, y) = \frac{2e}{e^2 - 1}\, e^{(x - y)} - \frac{e^2 + 1}{e^2 - 1}\,(x - y),$$

$$\nabla u = \lambda w_1 + (1 - \lambda) w_2, \qquad \lambda(x, y) = (e^{1 + x - y} - 1)/(e^2 - 1).$$

5  Concluding  Remarks 

In conclusion, the computations show that the overall approach is a useful one, and that it does produce optimal rates of convergence under mesh refinement. Certainly, more work must be done to implement the vector case, and also to improve the performance of algebraic solvers. It is hoped to address these matters in the future.

Acknowledgment: We thank David Kinderlehrer for many valuable conversations and suggestions.

Kinetically Driven Elastic Phase Boundary Motion Activated by Concurrent Dynamic Pulses$^1$

Jiehliang Lin and Thomas J. Pence
Department of Materials Science and Mechanics
Michigan State University,
East Lansing, MI 48824-1226

Abstract:  We  consider  the  behavior  of  a  phase  boundary  that  is  subjected  to 
concurrent  dynamic  pulses,  one  from  each  side,  in  the  event  that  the  phase 
boundary  motion  is  governed  by  a  simple  kinetic  relation.  The  total  energy  loss 
is  contrasted  to  that  which  would  occur  if  the  two  pulses  were  not  concurrent. 

1.  Introduction 

In one spatial dimension, nonlinearly elastic stress-strain laws that are not monotonic provide a model for stress induced diffusionless phase transformations [1975E]. In this setting the purely elastic interaction of acoustic pulses with phase boundaries is mathematically underdetermined, necessitating the consideration of additional physical effects. Possibilities include criteria which capture kinetic effects [1987T], [1990G], [1991A], [1991F], impedance effects [1991P], dissipative effects [1980J], [1986H], [1991PP], [1992P] or other phenomena not accounted for by the purely elastic theory [1983H], [1991T].

In [1992L] we considered maximally dissipative dynamical motion, meaning that phase boundaries move so as to maximize the instantaneous local rate at which purely mechanical energy is converted to nonmechanical energy. Understanding acoustic pulse reverberation processes governed by this principle focuses attention on a concurrent pulse problem, namely a situation in which two pulses, one from each side, act simultaneously on a previously stationary phase boundary. In particular, [1992L] addressed the question:

How does the total energy loss for a concurrent pulse problem governed by the maximum dissipation rate criterion (M.D.C.) compare to the combined energy loss for two subsidiary problems: one involving only the pulse which impinges from the front (governed by M.D.C.), and the other involving only the pulse which impinges from the back (also governed by M.D.C.)?

We  showed  in  [1992L]  that  the  answer  to  this  question  was  dependent  on  whether  the  two  pulses 
were  of  the  same  sign  with  respect  to  the  strains  in  the  ambient  initial  static  state,  or  whether 
instead  they  were  of  opposite  sign.  Namely,  we  concluded,  under  the  maximum  dissipation  rate 
criterion  (M.D.C.),  that  the  concurrent  pulse  encounter  suffers  the  greater  energy  loss  in  the  event 
that  both  incoming  pulses  are  of  the  same  (strain)  sign,  whereas  the  concurrent  pulse  encounter 
suffers  the  lesser  energy  loss  in  the  event  that  the  incoming  pulses  are  of  opposite  sign. 

Our purpose here is to consider the same question for the case in which the interaction dynamics are not governed by the maximum dissipation rate criterion, but are instead governed by a linear kinetic relation (L.K.R.) between the driving traction on the phase boundary and the phase boundary velocity. This linear kinetic relation is given in (3.4). Thus we address the question

How does the total energy loss for a concurrent pulse problem governed by a linear kinetic relation (L.K.R.) compare to the combined energy loss for two subsidiary problems: one involving only the pulse which impinges from the front (governed by L.K.R.), and the other involving only the pulse which impinges from the back (also governed by L.K.R.)?

We show that the latter is greater than the former if the incoming pulses are of opposite sign, which is like the result obtained in [1992L] for the M.D.C. case. In the event that the incoming pulses are of the same sign, and if each pulse is sufficiently small, we find that the former is greater than the latter. However we also find numerically that certain situations involving sufficiently large pulses of the same sign result in the concurrent pulse encounter suffering the lesser energy loss.

1. Supported by the U.S. Army Research Office under contract DAAL0?-89-G-(X)89.

2.  Families  of  Solutions  to  the  Concurrent  Pulse  Problem 

Since the framework for the problem addressed here does not depart from that addressed in [1992L] until after the invocation of the particular resolution of the nonuniqueness issue, it follows that the families of solutions to the concurrent pulse problem here are identical to those obtained in [1992L]. Accordingly, we here repeat, verbatim for completeness, the derivation of these solution families as previously given in [1992L]:

Let $\tau$, $\gamma$ and $v$ denote respectively stress, strain and particle velocity. Following [1991P], we consider a layer, $0 < x < h$, composed of an elastic material whose stress-strain behavior in one dimension is given by


$$\tau = \hat\tau(\gamma) = \begin{cases} c^2 \gamma & \text{for } 0 \le \gamma \le \gamma_M, \\ \tau_u(\gamma) & \text{for } \gamma_M \le \gamma \le \gamma_m, \\ c^2 \gamma + d & \text{for } \gamma \ge \gamma_m, \end{cases} \qquad \hat\tau(-\gamma) = -\hat\tau(\gamma), \tag{2.1}$$

where $c$ and $d$ are constants and $\tau_u(\gamma)$ is a smooth decreasing function that renders $\hat\tau(\gamma)$ continuous. The layer is assumed to be initially pre-stressed in equilibrium, so that $v = 0$, with a single phase boundary at $x = s_0$ separating high strain phase with $\gamma = \gamma_b$ in $x < s_0$ from low strain phase with $\gamma = \gamma_a$ in $x > s_0$. The strain values $\gamma_a$ and $\gamma_b$ are taken to be the well known Maxwell strains which have the geometrical interpretation of cutting off equal areas on the stress strain curve (Fig. 1). An immediate consequence is that the initial configuration is one of minimum energy [1975E] for the prevailing boundary conditions governing the initial equilibrium configuration. Any subsequent change in the boundary conditions will give rise to changes in the strain and velocity fields governed by the equations$^2$

$$v_x - \gamma_t = 0, \qquad \hat\tau'(\gamma)\, \gamma_x - v_t = 0. \tag{2.2}$$

2. Primes and subscripts denote differentiation in the usual fashion. Note also that we have taken the density to be equal to one in (2.2)$_2$.

Fig. 1. Stress-strain constitutive response as described by (2.1). The descending portion of such a constitutive response function is associated with unstable material behavior [1975E]. The strain intervals $[-\gamma_M, \gamma_M]$, $[\gamma_M, \gamma_m]$, $[\gamma_m, \infty)$ correspond respectively to a low strain phase, an unstable phase and a high strain phase.

In particular, we consider a loading condition at $x = 0$ that gives rise to a square wave pulse with strain $\gamma_b + \Delta\gamma_1$ over a time interval $t_b$, and a loading condition at $x = h$ that gives rise to a square wave pulse with strain $\gamma_a + \Delta\gamma_2$ over a time interval $t_a$. We shall not concern ourselves with the specific loading conditions needed to generate these pulses, nor with restrictions upon $\Delta\gamma_1$ and $\Delta\gamma_2$ necessary to ensure compatibility with (2.1) other than to note that these issues can be treated in a systematic fashion [1991P]. According to (2.2), each pulse will travel toward the phase boundary with speed $c$; furthermore the right moving pulse has width $ct_b$ and particle velocity given by $-c\Delta\gamma_1$, while the left moving pulse has width $ct_a$ and particle velocity given by $c\Delta\gamma_2$. The encounter of such a right moving pulse with the phase boundary is treated in [1991P] on the assumption that the encounter ends before the arrival of any pulse from the other side. Our purpose here, however, is to study such a concurrent encounter. There are four generic cases: (rr), (rl), (lr), (ll), where (rr) denotes the case where the right moving pulse (with strain increment $\Delta\gamma_1$) encounters the phase boundary first and also terminates last, (rl) denotes the case where the right moving pulse encounters the phase boundary first, but the encounter with the left moving pulse terminates last, and the remaining two cases are defined accordingly. For the remainder of this section, and also for Section 3, we shall restrict attention to the (rl) case. There are then three distinct interaction periods: $\Pi_1$ in which only the right moving pulse encounters the phase boundary, $\Pi_{con}$ in which both pulses encounter the phase boundary concurrently, and $\Pi_2$ in which only the left moving pulse encounters the phase boundary. Figure 2 diagrams these encounters in the (x,t)-plane. According to this figure, the following additional assumptions are also implicit in our treatment: (A1) the phase boundary remains at rest unless acted on by a pulse, (A2) phase transitions take place only by movement of the pre-existing phase boundary, and (A3) the phase boundary velocity is constant during each of the three interaction periods and these three phase boundary velocities obey

$$-c < \dot s_1 < c, \qquad -c < \dot s_{con} < c, \qquad -c < \dot s_2 < c. \tag{2.3}$$

Further discussion of these issues can be found in [1991P]. In addition we have depicted the phase boundary as coming to rest after the complete encounter has ended, in which case the fields return to their initial conditions on each side of the since displaced phase boundary.

Fig. 2. Concurrent encounter of a right moving shear pulse, a left moving shear pulse and a phase boundary. The shear strain $\gamma$ and velocity $v$ in these incoming pulses and the generated pulses are denoted by $\langle \gamma, v \rangle$. The characteristic curves are indicated by dashed line segments.

In Figure 2, the subscripts $T1$ and $R1$ denote the fields in the transmitted and reflected pulses associated with interaction period $\Pi_1$. In addition, the (x,t)-domain with combined incoming and reflected pulse during the interaction period $\Pi_1$ is denoted by subscript $S1$. A similar convention is followed for subscripts $T2$, $R2$, and $S2$ for the pulses associated with interaction period $\Pi_2$. The fields associated with the combination of $T1$ and the incoming pulse characterized by $\Delta\gamma_2$ are denoted by subscript $T1I2$. Finally, there are four additional (x,t)-domains associated with pulses that arise as a consequence of the concurrent interaction period $\Pi_{con}$; these are denoted by the subscripts $S1T2$, $R1T2$, $T1S2$, and $T1R2$. A consequence of (A3), (2.1) and (2.2) is that the values of strain and particle velocity are individually constant on the individual (x,t)-domains associated with the 11 symbols $T1$, $R1$, $S1$, $T1I2$, $T2$, $R2$, $S2$, $S1T2$, $R1T2$, $T1S2$, and $T1R2$. The corresponding 22 unknown values for strain and velocity, in conjunction with the three unknown phase boundary velocities $\dot s_1$, $\dot s_{con}$, and $\dot s_2$, comprise the unknown quantities in the complete encounter problem. Relations connecting these 25 unknown values to the parameters $c$, $\gamma_b$, $\gamma_a$, $\Delta\gamma_1$, $\Delta\gamma_2$ which characterize the material, initial conditions, and loading conditions follow from the theory of Riemann invariants as applied to (2.1), (2.2). In particular, this gives that $v - c\gamma$ is constant on any line segment with slope $dx/dt = c$ in the (x,t)-plane provided that it does not cross the phase boundary. Similarly $v + c\gamma$ is constant on all line segments with slope $dx/dt = -c$ that do not cross the phase boundary. Each of these Riemann invariant conditions generates 8 algebraic equations relating $\{\gamma, v\}$ pairs between contiguous (x,t)-domains; the associated connecting line segments are denoted by $K_1^+, \ldots, K_8^+$ and $K_1^-, \ldots, K_8^-$ in Figure 2. Across the phase boundary, the two Rankine-Hugoniot conditions

$$[[v]] = -\dot s\, [[\gamma]], \qquad [[\tau]] = -\dot s\, [[v]] \tag{2.4}$$

associated with (2.1), (2.2) are required to hold. These give rise to an additional 6 algebraic equations, 2 for each of the 3 interaction periods $\Pi_1$, $\Pi_{con}$, and $\Pi_2$. Thus in total there are 22 equations relating the 25 unknown values. Regarding the three phase boundary velocities as parameters, the 22 equations are linear in the 22 strain and particle velocities. The resulting 22x22 coefficient matrix is nonsingular provided that none of the three phase boundary velocities $\dot s_1$, $\dot s_{con}$, and $\dot s_2$ take on the values $c$ or $-c$. Hence (2.3) ensures that the system can be inverted. Certain simplifications are achieved in the resulting problem due to various uncouplings (i.e. zero blocks in the coefficient matrix). For example $\{\gamma_{S1}, v_{S1}\}$ and $\{\gamma_{T1}, v_{T1}\}$ can be found from the 2 Riemann invariant conditions associated with $K_1^+$ and $K_1^-$, along with the 2 Rankine-Hugoniot conditions associated with interaction period $\Pi_1$. The resulting 22 field quantities are thus found to be given by:


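$$\begin{aligned}
\gamma_{S1} &= \gamma_b + \Delta\gamma_1 - \frac{(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 + c)}, & v_{S1} &= -c\Delta\gamma_1 - \frac{c(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 + c)}, \\
\gamma_{R1} &= \gamma_b - \frac{(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 + c)}, & v_{R1} &= -\frac{c(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 + c)}, \\
\gamma_{T1} &= \gamma_a + \Delta\gamma_1 + \frac{(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 - c)}, & v_{T1} &= -c\Delta\gamma_1 - \frac{c(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 - c)}, \\
\gamma_{T1I2} &= \gamma_a + \Delta\gamma_1 + \Delta\gamma_2 + \frac{(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 - c)}, & v_{T1I2} &= -c\Delta\gamma_1 + c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_1}{2(\dot s_1 - c)}, \\
\gamma_{S1T2} &= \gamma_b + \Delta\gamma_1 + \Delta\gamma_2 - \frac{(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} + c)}, & v_{S1T2} &= -c\Delta\gamma_1 + c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} + c)}, \\
\gamma_{T1S2} &= \gamma_a + \Delta\gamma_1 + \Delta\gamma_2 + \frac{(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} - c)}, & v_{T1S2} &= -c\Delta\gamma_1 + c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} - c)}, \\
\gamma_{R1T2} &= \gamma_b + \Delta\gamma_2 - \frac{(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} + c)}, & v_{R1T2} &= c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} + c)}, \\
\gamma_{T1R2} &= \gamma_a + \Delta\gamma_1 + \frac{(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} - c)}, & v_{T1R2} &= -c\Delta\gamma_1 - \frac{c(\gamma_b - \gamma_a)\dot s_{con}}{2(\dot s_{con} - c)}, \\
\gamma_{S2} &= \gamma_a + \Delta\gamma_2 + \frac{(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 - c)}, & v_{S2} &= c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 - c)}, \\
\gamma_{R2} &= \gamma_a + \frac{(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 - c)}, & v_{R2} &= -\frac{c(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 - c)}, \\
\gamma_{T2} &= \gamma_b + \Delta\gamma_2 - \frac{(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 + c)}, & v_{T2} &= c\Delta\gamma_2 - \frac{c(\gamma_b - \gamma_a)\dot s_2}{2(\dot s_2 + c)}.
\end{aligned} \tag{2.5}$$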


The phase boundary velocities $\dot s_1$, $\dot s_{con}$, and $\dot s_2$ are undetermined by the above procedure and can be regarded as parametrizing all possible solutions. In addition to (2.3), various additional restrictions upon the phase boundary velocities will arise due to the requirement that the strain values in (2.5) remain confined to the intervals associated with the different branches of the stress-strain relation (2.1). The net effect of these considerations is to generate additional inequality constraints beyond (2.3) on the phase boundary velocities. The totality of inequality constraints are not mutually exclusive provided that $\Delta\gamma_1$ and $\Delta\gamma_2$ are sufficiently small. If, however, $\Delta\gamma_1$ and $\Delta\gamma_2$ are large, then mutual exclusivity may prevail (see [1991P]). We shall henceforth assume that we are dealing with values of $\Delta\gamma_1$ and $\Delta\gamma_2$ which do not give rise to mutual exclusivity so that three non-empty parametrization intervals, $\mathcal{I}_1$, $\mathcal{I}_{con}$, and $\mathcal{I}_2$, exist for the three phase boundary velocities.
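The single-pulse part of this solution family can be checked numerically. The sketch below takes closed forms for the $S1$ and $T1$ fields (these are our reading of the corresponding entries of (2.5), so they should be treated as an assumption) and evaluates the residuals of the two Riemann invariant conditions and the two Rankine-Hugoniot conditions (2.4) for an arbitrary admissible $\dot s_1$. The function names are hypothetical, and the relation $d = -c^2(\gamma_b - \gamma_a)$ is inferred from equality of the stresses at the two Maxwell strains.

```python
def single_pulse_fields(c, gamma_a, gamma_b, dgamma1, s_dot):
    """(gamma, v) on both sides of the boundary during period Pi_1."""
    jump = gamma_b - gamma_a
    g_S1 = gamma_b + dgamma1 - jump * s_dot / (2.0 * (s_dot + c))
    g_T1 = gamma_a + dgamma1 + jump * s_dot / (2.0 * (s_dot - c))
    v_S1 = -c * dgamma1 - c * jump * s_dot / (2.0 * (s_dot + c))
    v_T1 = -c * dgamma1 - c * jump * s_dot / (2.0 * (s_dot - c))
    return (g_S1, v_S1), (g_T1, v_T1)


def residuals(c, gamma_a, gamma_b, dgamma1, s_dot):
    """Residuals of the two Riemann-invariant and two jump conditions."""
    (g_S, v_S), (g_T, v_T) = single_pulse_fields(c, gamma_a, gamma_b, dgamma1, s_dot)
    d = -c * c * (gamma_b - gamma_a)  # equal stress at the Maxwell strains
    tau_S = c * c * g_S + d           # high strain branch (left of boundary)
    tau_T = c * c * g_T               # low strain branch (right of boundary)
    return (
        # v - c*gamma carried from the incoming pulse into S1
        (v_S - c * g_S) - (-c * dgamma1 - c * (gamma_b + dgamma1)),
        # v + c*gamma carried from the undisturbed right state into T1
        (v_T + c * g_T) - c * gamma_a,
        # [[v]] = -s_dot [[gamma]]
        (v_T - v_S) + s_dot * (g_T - g_S),
        # [[tau]] = -s_dot [[v]]
        (tau_T - tau_S) + s_dot * (v_T - v_S),
    )
```

All four residuals vanish (to rounding) for any $-c < \dot s_1 < c$, which reflects the statement that the phase boundary velocity is left undetermined by the purely elastic theory.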


3.  Solutions  Obeying  a  Linear  Kinetic  Relation  for  the  (rl)-case 

As mentioned in the Introduction, the freedom to determine the phase boundary velocity allows the theory to accommodate additional requirements upon conditions which govern phase boundary motion. It is at this juncture that the treatment given here departs from that given in [1992L] which considered the maximum dissipation criterion (M.D.C.) as the method for resolving the nonuniqueness issue. Here we instead examine a different operative condition, namely a linear kinetic relation (L.K.R.). Recall that the motion of a phase boundary gives rise to a change in the total mechanical energy stored in the mechanical fields [1980J]. In particular, if $\gamma^+ = \gamma(s(t)+, t)$ and $\gamma^- = \gamma(s(t)-, t)$ are the strains directly adjacent to the phase boundary, then the energy loss rate, or dissipation rate, is given by

$$D(t) = f(t)\, \dot s(t), \qquad f(t) = \int_{\gamma^-}^{\gamma^+} \hat\tau(\gamma)\, d\gamma - \frac{1}{2}\left(\hat\tau(\gamma^+) + \hat\tau(\gamma^-)\right)(\gamma^+ - \gamma^-), \tag{3.1}$$

where $f(t)$ can be interpreted as the phase boundary traction [1991A]. For the concurrent pulse problem, with strains as given in (2.5), one finds that the dissipation rate and phase boundary driving tractions during the three interaction periods are given by

$$\begin{aligned}
D_1 &= -\frac{c^2}{2\dot s_1}\left\{(c^2 - \dot s_1^2)\left\{(\gamma_{T1} - \gamma_a)^2 - (\gamma_{S1} - \gamma_b)^2\right\}\right\}, & f_1 &= \frac{D_1}{\dot s_1}, \\
D_{con} &= -\frac{c^2}{2\dot s_{con}}\left\{(c^2 - \dot s_{con}^2)\left\{(\gamma_{T1S2} - \gamma_a)^2 - (\gamma_{S1T2} - \gamma_b)^2\right\}\right\}, & f_{con} &= \frac{D_{con}}{\dot s_{con}}, \\
D_2 &= -\frac{c^2}{2\dot s_2}\left\{(c^2 - \dot s_2^2)\left\{(\gamma_{S2} - \gamma_a)^2 - (\gamma_{T2} - \gamma_b)^2\right\}\right\}, & f_2 &= \frac{D_2}{\dot s_2},
\end{aligned} \tag{3.2}$$

where use has been made of the special equal area property of the Maxwell strains $\gamma_a$ and $\gamma_b$ which characterize the initial configuration.

A kinetic relation is now assumed to govern the motion of the phase boundary. As discussed in [1991A], one form for such a kinetic relation is an additional relation between phase boundary velocity $\dot s$ and the phase boundary driving traction $f$:

$$\dot s = F(f) \tag{3.3}$$

where the new constitutive function $F(f)$ is required to obey $f \cdot F(f) \ge 0$ in order to deliver $D(t) \ge 0$ and hence ensure compatibility with a purely mechanical version of the second law of thermodynamics that is appropriate in the present setting. In this study we shall consider the case of a linear kinetic relation $F(f) = kf$ where the phase boundary mobility $k$ is a positive constant so that (3.3) becomes

$$\dot s = kf \qquad (k > 0). \tag{3.4}$$

Hence, entering (3.4) with (3.2) and (2.5) one obtains the following implicit equations for $\dot s_1$, $\dot s_{con}$, and $\dot s_2$:

$$\begin{aligned}
\Delta\gamma_1 &= \frac{\dot s_1}{k c^2 (\gamma_b - \gamma_a)} + \frac{(\gamma_b - \gamma_a)\, c\, \dot s_1}{2(c^2 - \dot s_1^2)}, \\
\Delta\gamma_1 + \Delta\gamma_2 &= \frac{\dot s_{con}}{k c^2 (\gamma_b - \gamma_a)} + \frac{(\gamma_b - \gamma_a)\, c\, \dot s_{con}}{2(c^2 - \dot s_{con}^2)}, \\
\Delta\gamma_2 &= \frac{\dot s_2}{k c^2 (\gamma_b - \gamma_a)} + \frac{(\gamma_b - \gamma_a)\, c\, \dot s_2}{2(c^2 - \dot s_2^2)}.
\end{aligned} \tag{3.5}$$

Each of the equations (3.5) admits a unique solution obeying (2.3) which we shall denote by $\dot s_1^{(kr)}$, $\dot s_{con}^{(kr)}$, and $\dot s_2^{(kr)}$. For a given pair $(\Delta\gamma_1, \Delta\gamma_2)$ it may or may not be the case that $\dot s_1^{(kr)} \in \mathcal{I}_1$, $\dot s_{con}^{(kr)} \in \mathcal{I}_{con}$, and $\dot s_2^{(kr)} \in \mathcal{I}_2$, however in what follows we shall assume that these inclusions hold. Thus we deal with a set in the $(\Delta\gamma_1, \Delta\gamma_2)$-plane which ensures that $\dot s_1^{(kr)} \in \mathcal{I}_1$, $\dot s_{con}^{(kr)} \in \mathcal{I}_{con}$, and $\dot s_2^{(kr)} \in \mathcal{I}_2$ for the remainder of this communication, even though we do not here investigate the extent of this set.

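We now introduce normalized phase boundary velocities, normalized pulse strain increments and normalized mobility $\bar k$ as follows:

$$\bar s = \frac{\dot s}{c}, \qquad \Delta\bar\gamma = \frac{\Delta\gamma}{\gamma_b - \gamma_a}, \qquad \bar k = k(\gamma_b - \gamma_a)^2 c, \tag{3.6}$$

where subscripted and superscripted quantities such as $\bar s_{con}^{(kr)}$ are defined in the obvious fashion by these same normalizations. Note that $-1 < \bar s < 1$. Then the associated values for $D_1$, $D_{con}$, and $D_2$, which will be denoted by $D_1^{(kr)}$, $D_{con}^{(kr)}$, and $D_2^{(kr)}$, are given by

$$\begin{aligned}
D_1^{(kr)} &= \frac{(\dot s_1^{(kr)})^2}{k} = \frac{c^3(\gamma_b - \gamma_a)^2}{\bar k}\, (\bar s_1^{(kr)})^2, \\
D_{con}^{(kr)} &= \frac{(\dot s_{con}^{(kr)})^2}{k} = \frac{c^3(\gamma_b - \gamma_a)^2}{\bar k}\, (\bar s_{con}^{(kr)})^2, \\
D_2^{(kr)} &= \frac{(\dot s_2^{(kr)})^2}{k} = \frac{c^3(\gamma_b - \gamma_a)^2}{\bar k}\, (\bar s_2^{(kr)})^2,
\end{aligned} \tag{3.7}$$

where

$$\bar s_1^{(kr)} = \bar S(\Delta\bar\gamma_1), \qquad \bar s_{con}^{(kr)} = \bar S(\Delta\bar\gamma_1 + \Delta\bar\gamma_2), \qquad \bar s_2^{(kr)} = \bar S(\Delta\bar\gamma_2), \tag{3.8}$$

and $\bar S(\Delta\bar\gamma)$ is defined for all real $\Delta\bar\gamma$ as the unique root, within the interval $-1 < \bar s = \bar S(\Delta\bar\gamma) < 1$, to the equation

$$\bar S^3 - \bar k \Delta\bar\gamma\, \bar S^2 - \left(1 + \frac{\bar k}{2}\right) \bar S + \bar k \Delta\bar\gamma = 0, \qquad ((\Delta\bar\gamma = 0) \Rightarrow (\bar S = 0)). \tag{3.9}$$

From (3.5) and (3.8) one also obtains

$$\bar S(\Delta\bar\gamma) \begin{cases} > 0, & \text{when } \Delta\bar\gamma > 0, \\ = 0, & \text{when } \Delta\bar\gamma = 0, \\ < 0, & \text{when } \Delta\bar\gamma < 0. \end{cases} \tag{3.10}$$

At the same time one finds that the first derivatives and second derivatives of $\bar S(\Delta\bar\gamma)$ obey

$$\bar S'(\Delta\bar\gamma) \equiv \frac{d}{d\Delta\bar\gamma}\left(\bar S(\Delta\bar\gamma)\right) > 0, \quad \text{for all } \Delta\bar\gamma, \tag{3.11}$$

and

$$\bar S''(\Delta\bar\gamma) \equiv \frac{d^2}{d(\Delta\bar\gamma)^2}\, \bar S(\Delta\bar\gamma) \begin{cases} < 0, & \text{when } \Delta\bar\gamma > 0, \\ = 0, & \text{when } \Delta\bar\gamma = 0, \\ > 0, & \text{when } \Delta\bar\gamma < 0. \end{cases} \tag{3.12}$$

For small $\Delta\bar\gamma$ one obtains from (3.9) that

$$\bar S(\Delta\bar\gamma) = \left[\frac{2\bar k}{2 + \bar k}\right] \Delta\bar\gamma + O((\Delta\bar\gamma)^2). \tag{3.13}$$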
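The root $\bar S(\Delta\bar\gamma)$ of the cubic (3.9) is easy to compute numerically. The following sketch uses plain bisection on $(-1, 1)$; the choice of bisection and the names `cubic` and `S` are ours, not the paper's. It relies on the observation that, for $\bar k > 0$, the cubic takes the value $\bar k/2 > 0$ at $\bar s = -1$ and $-\bar k/2 < 0$ at $\bar s = +1$ for every $\Delta\bar\gamma$, so a sign change is always bracketed.

```python
def cubic(s, dgamma, k):
    """Left side of (3.9) in normalized variables."""
    return s**3 - k * dgamma * s**2 - (1.0 + 0.5 * k) * s + k * dgamma


def S(dgamma, k, tol=1e-14):
    """Root of (3.9) inside (-1, 1), found by bisection.

    The cubic is positive at s = -1 and negative at s = +1 for k > 0,
    so the bracket [-1, 1] always contains the root.
    """
    lo, hi = -1.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cubic(mid, dgamma, k) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Slope at the origin, to compare with the coefficient 2k/(2 + k) in (3.13).
k_bar = 1.0
slope = S(1e-4, k_bar) / 1e-4
```

For `k_bar = 1.0` the computed slope is close to $2/3$, and for `k_bar = 2.0` it is close to $1$.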

In contrast, for small $\Delta\bar\gamma$ under the maximum dissipation criterion, one finds from (3.8) of [1992L] that $\bar S(\Delta\bar\gamma) = \Delta\bar\gamma + O((\Delta\bar\gamma)^2)$, which coincides with (3.13) in the event that $\bar k = 2$. This indicates that, for infinitesimal pulses, the dynamical theory based upon the maximally dissipative solution criterion should be quantitatively similar to the dynamical theory based on the linear kinetic relation (3.4) with $\bar k = 2$.

4. Linear Kinetic Relation Solutions for the Concurrent Pulse in General

The (rr), (lr), and (ll)-cases can be treated in a similar fashion. In all cases, formulae (3.7)$_2$ and (3.8)$_2$ hold during the genuinely concurrent part of the encounter. If and when a portion of the encounter only involves the right moving pulse with strain increment $\Delta\gamma_1$ then (3.7)$_1$ and (3.8)$_1$ hold, whereas (3.7)$_3$ and (3.8)$_3$ govern those portions of any encounters that involve only the left moving pulse with strain increment $\Delta\gamma_2$. In order to determine which of the four possible cases is that which occurs, let

$$q = \frac{h}{2} - s_0. \tag{4.1}$$

Then one finds that the four cases occur according to

$$\begin{aligned}
(rl):\ & q > 0, \quad c t_a (c + \dot s_1^{(kr)})(c - \dot s_{con}^{(kr)}) - c t_b (c + \dot s_1^{(kr)})(c + \dot s_{con}^{(kr)}) + 2q (c - \dot s_1^{(kr)})(c + \dot s_{con}^{(kr)}) > 0, \\
(rr):\ & q > 0, \quad c t_a (c + \dot s_1^{(kr)})(c - \dot s_{con}^{(kr)}) - c t_b (c + \dot s_1^{(kr)})(c + \dot s_{con}^{(kr)}) + 2q (c - \dot s_1^{(kr)})(c + \dot s_{con}^{(kr)}) < 0, \\
(lr):\ & q < 0, \quad c t_a (c - \dot s_2^{(kr)})(c - \dot s_{con}^{(kr)}) - c t_b (c - \dot s_2^{(kr)})(c + \dot s_{con}^{(kr)}) + 2q (c + \dot s_2^{(kr)})(c - \dot s_{con}^{(kr)}) < 0, \\
(ll):\ & q < 0, \quad c t_a (c - \dot s_2^{(kr)})(c - \dot s_{con}^{(kr)}) - c t_b (c - \dot s_2^{(kr)})(c + \dot s_{con}^{(kr)}) + 2q (c + \dot s_2^{(kr)})(c - \dot s_{con}^{(kr)}) > 0.
\end{aligned} \tag{4.2}$$

5. Energy Loss for the Linear Kinetic Relation Solution in the (rl)-case

In this section we begin the examination of the question raised in the Introduction. For the (rl)-case discussed in Sections 2 and 3, let $t_1^{(kr)}$, $t_{con}^{(kr)}$, and $t_2^{(kr)}$ denote the time duration of the encounters associated with the interaction periods $\Pi_1$, $\Pi_{con}$, and $\Pi_2$. These quantities are given in terms of $s_1^{(kr)}$, $s_{con}^{(kr)}$, and $s_2^{(kr)}$ as

\[
\begin{aligned}
t_1^{(kr)} &= \frac{2q}{c + s_1^{(kr)}},\\
t_{con}^{(kr)} &= \frac{c t_b (c + s_1^{(kr)}) - 2q (c - s_1^{(kr)})}{(c + s_1^{(kr)})(c - s_{con}^{(kr)})},\\
t_2^{(kr)} &= \frac{c t_a (c + s_1^{(kr)})(c - s_{con}^{(kr)}) - c t_b (c + s_1^{(kr)})(c + s_{con}^{(kr)}) + 2q (c - s_1^{(kr)})(c + s_{con}^{(kr)})}{(c + s_1^{(kr)})(c - s_{con}^{(kr)})(c + s_2^{(kr)})},
\end{aligned}
\tag{5.1}
\]

so that the total energy loss for the complete encounter process governed by (L.K.R.) is

\[
\Delta E^{(kr)} = D_1^{(kr)} t_1^{(kr)} + D_{con}^{(kr)} t_{con}^{(kr)} + D_2^{(kr)} t_2^{(kr)}.
\tag{5.2}
\]

We now turn to consider the energy loss which would accompany two subsidiary problems.

The first problem is that in which only the right moving pulse, associated with strain increment $\Delta\gamma_1$, impinges upon the phase boundary. The encounter dynamics are again assumed to be governed by (L.K.R.). The phase boundary velocity and dissipation rate for this problem can simply be found by setting $\Delta\gamma_2 = 0$ in the previous development; consequently they are given by $s_1^{(kr)}$ and $D_1^{(kr)}$. Similarly, the second problem is that in which only the left moving pulse, associated with strain increment $\Delta\gamma_2$, impinges upon the phase boundary with encounter dynamics governed by (L.K.R.). Hence the phase boundary velocity and dissipation rate in this problem are given by $s_2^{(kr)}$ and $D_2^{(kr)}$. It is, however, important to note that the time durations of the encounters are not given by $t_1^{(kr)}$ and $t_2^{(kr)}$, but rather are each of a longer duration due to the additional interaction time which was taken by the concurrent pulse in the original problem. These additional interaction times will be denoted respectively by $\delta t_1^{(kr)}$ and $\delta t_2^{(kr)}$; they are given by (see Figure 3):


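\[
\begin{aligned}
\delta t_1^{(kr)} &= \frac{c t_b (c + s_1^{(kr)}) - 2q (c - s_1^{(kr)})}{(c + s_1^{(kr)})(c - s_1^{(kr)})},\\
\delta t_2^{(kr)} &= \frac{c t_b (c + s_1^{(kr)})(c + s_{con}^{(kr)}) - 2q (c - s_1^{(kr)})(c + s_{con}^{(kr)})}{(c + s_1^{(kr)})(c - s_{con}^{(kr)})(c + s_2^{(kr)})},
\end{aligned}
\tag{5.3}
\]

so that the total energy losses in the two subsidiary problems are given by

\[
\begin{aligned}
\Delta E^{(kr)}_{\Delta\gamma_1\,\mathrm{only}} &= D_1^{(kr)} \bigl(t_1^{(kr)} + \delta t_1^{(kr)}\bigr),\\
\Delta E^{(kr)}_{\Delta\gamma_2\,\mathrm{only}} &= D_2^{(kr)} \bigl(t_2^{(kr)} + \delta t_2^{(kr)}\bigr).
\end{aligned}
\tag{5.4}
\]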


Consequently, the difference in the energy loss between the original problem and the combined energy loss for the two subsidiary problems is given by$^3$

\[
\Upsilon_{(rl)} \equiv \Delta E^{(kr)} - \bigl(\Delta E^{(kr)}_{\Delta\gamma_1\,\mathrm{only}} + \Delta E^{(kr)}_{\Delta\gamma_2\,\mathrm{only}}\bigr)
= D_{con}^{(kr)} t_{con}^{(kr)} - D_1^{(kr)} \delta t_1^{(kr)} - D_2^{(kr)} \delta t_2^{(kr)}.
\tag{5.5}
\]

Fig. 3. The (rl) concurrent encounter.

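In order to develop a simple expression for $\Upsilon_{(rl)}$ it is convenient to introduce

\[
t_d \equiv t_b - \frac{2q (c - s_1^{(kr)})}{c\,(c + s_1^{(kr)})} > 0,
\tag{5.6}
\]

where $t_d > 0$ follows either from $t_{con}^{(kr)} > 0$ or else from its interpretation as a 'projected time' given in Figure 3. The interaction times $t_{con}^{(kr)}$, $\delta t_1^{(kr)}$ and $\delta t_2^{(kr)}$ are then given by

\[
t_{con}^{(kr)} = \frac{1}{1 - \hat{s}_{con}^{(kr)}}\, t_d, \qquad
\delta t_1^{(kr)} = \frac{1}{1 - \hat{s}_1^{(kr)}}\, t_d, \qquad
\delta t_2^{(kr)} = \frac{1 + \hat{s}_{con}^{(kr)}}{(1 - \hat{s}_{con}^{(kr)})(1 + \hat{s}_2^{(kr)})}\, t_d.
\tag{5.7}
\]

Substituting from (5.7) into (5.5) and using (3.7) yields

\[
\Upsilon_{(rl)} = [\,\cdots\,]\;\Phi(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}),
\tag{5.8}
\]

where

\[
\Phi(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}) =
\frac{(\hat{s}_{con}^{(kr)})^2}{1 - \hat{s}_{con}^{(kr)}}
- \frac{(\hat{s}_1^{(kr)})^2}{1 - \hat{s}_1^{(kr)}}
- \frac{(\hat{s}_2^{(kr)})^2\,(1 + \hat{s}_{con}^{(kr)})}{(1 + \hat{s}_2^{(kr)})(1 - \hat{s}_{con}^{(kr)})}.
\tag{5.9}
\]

In view of (3.8) we define

\[
\Phi(\Delta\gamma_1, \Delta\gamma_2) \equiv \Phi\bigl(S(\Delta\gamma_1), S(\Delta\gamma_1 + \Delta\gamma_2), S(\Delta\gamma_2)\bigr) = \Phi(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}).
\tag{5.10}
\]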
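The interaction times in (5.1), (5.3), (5.6) and (5.7) can be cross-checked numerically. The following is only a sketch under stated assumptions: the normalized speed is taken to be $\hat{s} = s/c$, the function name `encounter_times` is ours, and the parameter values are hypothetical numbers chosen so that the (rl) conditions hold.

```python
import math


def encounter_times(c, q, t_a, t_b, s1, scon, s2):
    """Evaluate the times of (5.1) and (5.3) for unnormalized speeds s1, scon, s2."""
    t1 = 2 * q / (c + s1)
    tcon = (c * t_b * (c + s1) - 2 * q * (c - s1)) / ((c + s1) * (c - scon))
    t2 = (
        c * t_a * (c + s1) * (c - scon)
        - c * t_b * (c + s1) * (c + scon)
        + 2 * q * (c - s1) * (c + scon)
    ) / ((c + s1) * (c - scon) * (c + s2))
    dt1 = (c * t_b * (c + s1) - 2 * q * (c - s1)) / ((c + s1) * (c - s1))
    dt2 = (
        c * t_b * (c + s1) * (c + scon) - 2 * q * (c - s1) * (c + scon)
    ) / ((c + s1) * (c - scon) * (c + s2))
    return t1, tcon, t2, dt1, dt2


# Hypothetical sample data (not taken from the paper).
c, q, t_a, t_b = 1.0, 0.1, 2.0, 1.0
s1, scon, s2 = 0.2, 0.3, 0.1

t1, tcon, t2, dt1, dt2 = encounter_times(c, q, t_a, t_b, s1, scon, s2)

# Projected time of (5.6).
t_d = t_b - 2 * q * (c - s1) / (c * (c + s1))

# (5.7), with normalized speeds assumed to be s / c.
h1, hcon, h2 = s1 / c, scon / c, s2 / c
assert math.isclose(tcon, t_d / (1 - hcon))
assert math.isclose(dt1, t_d / (1 - h1))
assert math.isclose(dt2, (1 + hcon) / ((1 - hcon) * (1 + h2)) * t_d)

# All three encounter periods are positive for this (rl) sample.
assert min(t1, tcon, t2) > 0
```

With these formulas the single-pulse durations also collapse to simple expressions, $t_1 + \delta t_1 = c\,t_b/(c - s_1)$ and $t_2 + \delta t_2 = c\,t_a/(c + s_2)$, which gives a further consistency check on the reconstruction of the time intervals.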


Thus the question posed in the Introduction reduces, in the (rl)-case, to a determination of the sign of $\Phi$. We note that if either $\Delta\gamma_1 = 0$ or $\Delta\gamma_2 = 0$ then the corresponding subsidiary problem is trivial and the remaining subsidiary problem is identical to the concurrent pulse problem, thus

\[
\Phi(\Delta\gamma_1, \Delta\gamma_2) = 0, \quad \text{if } \Delta\gamma_1 = 0 \text{ or if } \Delta\gamma_2 = 0.
\tag{5.11}
\]

Analytically, the $\Delta\gamma_1 = 0$ case of (5.11) follows from (5.9) since $\Delta\gamma_1 = 0$ implies $\hat{s}_1^{(kr)} = 0$, $\hat{s}_{con}^{(kr)} = \hat{s}_2^{(kr)}$; while a similar argument gives the $\Delta\gamma_2 = 0$ case of (5.11).
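The cancellation behind (5.11) is easy to confirm numerically. The sketch below codes the right hand side of (5.9) as a function `phi` (our name) of three normalized speeds and evaluates it on the two degenerate configurations; the sample speeds are hypothetical.

```python
def phi(s1: float, scon: float, s2: float) -> float:
    """Right hand side of (5.9); each argument is a normalized speed in (-1, 1)."""
    first = scon**2 / (1.0 - scon)
    second = s1**2 / (1.0 - s1)
    third = s2**2 * (1.0 + scon) / ((1.0 + s2) * (1.0 - scon))
    return first - second - third


# Hypothetical sample speeds, chosen only for illustration.
samples = (-0.8, -0.3, 0.0, 0.25, 0.6)

# Delta gamma_1 = 0: s1 = 0 and scon = s2, so the first and third terms cancel.
residual_1 = max(abs(phi(0.0, s, s)) for s in samples)

# Delta gamma_2 = 0: s2 = 0 and scon = s1, so the first and second terms cancel.
residual_2 = max(abs(phi(s, s, 0.0)) for s in samples)

print(residual_1 < 1e-12, residual_2 < 1e-12)
```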

We now begin an examination of the sign of $\Phi$ in each of the four quadrants of the $(\Delta\gamma_1, \Delta\gamma_2)$-plane. The following result establishes that this sign is negative in the second and fourth quadrants:


3. Note that $\Upsilon_{(rl)} = \Upsilon_{(rl)}^{(kr)}$, which is in general different from the $\Upsilon_{(rl)}$ appearing in (5.5) of [1992L]. For legibility, the (kr) superscript is implied in what follows. A similar remark applies to subsequent functions, such as those appearing in (5.8) and (5.10). We note in passing that the functional dependence upon $k$ has not always been acknowledged in the argument lists, e.g. $S(\Delta\gamma) = S(\Delta\gamma, k)$.
Theorem 1. If either $\Delta\gamma_1 < 0$, $\Delta\gamma_2 > 0$ or $\Delta\gamma_1 > 0$, $\Delta\gamma_2 < 0$, then $\Phi(\Delta\gamma_1, \Delta\gamma_2) < 0$.

Proof. Consider separately the five cases corresponding to the four octants which comprise the second and fourth quadrants, along with the diagonal line separating these octants.

Case 1: the diagonal line. Here $\Delta\gamma_1 + \Delta\gamma_2 = 0$, yielding $\hat{s}_{con}^{(kr)} = 0$ so that the first term of the right hand side of (5.9) is zero. The second term and the third term of the right hand side of (5.9) then give that $\Phi(\Delta\gamma_1, \Delta\gamma_2) < 0$.

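Case 2: the N-NW octant. Here $\Delta\gamma_1 < 0$, $\Delta\gamma_2 > 0$ and $\Delta\gamma_1 + \Delta\gamma_2 > 0$, in which case $\hat{s}_1^{(kr)} < 0 < \hat{s}_{con}^{(kr)} < \hat{s}_2^{(kr)}$. The first term and the third term of the right hand side of (5.9) then combine to give

\[
\begin{aligned}
\frac{(\hat{s}_{con}^{(kr)})^2}{1 - \hat{s}_{con}^{(kr)}} - \frac{(\hat{s}_2^{(kr)})^2 (1 + \hat{s}_{con}^{(kr)})}{(1 + \hat{s}_2^{(kr)})(1 - \hat{s}_{con}^{(kr)})}
&= \frac{1}{(1 + \hat{s}_2^{(kr)})(1 - \hat{s}_{con}^{(kr)})}\Bigl( (\hat{s}_{con}^{(kr)})^2 (1 + \hat{s}_2^{(kr)}) - (\hat{s}_2^{(kr)})^2 (1 + \hat{s}_{con}^{(kr)}) \Bigr)\\
&= \frac{1}{(1 + \hat{s}_2^{(kr)})(1 - \hat{s}_{con}^{(kr)})}\Bigl( (\hat{s}_{con}^{(kr)})^2 - (\hat{s}_2^{(kr)})^2 + \hat{s}_2^{(kr)} \hat{s}_{con}^{(kr)} (\hat{s}_{con}^{(kr)} - \hat{s}_2^{(kr)}) \Bigr) < 0.
\end{aligned}
\tag{5.12}
\]

Since the second term of the right hand side of (5.9) is negative, the result (5.12) then gives $\Phi(\Delta\gamma_1, \Delta\gamma_2) < 0$.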

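Case 3: the W-NW octant. Here $\Delta\gamma_1 < 0$, $\Delta\gamma_2 > 0$ and $\Delta\gamma_1 + \Delta\gamma_2 < 0$, in which case $\hat{s}_1^{(kr)} < \hat{s}_{con}^{(kr)} < 0 < \hat{s}_2^{(kr)}$. The first term and the second term of the right hand side of (5.9) then combine to give

\[
\begin{aligned}
\frac{(\hat{s}_{con}^{(kr)})^2}{1 - \hat{s}_{con}^{(kr)}} - \frac{(\hat{s}_1^{(kr)})^2}{1 - \hat{s}_1^{(kr)}}
&= \frac{1}{(1 - \hat{s}_1^{(kr)})(1 - \hat{s}_{con}^{(kr)})}\Bigl( (\hat{s}_{con}^{(kr)})^2 (1 - \hat{s}_1^{(kr)}) - (\hat{s}_1^{(kr)})^2 (1 - \hat{s}_{con}^{(kr)}) \Bigr)\\
&= \frac{1}{(1 - \hat{s}_1^{(kr)})(1 - \hat{s}_{con}^{(kr)})}\Bigl( (\hat{s}_{con}^{(kr)})^2 - (\hat{s}_1^{(kr)})^2 + \hat{s}_1^{(kr)} \hat{s}_{con}^{(kr)} (\hat{s}_1^{(kr)} - \hat{s}_{con}^{(kr)}) \Bigr) < 0.
\end{aligned}
\tag{5.13}
\]

Since the third term of the right hand side of (5.9) is negative, the result (5.13) then gives $\Phi(\Delta\gamma_1, \Delta\gamma_2) < 0$.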

Cases 4 and 5, corresponding to the E-SE and S-SE octants, can be treated similarly.

QED

Theorem 1 indicates for the (rl)-case that the concurrent pulse encounter suffers the lesser energy loss in the event that the incoming pulses are of opposite signs. As mentioned in the Introduction, a similar result was obtained in [1992L] for the case in which the phase boundary motion is governed by (M.D.C.) instead of the kinetic criterion (3.4) considered here.
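The sign claim of Theorem 1 depends only on the ordering of the three normalized speeds, so it can be spot-checked on a grid without knowing $S$ explicitly. The sketch below assumes, as the sign and monotonicity properties of $S$ imply, that the ordering of the strain increments carries over to the speeds; the helper names and the grid are ours.

```python
import itertools


def phi(s1: float, scon: float, s2: float) -> float:
    """Right hand side of (5.9); each argument is a normalized speed in (-1, 1)."""
    first = scon**2 / (1.0 - scon)
    second = s1**2 / (1.0 - s1)
    third = s2**2 * (1.0 + scon) / ((1.0 + s2) * (1.0 - scon))
    return first - second - third


def opposite_sign_case(s1: float, scon: float, s2: float) -> bool:
    """Speed orderings that arise when the two pulses have opposite signs."""
    return (
        (scon == 0 and s1 * s2 < 0)   # Case 1: the diagonal line
        or s1 < 0 < scon < s2         # Case 2: N-NW octant
        or s1 < scon < 0 < s2         # Case 3: W-NW octant
        or s2 < 0 < scon < s1         # mirror ordering in the fourth quadrant
        or s2 < scon < 0 < s1         # mirror ordering in the fourth quadrant
    )


# Grid of speeds -0.95, -0.90, ..., 0.95 (an arbitrary illustrative choice).
grid = [i / 20 for i in range(-19, 20)]

values = [
    phi(s1, scon, s2)
    for s1, scon, s2 in itertools.product(grid, repeat=3)
    if opposite_sign_case(s1, scon, s2)
]

largest = max(values)
print(len(values) > 0, largest < 0)
```

A grid check of this kind is not a proof, but it exercises all five cases of the argument, including the two that the proof leaves to the reader.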

Before examining the first and third quadrants of the $(\Delta\gamma_1, \Delta\gamma_2)$-plane, we shall briefly consider the behavior of $\Phi(\Delta\gamma_1, \Delta\gamma_2)$ near the origin. Let partial derivatives of $\Phi$ be denoted by numerical subscripts, e.g. $\Phi_1(\Delta\gamma_1, \Delta\gamma_2) \equiv \frac{\partial}{\partial \Delta\gamma_1} \Phi(\Delta\gamma_1, \Delta\gamma_2)$; then (5.11) gives $\Phi(0,0) = 0$, $\Phi_1(0,0) = 0$, $\Phi_2(0,0) = 0$, $\Phi_{11}(0,0) = 0$, $\Phi_{22}(0,0) = 0$, while (3.8), (3.9), (5.9), (5.10) give

\[
\Phi_{12}(0,0) = \frac{2}{(1/k + 1/2)^2},
\]

so that

\[
\Phi(\Delta\gamma_1, \Delta\gamma_2) = \frac{2\,\Delta\gamma_1 \Delta\gamma_2}{(1/k + 1/2)^2} + O((\Delta\gamma)^3),
\tag{5.14}
\]

yielding saddle point behavior near the origin. This result corroborates Theorem 1 and also indicates that $\Phi > 0$ in the first and third quadrants provided that $(\Delta\gamma_1, \Delta\gamma_2)$ is sufficiently close to $(0, 0)$. Again, similar qualitative results were obtained in [1992L] for the (M.D.C.)-case. Also, as anticipated, the special case of $k = 2$ in (5.14) gives the same local result as that for the maximally dissipative problem (viz. (5.11) of [1992L]).
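The leading term in (5.14) can be illustrated numerically. The sketch below is not the full calculation: it replaces $S$ by its linearization $\hat{s} \approx a\,\Delta\gamma$ with $a = 1/(1/k + 1/2)$, which is the slope implied by (5.14), and compares (5.9) with $2\,\Delta\gamma_1 \Delta\gamma_2 / (1/k + 1/2)^2$ for small increments. The function names and the sample increments are ours.

```python
def phi(s1: float, scon: float, s2: float) -> float:
    """Right hand side of (5.9); each argument is a normalized speed in (-1, 1)."""
    first = scon**2 / (1.0 - scon)
    second = s1**2 / (1.0 - s1)
    third = s2**2 * (1.0 + scon) / ((1.0 + s2) * (1.0 - scon))
    return first - second - third


def leading_term(dg1: float, dg2: float, k: float) -> float:
    """Quadratic term of (5.14)."""
    return 2.0 * dg1 * dg2 / (1.0 / k + 0.5) ** 2


k = 10.0
a = 1.0 / (1.0 / k + 0.5)  # assumed small-increment slope of S

ratios = []
for eps in (1e-2, 1e-3):
    for sign1, sign2 in ((1, 1), (1, -1), (-1, 1), (-1, -1)):
        dg1, dg2 = sign1 * eps, sign2 * 2 * eps
        # Linearized speeds: s = a * (strain increment).
        value = phi(a * dg1, a * (dg1 + dg2), a * dg2)
        ratios.append(value / leading_term(dg1, dg2, k))

# Every ratio is close to 1, so the sign of phi follows the sign of dg1 * dg2.
print(all(abs(r - 1.0) < 0.1 for r in ratios))
```

The ratios approach 1 as the increments shrink, which is the saddle point behavior described above: positive values when the increments share a sign, negative values when they do not.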

The  qualitative  behavior  of  solutions  to  the  problem  under  study  here  departs  from  the 
behavior  of  the  solutions  governed  by  (M.D.C.)  as  studied  in  [1992L]  for  incoming  pulse  incre- 


Table 1. Values of $\Phi(\Delta\gamma_1, \Delta\gamma_2)$ for $k = 1$.


1.000 


0.800 


0.600 


nnm 


-0.600 


.368 


0.400  -0.129  -0.136 


0.200  -0.056 


HQ 


-0.200  0.000 


-0.162  0.000 


0.000 


0.000 


0.000 


0.000 


0.000  0.000  0.000  0.000  0.000  0.000  0.000 


0.045  0.047  0.041  0.026  0.000 


0.066 

0.076 

0.078 

0.092 

0.081 

0.097 

0.075 

0.093 

0.200 


0.187 


0.168 


0.134 


0.091 


0.045 


0.000 


0.041 


0.075 


02 


0.400 


0.378 


0.353 


0.294 


0.208 


0.105 


0.000 


-0.098 


-0.183 


-0.253 


-0.130  -0.334  - 


0.600 

0.800 

1.000 

0.545 

0.665 

0.717 

0.531 

0.677 

0.770 

0.463 

0.617 

0.735 

0.340 

0.474 

0.591 

0.178 

0.259 

0.336 

0.000 

0.000 

0.000 

-0.174 

-0.268 

-0.374 

-0  330 

-0.518 

-0.742 

-0.462 

-0.735 

-1.072 

-0.908 

-1.342 

-0.629 

-1.028 

-1.539 

Table 2. Values of $\Phi(\Delta\gamma_1, \Delta\gamma_2)$ for $k = 10$.


SS3 

§ff2j 

-0.800 

-0.400 

iron 

0.000 

0.200 

0.400 

0.600 

0.800 

1.000 

1.000 

-0.787 

n 

-0.552 

ESI! 

0.000 

0.089 

-0.081 

-0.414 

-0.830 

-1.288 

0.800 

Sfffl 

-0.595 

ms 

-0.558 

8113 

0.000 

0.122 

-0.007 

IBB 

-0.661 

-1.069 

0.600 

-0.357 

BtJESJliil 

-0.506 

0.000 

0.161 

0.088 

-0.134 

-0.432 

-0.768 

0.400 

-0.179 

-0.348 

0.000 

0.197 

0.197 

0.066 

-0.130 

-0.359 

0.200 

-0.065 

-0.097 

-0.140 

EE3 

0.000 

0.182 

BBS 

0.233 

0.170 

0.085 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

0.010 

0.016 

0.027 

0.044 

0.059 

0.000 

-0.272 

-0.662 

-0.999 

-1.277 

-1.525 

-0.008 

-0.001 

0.012 

0.033 

0.056 

0.000 

-0.351 

-1.084 

-2.005 

-2.927 

-3.821 

PH 

-0.028 

-0.015 

0.009 

0.039 

0.000 

-0.321 

-1.063 

-2.238 

-3.757 

-5.433 

8Bi 

-0.060 

-0.054 

0.023 

0.000 

-0.283 

-0.941 

-1.993 

-3.567 

-5.722 

beei 

-0.084 

-0.078 

-0.063 

-0.034 

0.009 

0.000 

-0.254 

-0.847 

-1.747 

-3.035 

-4.990 

ments  (Ay^Ay^  which  lie  away  from  the  origin  in  the  first  and  third  quadrants.  We  have  numeri¬ 
cally  calculated  4>(AY,.Ay2)  for  various  pairs  (AY,,Ay2)  and  various  mobility  values  k.  Some  of 
these results are displayed in Table 1 ($k = 1$) and Table 2 ($k = 10$). It is to be noted for the $k = 1$ case of Table 1 that $\Phi > 0$ in the first and third quadrants for all of the tabulated pairs $(\Delta\gamma_1, \Delta\gamma_2)$. In contrast, for the $k = 10$ case of Table 2 it is found that $\Phi < 0$ in the first and third quadrants for certain pairs $(\Delta\gamma_1, \Delta\gamma_2)$. Similar data for the cases $k = 2$ and $100$ indicate that the extent of the $(\Delta\gamma_1, \Delta\gamma_2)$-domain in the first and third quadrants on which $\Phi < 0$ increases with $k$. The following Theorem provides a certain characterization of the domain in the first and third quadrants on which $\Phi > 0$ in terms of the normalized phase boundary speed $\hat{s}_{con}^{(kr)} = S(\Delta\gamma_1 + \Delta\gamma_2)$.


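Theorem 2. If either $\Delta\gamma_1 > 0$, $\Delta\gamma_2 > 0$ or $\Delta\gamma_1 < 0$, $\Delta\gamma_2 < 0$, such that $|S(\Delta\gamma_1 + \Delta\gamma_2)| < 0.5$, then $\Phi(\Delta\gamma_1, \Delta\gamma_2) > 0$.

Proof. Using (3.5) to rearrange (5.9), one finds that (5.10) can be expressed as

\[
\Phi(\Delta\gamma_1, \Delta\gamma_2) = X(\Delta\gamma_1, \Delta\gamma_2, k) + Y(\Delta\gamma_1)\, W_1(\Delta\gamma_1, \Delta\gamma_2) + Y(\Delta\gamma_2)\, Z(\Delta\gamma_1 + \Delta\gamma_2)\, W_2(\Delta\gamma_1, \Delta\gamma_2),
\tag{5.15}
\]

where

\[
\begin{aligned}
X(\Delta\gamma_1, \Delta\gamma_2, k) &= X\bigl(S(\Delta\gamma_1), S(\Delta\gamma_1 + \Delta\gamma_2), S(\Delta\gamma_2), k\bigr),\\
Y(\Delta\gamma_1) &= Y\bigl(S(\Delta\gamma_1)\bigr), \qquad Z(\Delta\gamma_1 + \Delta\gamma_2) = Z\bigl(S(\Delta\gamma_1 + \Delta\gamma_2)\bigr),\\
W_1(\Delta\gamma_1, \Delta\gamma_2) &= W_1\bigl(S(\Delta\gamma_1), S(\Delta\gamma_1 + \Delta\gamma_2)\bigr), \qquad W_2(\Delta\gamma_1, \Delta\gamma_2) = W_2\bigl(S(\Delta\gamma_1 + \Delta\gamma_2), S(\Delta\gamma_2)\bigr),
\end{aligned}
\tag{5.16}
\]

with

\[
\begin{aligned}
X(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}, k) &= \frac{2\,(1 + \hat{s}_{con}^{(kr)})\,(\hat{s}_{con}^{(kr)})\,(\hat{s}_1^{(kr)} + \hat{s}_2^{(kr)} - \hat{s}_{con}^{(kr)})}{k},\\
Y(\hat{s}^{(kr)}) &= \frac{\hat{s}^{(kr)}}{1 - (\hat{s}^{(kr)})^2}, \qquad Z(\hat{s}_{con}^{(kr)}) = \frac{1 + \hat{s}_{con}^{(kr)}}{1 - \hat{s}_{con}^{(kr)}},\\
W_1(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}) &= \hat{s}_{con}^{(kr)} (1 + \hat{s}_{con}^{(kr)}) - \hat{s}_1^{(kr)} (1 + \hat{s}_1^{(kr)}),\\
W_2(\hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}) &= \hat{s}_{con}^{(kr)} (1 - \hat{s}_{con}^{(kr)}) - \hat{s}_2^{(kr)} (1 - \hat{s}_2^{(kr)}).
\end{aligned}
\tag{5.17}
\]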

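We now examine the sign of each of the five functions appearing in (5.17). The following lemma addresses the function $X(\Delta\gamma_1, \Delta\gamma_2, k)$.

Lemma A. $X(\Delta\gamma_1, \Delta\gamma_2, k) > 0$, if $\Delta\gamma_1 \Delta\gamma_2 > 0$.

Proof of Lemma A. Since $k > 0$ and $1 + \hat{s}_{con}^{(kr)} > 0$ it follows from (5.17)$_1$ that

\[
X(\Delta\gamma_1, \Delta\gamma_2, k) > 0, \quad \text{if } (\hat{s}_{con}^{(kr)})(\hat{s}_1^{(kr)} + \hat{s}_2^{(kr)} - \hat{s}_{con}^{(kr)}) > 0.
\tag{5.18}
\]

In the first quadrant, $\Delta\gamma_1 > 0$ and $\Delta\gamma_2 > 0$, one has $\hat{s}_{con}^{(kr)} > 0$, and we examine the remaining factor $\hat{s}_1^{(kr)} + \hat{s}_2^{(kr)} - \hat{s}_{con}^{(kr)}$ of (5.18) by considering separately the two cases: $\Delta\gamma_1 \ge \Delta\gamma_2$ and $\Delta\gamma_1 < \Delta\gamma_2$. In the first case, $\Delta\gamma_1 \ge \Delta\gamma_2 > 0$, an application of the Mean Value Theorem gives

\[
\begin{aligned}
\hat{s}_{con}^{(kr)} &= S(\Delta\gamma_1 + \Delta\gamma_2) = S(\Delta\gamma_1) + \Delta\gamma_2\, S'(a), && \text{with } 0 < \Delta\gamma_1 < a < \Delta\gamma_1 + \Delta\gamma_2,\\
\hat{s}_2^{(kr)} &= S(\Delta\gamma_2) = S(0) + \Delta\gamma_2\, S'(b) = \Delta\gamma_2\, S'(b), && \text{with } 0 < b < \Delta\gamma_2,
\end{aligned}
\tag{5.19}
\]

so that

\[
\hat{s}_1^{(kr)} + \hat{s}_2^{(kr)} - \hat{s}_{con}^{(kr)} = \Delta\gamma_2 \bigl(S'(b) - S'(a)\bigr), \quad \text{with } 0 < b < \Delta\gamma_2 \le \Delta\gamma_1 < a < \Delta\gamma_1 + \Delta\gamma_2.
\tag{5.20}
\]

Now (3.11) and (3.12) give that $a > b > 0 \Rightarrow S'(b) > S'(a) > 0$, whence (5.20) yields $\hat{s}_1^{(kr)} + \hat{s}_2^{(kr)} - \hat{s}_{con}^{(kr)} > 0$, which establishes the required condition in (5.18) for the case $\Delta\gamma_1 \ge \Delta\gamma_2 > 0$. The required condition in (5.18) can be established in a similar fashion for the other first quadrant case $\Delta\gamma_2 > \Delta\gamma_1 > 0$. The case for the third quadrant follows in a similar fashion.

QED Lemma A

Now since $-1 < \hat{s}_1^{(kr)} < 1$ with $\mathrm{sign}(\hat{s}_1^{(kr)}) = \mathrm{sign}(\Delta\gamma_1)$ it follows from (5.17)$_2$ that

\[
\mathrm{sign}\bigl(Y(\Delta\gamma_1)\bigr) = \mathrm{sign}(\Delta\gamma_1)
\tag{5.21}
\]

while (5.17)$_3$ with $-1 < \hat{s}_{con}^{(kr)} < 1$ gives

\[
Z(\Delta\gamma_1 + \Delta\gamma_2) > 0.
\tag{5.22}
\]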


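The next lemma addresses the functions $W_1(\Delta\gamma_1, \Delta\gamma_2)$ and $W_2(\Delta\gamma_1, \Delta\gamma_2)$.

Lemma B. If $\Delta\gamma_1 > 0$, $\Delta\gamma_2 > 0$ such that $|S(\Delta\gamma_1 + \Delta\gamma_2)| < 0.5$, then $W_1(\Delta\gamma_1, \Delta\gamma_2) > 0$ and $W_2(\Delta\gamma_1, \Delta\gamma_2) > 0$. If $\Delta\gamma_1 < 0$, $\Delta\gamma_2 < 0$ such that $|S(\Delta\gamma_1 + \Delta\gamma_2)| < 0.5$, then $W_1(\Delta\gamma_1, \Delta\gamma_2) < 0$ and $W_2(\Delta\gamma_1, \Delta\gamma_2) < 0$.

Proof of Lemma B. We write

\[
\begin{aligned}
W_1(\Delta\gamma_1, \Delta\gamma_2) &= W_1(\hat{s}_1^{(kr)}, \hat{s}_{con}^{(kr)}) = G_1(\hat{s}_{con}^{(kr)}) - G_1(\hat{s}_1^{(kr)}), && G_1(s) \equiv s(1 + s),\\
W_2(\Delta\gamma_1, \Delta\gamma_2) &= W_2(\hat{s}_{con}^{(kr)}, \hat{s}_2^{(kr)}) = G_2(\hat{s}_{con}^{(kr)}) - G_2(\hat{s}_2^{(kr)}), && G_2(s) \equiv s(1 - s),
\end{aligned}
\tag{5.23}
\]

and note that both $G_1(s)$ and $G_2(s)$ are monotonically increasing for $|s| < 0.5$. Therefore if $\Delta\gamma_1 > 0$, $\Delta\gamma_2 > 0$ such that $|S(\Delta\gamma_1 + \Delta\gamma_2)| < 0.5$ it then follows that

\[
\begin{aligned}
\Delta\gamma_1 + \Delta\gamma_2 > \Delta\gamma_1 > 0 &\Rightarrow 0.5 > \hat{s}_{con}^{(kr)} > \hat{s}_1^{(kr)} > 0 \Rightarrow G_1(\hat{s}_{con}^{(kr)}) > G_1(\hat{s}_1^{(kr)}) \Rightarrow W_1(\Delta\gamma_1, \Delta\gamma_2) > 0,\\
\Delta\gamma_1 + \Delta\gamma_2 > \Delta\gamma_2 > 0 &\Rightarrow 0.5 > \hat{s}_{con}^{(kr)} > \hat{s}_2^{(kr)} > 0 \Rightarrow G_2(\hat{s}_{con}^{(kr)}) > G_2(\hat{s}_2^{(kr)}) \Rightarrow W_2(\Delta\gamma_1, \Delta\gamma_2) > 0.
\end{aligned}
\tag{5.24}
\]

Similarly if $\Delta\gamma_1 < 0$, $\Delta\gamma_2 < 0$ such that $|S(\Delta\gamma_1 + \Delta\gamma_2)| < 0.5$ it follows that

\[
\begin{aligned}
\Delta\gamma_1 + \Delta\gamma_2 < \Delta\gamma_1 < 0 &\Rightarrow -0.5 < \hat{s}_{con}^{(kr)} < \hat{s}_1^{(kr)} < 0 \Rightarrow G_1(\hat{s}_{con}^{(kr)}) < G_1(\hat{s}_1^{(kr)}) \Rightarrow W_1(\Delta\gamma_1, \Delta\gamma_2) < 0,\\
\Delta\gamma_1 + \Delta\gamma_2 < \Delta\gamma_2 < 0 &\Rightarrow -0.5 < \hat{s}_{con}^{(kr)} < \hat{s}_2^{(kr)} < 0 \Rightarrow G_2(\hat{s}_{con}^{(kr)}) < G_2(\hat{s}_2^{(kr)}) \Rightarrow W_2(\Delta\gamma_1, \Delta\gamma_2) < 0.
\end{aligned}
\tag{5.25}
\]

QED Lemma B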
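The role of the bound 0.5 in Lemma B can be seen directly from $G_1(s) = s(1+s)$ and $G_2(s) = s(1-s)$, whose derivatives $1 + 2s$ and $1 - 2s$ are both positive only for $|s| < 0.5$. A small illustrative check (the grid spacing is an arbitrary choice of ours):

```python
def G1(s: float) -> float:
    return s * (1.0 + s)


def G2(s: float) -> float:
    return s * (1.0 - s)


# Grid strictly inside the window |s| < 0.5.
grid = [i / 100 for i in range(-49, 50)]

increasing_inside = all(
    G1(lo) < G1(hi) and G2(lo) < G2(hi) for lo, hi in zip(grid, grid[1:])
)

# Outside the window monotonicity is lost:
# G2 peaks at s = 0.5 and G1 bottoms out at s = -0.5.
g2_drops_past_half = G2(0.6) < G2(0.5)
g1_rises_below_minus_half = G1(-0.6) > G1(-0.5)

print(increasing_inside, g2_drops_past_half, g1_rises_below_minus_half)
```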

Therefore (5.21), (5.22) and Lemma B give, under the hypotheses of the Theorem, that

\[
Y(\Delta\gamma_1)\, W_1(\Delta\gamma_1, \Delta\gamma_2) > 0, \qquad Y(\Delta\gamma_2)\, Z(\Delta\gamma_1 + \Delta\gamma_2)\, W_2(\Delta\gamma_1, \Delta\gamma_2) > 0.
\tag{5.26}
\]

This result, in conjunction with Lemma A and (5.15), then establishes the Theorem.

QED

In Table 3 we display the phase boundary speed $\hat{s}_{con}^{(kr)}$ corresponding to the $k = 10$ case for which $\Phi$ was previously given in Table 2. Examination of the bold entries of these two tables indicates numerical consistency with Theorem 2. In addition, it is also clear that the converse to Theorem 2 does not hold.
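The failure of the converse can be illustrated with a few tabulated speeds. The sketch below assumes our reading of Table 3 for $k = 10$, namely $S(0.2) \approx 0.307$, $S(0.4) \approx 0.513$, $S(1.0) \approx 0.765$ and $S(1.4) \approx 0.828$ (three-digit roundings, so the resulting values of $\Phi$ are approximate), and feeds them into (5.9).

```python
def phi(s1: float, scon: float, s2: float) -> float:
    """Right hand side of (5.9); each argument is a normalized speed in (-1, 1)."""
    first = scon**2 / (1.0 - scon)
    second = s1**2 / (1.0 - s1)
    third = s2**2 * (1.0 + scon) / ((1.0 + s2) * (1.0 - scon))
    return first - second - third


# Assumed reading of Table 3 (k = 10): strain increment -> normalized speed.
S = {0.2: 0.307, 0.4: 0.513, 1.0: 0.765, 1.4: 0.828}

# Two equal positive pulses of 0.2: the concurrent speed S(0.4) exceeds 0.5,
# so Theorem 2 does not apply, and yet phi is still positive.
same_small = phi(S[0.2], S[0.4], S[0.2])

# Positive pulses 0.4 and 1.0: same sign, concurrent speed S(1.4), phi negative.
same_large = phi(S[0.4], S[1.4], S[1.0])

print(S[0.4] > 0.5, same_small > 0, same_large < 0)
```

The first pair shows that $\Phi > 0$ can occur with $|\hat{s}_{con}^{(kr)}| \ge 0.5$, which is exactly the statement that the converse to Theorem 2 does not hold; the second pair shows a same-sign encounter with $\Phi < 0$.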

6. Energy Loss for the General Concurrent Pulse Problem Governed by the Linear Kinetic Relation

The energy losses for the (rr), (lr), and (ll)-cases can be determined in a corresponding way. The energy loss differences analogous to $\Upsilon_{(rl)}$ for these other cases are found to be given by

Table 3. Values of $\hat{s}_{con}^{(kr)}$ for $k = 10$.

Δγ2 \ Δγ1  -1.000  -0.800  -0.600  -0.400  -0.200   0.000   0.200   0.400   0.600   0.800   1.000
  1.000     0.000   0.307   0.513   0.637   0.714   0.765   0.801   0.828   0.849   0.865   0.878
  0.800    -0.307   0.000   0.307   0.513   0.637   0.714   0.765   0.801   0.828   0.849   0.865
  0.600    -0.513  -0.307   0.000   0.307   0.513   0.637   0.714   0.765   0.801   0.828   0.849
  0.400    -0.637  -0.513  -0.307   0.000   0.307   0.513   0.637   0.714   0.765   0.801   0.828
  0.200    -0.714  -0.637  -0.513  -0.307   0.000   0.307   0.513   0.637   0.714   0.765   0.801
  0.000    -0.765  -0.714  -0.637  -0.513  -0.307   0.000   0.307   0.513   0.637   0.714   0.765
 -0.200    -0.801  -0.765  -0.714  -0.637  -0.513  -0.307   0.000   0.307   0.513   0.637   0.714
 -0.400    -0.828  -0.801  -0.765  -0.714  -0.637  -0.513  -0.307   0.000   0.307   0.513   0.637
 -0.600    -0.849  -0.828  -0.801  -0.765  -0.714  -0.637  -0.513  -0.307   0.000   0.307   0.513
 -0.800    -0.865  -0.849  -0.828  -0.801  -0.765  -0.714  -0.637  -0.513  -0.307   0.000   0.307
 -1.000    -0.878  -0.865  -0.849  -0.828  -0.801  -0.765  -0.714  -0.637  -0.513  -0.307   0.000

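\[
\begin{aligned}
(rr):\quad & \Upsilon_{(rr)} = [\,\cdots\,]\;\Psi(\Delta\gamma_1, \Delta\gamma_2),\\
(lr):\quad & \Upsilon_{(lr)} = [\,\cdots\,]\;\Psi(\Delta\gamma_1, \Delta\gamma_2),\\
(ll):\quad & \Upsilon_{(ll)} = [\,\cdots\,]\;\Phi(\Delta\gamma_1, \Delta\gamma_2),
\end{aligned}
\tag{6.1}
\]

with

\[
\Psi(\Delta\gamma_1, \Delta\gamma_2) \equiv \Phi(-\Delta\gamma_2, -\Delta\gamma_1).
\tag{6.2}
\]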

lq(c  +  s2r) ) 

In  the  (/r)-case,  another  projected  time  t  =  t  + - — —  has  been  introduced. 

c(c-s2) 

In particular, $\Phi(\Delta\gamma_1, \Delta\gamma_2)$ and $\Psi(\Delta\gamma_1, \Delta\gamma_2)$ are each negative in the second and fourth quadrants, whereas neither sign is excluded in the first and third quadrants. Thus we now summarize our findings for the concurrent pulse problem with phase boundary speed governed by the linear kinetic relation (3.4):

The concurrent pulse encounter suffers the lesser energy loss in the event that the two incoming pulses are of opposite sign, whereas knowledge that the incoming pulses are of the same sign is not sufficient to conclude that the concurrent pulse encounter suffers the greater energy loss. However, if the incoming pulses are of the same sign, and each is sufficiently small, then the concurrent pulse encounter suffers the greater energy loss.

Finally, we recall from Section 2 that we have not seriously considered the effect of the additional restrictions upon the phase boundary velocities which are necessary to ensure that the strain values in (2.5) remain in the segregation intervals associated with their respective phases. In particular, we have not explicitly constructed an example in which: the incoming pulses are of the same sign, the concurrent pulse suffers the lesser energy loss (implying by Theorem 2 that


scon 


S0.5),  and  i,(tr)  e  3,,  C*  €  3^,  s?n  e  3 


(kr) 


■  < kr ) 


2’ 
A FUNCTION WHOSE VALUES ARE INTEGERS, II

Abstract: In this paper we prove the following:

For any value of k, there exists a value of n, for which


is an integer.

Introduction. In [1], using determinants of certain triangular matrices, the following was proved.
Theorem. For any positive integers k and n,


is an integer.

* Dr. Luchins spent the 1991-1992 academic year as the Distinguished Visiting Professor of Mathematics at the United States Military Academy and has since returned to Rensselaer Polytechnic Institute.

The proof is repeated here. For any integers k and w, k > 0, notice that

(2) (a)  $(w+1)^{2k+1} - w^{2k+1} = 1 + \sum_{r=1}^{2k} \binom{2k+1}{r} w^{2k+1-r}$,

    (b)  $(w-1)^{2k+1} - w^{2k+1} = -1 + \sum_{r=1}^{2k} (-1)^r \binom{2k+1}{r} w^{2k+1-r}$,

where

$\binom{p}{q} = \frac{p!}{q!\,(p-q)!}$,  $(q = 0, 1, \dots, p)$.

For any positive integers m and n, let $S(m,n) = 1^m + 2^m + \cdots + n^m$. If in (2a) we let $w = 0, 1, 2, \dots, n$ and add, we obtain

(3)  $(n+1)^{2k+1} = \sum_{r=1}^{2k} \binom{2k+1}{r} S(2k+1-r, n) + n + 1$.

Just as we found (3) from (2a), from (2b) we get

(4)  $-n^{2k+1} = \sum_{r=1}^{2k} (-1)^r \binom{2k+1}{r} S(2k+1-r, n) - n$.

Replacing k by j and subtracting (4) from (3), we obtain

(5)  $\sum_{r=1}^{j} \binom{2j+1}{2r-1}\, 2S(2j+2-2r, n) = a_{2j+1}$,

where

(6)  $a_{2j+1} = (n+1)^{2j+1} + n^{2j+1} - (2n+1)$.

Next, we consider the k equations obtained from (5) by successively replacing j by $1, 2, \dots, k$. With $l = 1, 2, \dots, k$, these k equations in the k unknowns $2S(2l, n)$ can be solved by Cramer's rule to obtain

(7)  $2(D_1)\,S(2k,n) = (-1)^{k+1} D_2$,

where $D_1$ and $D_2$ are the determinants given below:


29 


Since $D_1$ is the determinant of a triangular matrix, it is simply the product of the diagonal entries. Hence

$D_1 = \prod_{r=1}^{k} (2r+1) = (2k+1)!/(2^k\,k!)$.

Equation (7) then becomes

(8)  $6\,\frac{(2k+1)!}{3 \cdot 2^k\,k!}\,S(2k,n) = (-1)^{k+1} D_2$.

To complete the proof it suffices to show that $D_2$ is a multiple of $6S(2,n)$. This will be so if every entry in a column of $D_2$ is such a multiple.

To this end let $u = 6S(2,n) = n(n+1)(2n+1)$. Then we show that for any positive integer j

(9)  $a_{2j+1} \equiv 0 \pmod{u}$,

which shows that u divides every entry in the first column of $D_2$. To reduce the problem still further, notice that $n$, $n+1$, and $2n+1$ are relatively prime in pairs, so that the congruence (9) is equivalent to the three congruences $a_{2j+1} \equiv 0 \pmod{m}$, $(m = n,\ n+1,\ 2n+1)$. Although our notation has hidden the fact that $a_{2j+1}$ depends on n, equation (6) shows that these congruences become

$(n+1)^{2j+1} \equiv 1 \pmod{n}$,
$(n+1)^{2j+1} + n^{2j+1} \equiv 0 \pmod{2n+1}$,
$n^{2j+1} \equiv n \pmod{n+1}$,

which are easily verified. This completes the proof of (1).
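The congruence (9) and the divisibility it is used for can be spot-checked by direct computation. The sketch below is illustrative only: the function names are ours, the ranges of j, k and n are arbitrary, and the last check tests the consequence of (7) that $2 D_1 S(2k,n)$ is a multiple of $6S(2,n)$.

```python
from math import factorial


def power_sum(m: int, n: int) -> int:
    """S(m, n) = 1^m + 2^m + ... + n^m."""
    return sum(i**m for i in range(1, n + 1))


def a_odd(j: int, n: int) -> int:
    """a_{2j+1} as given in (6)."""
    return (n + 1) ** (2 * j + 1) + n ** (2 * j + 1) - (2 * n + 1)


def d1(k: int) -> int:
    """D_1 = (2k+1)! / (2^k k!), the product of the odd numbers 3, 5, ..., 2k+1."""
    return factorial(2 * k + 1) // (2**k * factorial(k))


for n in range(1, 40):
    u = n * (n + 1) * (2 * n + 1)
    assert u == 6 * power_sum(2, n)
    for j in range(1, 8):
        # Congruence (9): u divides a_{2j+1}.
        assert a_odd(j, n) % u == 0
    for k in range(1, 8):
        # Consequence of (7) when D_2 is a multiple of 6 S(2, n).
        assert (2 * d1(k) * power_sum(2 * k, n)) % u == 0

print("all checks passed")
```

For example, with k = 2 and n = 3 one has $D_1 = 15$, $S(4,3) = 98$ and $6S(2,3) = 84$, and $2 \cdot 15 \cdot 98 = 2940 = 35 \cdot 84$.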

Proof of (A). It is evident that the denominator in (A) equals $\frac{(2k+1)!}{3 \cdot 2^k\,k!} = D$. In (1), it is easy to show

Let $n = (2k+1)! + 1$ so that $n + 1 = 2\left[\frac{(2k+1)!}{2} + 1\right]$ and $2n + 1 = 3\left[\frac{2(2k+1)!}{3} + 1\right]$. Now, it is evident that the only common factor of D and J is 1, which completes the proof of (A).
REVERSE DIGIT CONSTRUCTIONS OF PERFECT, MAGIC, AND DOUBLY MAGIC CUBES

West Point, NY 10996-1786
and

ABSTRACT

In this paper we introduce the concept of reverse digit pairs. We then exhibit an orthogonal, perfect, magic and doubly magic 4-cube of order 8, with the property that each row contains reverse digit pairs. In addition, orthogonal cubes with this reverse digit property are discussed and in particular an orthogonal, reversed digit 4-cube of order 10 is presented.

NOTE. Since the actual construction of the k-cubes requires thousands of numbers, we decided to send the 22 pages of tables, separately, to anyone who is interested.

Please send your request for the tables to:

Professor Joseph Arkin
Dept. of Math. Sci.
United States Military Academy
West Point, NY 10996-1786
A Few Brief Historical Notes on Perfect Cubes.

(1) In 1888, the first perfect magic cube ever constructed was of order 8, and was placed in "The Memoirs of the National Academy of Science" [1].

(2) Martin Gardner defines a perfect magic cube as follows:

"A perfect magic cube is a cubical array of positive integers from 1 to $N^3$ such that every straight line of N cells adds up to a constant. These lines include the orthogonals (the lines parallel to an edge), the two main diagonals of every orthogonal cross section and the four space diagonals. The constant is

$(1 + 2 + 3 + \cdots + N^3)/N^2 = \frac{1}{2}(N^4 + N)$" [2].

(3) E.G. Straus, in 1976, in a private letter to Arkin, described how he constructed a 7x7x7 perfect magic cube. This may be the lowest possible order of a perfect Latin 3-cube [3].

(4) In 1985, Arkin superimposed 6 orthogonal Latin cubes of order 7 to form 20 separate Latin 3-cubes [4].

(5) A perfect 4-dimensional hypercube of order 7 was constructed at West Point in 1989 [5].

(6) A perfect, associative and doubly magic supercube was constructed at West Point in 1990 [6].

Latin k-cubes of order n.

A Latin square of order n is an nxn square in which each of the numbers 0,1,...,n-1 occurs exactly once in each row and exactly once in each column. For example

01      012      0123
10      120      1230
        201      2301
                 3012

are Latin squares of order 2, 3, 4, respectively. Two Latin squares of order n are orthogonal if, when one is superimposed on the other, every ordered pair 00, 01, ..., n-1 n-1 occurs. Thus

012       012                     00 11 22
120  and  201  superimpose to     12 20 01
201       120                     21 02 10

and therefore are orthogonal squares of order 3. A set of Latin squares of order n is orthogonal if every two of them are orthogonal. As an example the 4x4 square of triples

000 111 222 333
123 032 301 210
231 320 013 102
312 203 130 021

represents three mutually orthogonal squares of order 4 since each of the 16 pairs 00, 01, ..., 33 occurs in each of the three possible positions among the 16 triples.
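The definitions above translate directly into a short check. The following sketch (function names are ours) splits a superimposed square into its component squares and verifies both the Latin property and pairwise orthogonality for the order 3 and order 4 examples just given.

```python
from itertools import combinations


def is_latin(square):
    """Each of 0..n-1 occurs exactly once in every row and every column."""
    n = len(square)
    symbols = set(range(n))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({square[r][c] for r in range(n)} == symbols for c in range(n))
    return rows_ok and cols_ok


def are_orthogonal(a, b):
    """Superimposing a and b produces every ordered pair exactly once."""
    n = len(a)
    pairs = {(a[r][c], b[r][c]) for r in range(n) for c in range(n)}
    return len(pairs) == n * n


def split_layers(cells):
    """Turn a square of digit strings such as '123' into one square per digit position."""
    width = len(cells[0][0])
    return [[[int(cell[p]) for cell in row] for row in cells] for p in range(width)]


order3 = split_layers([
    ["00", "11", "22"],
    ["12", "20", "01"],
    ["21", "02", "10"],
])

order4 = split_layers([
    ["000", "111", "222", "333"],
    ["123", "032", "301", "210"],
    ["231", "320", "013", "102"],
    ["312", "203", "130", "021"],
])

for squares in (order3, order4):
    assert all(is_latin(sq) for sq in squares)
    assert all(are_orthogonal(a, b) for a, b in combinations(squares, 2))

print(len(order3), len(order4))
```

Orthogonality is tested by counting distinct ordered pairs: with n*n cells, all pairs are distinct exactly when every one of the n*n possible pairs occurs.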

We can generalize all these concepts to nxnxn cubes and cubes of higher dimensions. A Latin cube of order n is an nxnxn cube (n rows, n columns and n files) in which the numbers 0,1,...,n-1 are entered so that each number occurs exactly once in each row, column and file. If we list the cube in terms of the n squares of order n which form its different levels we can list the cubes

01  10              012  120  201
10  01     and      120  201  012
                    201  012  120

as Latin cubes of order 2 and 3, respectively.

Orthogonality of Latin cubes is the following relation among three Latin cubes: three Latin cubes of order n are orthogonal if, when superimposed, each ordered triple 000, 001, ..., n-1 n-1 n-1 will occur. For example the pair of 3x3 Latin squares

00 11 22
12 20 01
21 02 10

leads to the four 3x3x3 cubes

      012  120  201
a:    120  201  012
      201  012  120

      012  120  201
b:    201  012  120
      120  201  012

      021  102  210
c:    210  021  102
      102  210  021

      021  102  210
d:    102  210  021
      210  021  102

Superimposed, these lead to a cube of quadruples in three levels, with I over II over III.

abcd:

0000 1122 2211     1111 2200 0022     2222 0011 1100
1221 2010 0102     2002 0121 1210     0110 1202 2021
2112 0201 1020     0220 1012 2101     1001 2120 0212

      I                  II                 III

where each ordered triple occurs in every one of the four possible positions in the quadruples.

Note. We define a cube of triples (say abc) where each ordered triple occurs in some order in the 27 cells of the three levels (I over II over III) of the cube abc as a Latin 3-cube of order 3.
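The claim about the cube of quadruples abcd can be verified mechanically. In the sketch below (the variable names are ours) the 27 cells are read off the three levels, and for each of the four ways of choosing three of the four digit positions the resulting triples are counted.

```python
from itertools import combinations

# The three levels of the cube abcd, one string per row.
levels = {
    "I": ["0000 1122 2211", "1221 2010 0102", "2112 0201 1020"],
    "II": ["1111 2200 0022", "2002 0121 1210", "0220 1012 2101"],
    "III": ["2222 0011 1100", "0110 1202 2021", "1001 2120 0212"],
}

cells = [cell for rows in levels.values() for row in rows for cell in row.split()]

# Positions 0..3 correspond to the cubes a, b, c, d.
distinct_triples = {
    positions: len({tuple(cell[p] for p in positions) for cell in cells})
    for positions in combinations(range(4), 3)
}

# Every choice of three cubes (abc, abd, acd, bcd) yields all 27 ordered triples.
print(len(cells), sorted(distinct_triples.values()))
```

Since there are 27 cells and 27 possible ordered triples of digits from {0, 1, 2}, a count of 27 distinct triples for a given choice of positions means each triple occurs exactly once.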

In  this  paper  we  have  constructed  an  orthogonal  4-cube  of 
order  8.  This  construction  consists  of  eight  Latin  4-cubes. 

To  individually  construct  each  one  of  the  eight  cubes,  we 
superimpose  4  orthogonal  Latin  cubes  of  order  8.  Now,  each  one  of 
the  resulting  4096  cells  throughout  the  construction  contains  four 
digits,  where  each  ordered  quadruple  (0000, 0001 ,..., 7777)  occurs 
only  once  in  every  cell. 

A  Perfect  Construction. 

Each of the eight Latin 4-cubes is perfect in the following way (we consider only one of the eight cubes at a time): the sum (31108) of the elements in each minor diagonal is equal to the sum of the elements of a row in each of the 2 directions in each of the respective squares (layers) that make up each Latin 4-cube of order 8. The sum (31108) of the elements of a row in each direction of a cube is equal to the sum of the elements in each of the 4 major diagonals and the sum on all the diagonals of the cube is the same (namely 31108). The sum (31108) of each of the eight major space diagonals throughout the supercube is the same.

The  construction  of  the  cube  is  based  on  the  3  orthogonal  cubes 
=  x,  +  2xj  -  3xw , 

A‘Tj £  =  ~  2xj  -  3xu. 

Ai  ik  —  x  i  ox,  "T  2Xk  * 

where  (x,,...,xa)=(0,l,...,y)  and  arithmetic  is  (mod  8). 
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Prgpa.ciiM 


By labeling the four digits in each quadruple ABCD, new
entries for the 4-cube can be formed in each and every one of
the 64 squares, in columns, by the following six three-digit
combinations: ABA, BAB, BDB, DBD, CDC and DCD. With these
new entries, each of the 64 squares in the 4-cube not only
has in each column the magic sum of 3108 in base 10, but also
a doubly magic sum. That is, by squaring each new entry in
base 10 (we consider the squares that are made by using the
six column entries ABA, BAB, BDB, DBD, CDC and DCD instead of
ABCD) and summing the eight appropriate numbers, we obtain
1,640,100.

Super Doubly-Magic Properties.

Considering  one  square  at  a  time  (in  what  follows,  we 
use  all  four  digits  in  each  cell): 

a)  the  sum  of  each  of  the  8  columns  and  each  of  the  2 
mini-diagonals  are  the  same 

b)  squaring  each  of  the  four  digits  in  each  cell,  we  find 
that  the  sum  of  each  of  the  8  columns  and  each  of  the  2 
mini-diagonals  are  the  same. 

Reversed Digit Construction.

In  the  constructions  that  follow,  each  and  every  row 
throughout  the  4-cube  contains  reversed  digit  pairs.  For 
example: label  the  4  digits  in  any  cell  wxyz,  then  on  the  same 
row  there  will  always  be  another  cell  with  four  digits 
labeled  xwzy. 
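
The reversed digit and magic properties can be checked mechanically. The following is a minimal Python sketch, assuming that the first nine entries of table I (CUBE 1, SQUARE 1) are read three per row; the helper names are illustrative and do not come from the paper.

```python
# First 3x3 square of table I (CUBE 1, SQUARE 1).
# Assumption: the nine listed entries are read row by row, three per row.
SQUARE = [
    ["0000", "1221", "2112"],
    ["1122", "2010", "0201"],
    ["2211", "0102", "1020"],
]


def swap_pairs(cell):
    """Map the digits wxyz to xwzy (swap within each pair of digits)."""
    w, x, y, z = cell
    return x + w + z + y


def has_reversed_pairs(row):
    """True if the pair-swapped partner of every cell lies in the same row."""
    return all(swap_pairs(cell) in row for cell in row)


def line_sums(square):
    """Row sums and column sums, reading each cell as a base-10 number."""
    n = len(square)
    rows = [sum(int(cell) for cell in row) for row in square]
    cols = [sum(int(row[k]) for row in square) for k in range(n)]
    return rows, cols


reversed_ok = all(has_reversed_pairs(row) for row in SQUARE)
rows, cols = line_sums(SQUARE)
print(reversed_ok, rows, cols)
```

A cell such as 1122 is its own partner under the swap, so it satisfies the property trivially.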

Note.  Each  of  the  10  sums  in  (b)  above  may  vary  from  square 
to  square. 
In the 4 pages that follow we have exhibited the
following constructions:

I. In this table, we list the complete reversed digit and
magic construction of a Latin 4-cube of order 3. The 81
numbers (0000 through 2222) are distinct from each other.

II. In this table, we display cube 1 of a Latin 4-cube of
order 4, which is made up of 4 cubes, cube 1, ..., cube 4.
Each of the 4 cubes contains 64 distinct four digit numbers
and each of the 4 cubes is made up of reversed digit and
magic constructions. The complete Latin 4-cube of order 4
consists of 256 distinct four digit numbers, 0000 through
3333.

III. In this table, we display cube 1 of a Latin 4-cube of
order 8, which is made up of 8 cubes, cube 1, ..., cube 8.
Each of the 8 cubes contains 512 distinct four digit numbers
and each of the 8 cubes is made up of reversed digit,
perfect, magic and doubly magic constructions. The complete
Latin 4-cube of order 8 consists of 4096 distinct four digit
numbers, 0000 through 7777.

IV. In this table, we display cube 1 of a Latin 4-cube of
order 10, which is made up of 10 cubes, cube 1, ..., cube 10.
Each of the 10 cubes contains 1000 distinct four digit numbers
and each of the 10 cubes is made up of reversed digit and
magic constructions. The complete Latin 4-cube of order 10
consists of 10000 distinct four digit numbers, 0000 through
9999.


I.

0000  1221  2112
1122  2010  0201
2211  0102  1020
CUBE 1  SQUARE 1

1111  2002  0220
2200  0121  1012
0022  1210  2101
CUBE 1  SQUARE 2

2222  0110  1001
0011  1202  2120
1100  2021  0212
CUBE 1  SQUARE 3

1112  2000  0221
2201  0122  1010
0020  1211  2102
CUBE 2  SQUARE 1

2220  0111  1002
0012  1200  2121
1101  2022  0210
CUBE 2  SQUARE 2

0001  1222  2110
1120  2011  0202
2212  0100  1021
CUBE 2  SQUARE 3

2221  0112  1000
0010  1201  2122
1102  2020  0211
CUBE 3  SQUARE 1

0002  1220  2111
1121  2012  0200
2210  0101  1022
CUBE 3  SQUARE 2

1110  2001  0222
2202  0120  1011
0021  1212  2100
CUBE 3  SQUARE 3


II.

0000  2233  3311  1122
2332  0101  1023  3210
3113  1320  0202  2031
1221  3012  2130  0303
CUBE 1  SQUARE 1

2321  0112  1030  3203
0013  2220  3302  1131
1232  3001  2123  0310
3100  1333  0211  2022
CUBE 1  SQUARE 2

3132  1301  0223  2010
1200  3033  2111  0322
0021  2212  3330  1103
2313  0120  1002  3231
CUBE 1  SQUARE 3

1213  3020  2102  0331
3121  1312  0230  2003
2300  0133  1011  3222
0032  2201  3323  1110
CUBE 1  SQUARE 4


III.


0206 

4617 

1360 

5771 

3124 

7535 

2042 

6453 

6063 

2472 

7105 

3514 

5341 

1750 

4227 

0636 

6043 

2452 

7125 

3534 

5361 

1770 

4207 

0616 

0226 

4637 

1340 

5751 

3104 

7515 

2062 

6473 

5611 

1700 

4777 

0366 

6533 

2122 

7455 

3044 

3474 

7065 

2512 

6103 

0756 

4347 

1630 

5221 

3454 

7045 

2532 

6123 

0776 

4367 

1610 

5201 

5631 

1220 

4757 

0346 

6513 

2102 

7475 

3064 

4537 

0126 

5451 

1040 

7615 

3204 

6773 

2362 

2752 

6343 

3634 

7225 

1470 

5061 

0516 

4107 

2772 

6363 

3614 

7205 

1450 

5041 

0536 

4127 

4517 

0106 

5471 

1060 

4635 

3224 

6753 

2342 

1120 

5531 

0046 

4457

2202 

6613 

3364 

7775 

7345 

3754 

6223 

2632 

4067 

0476 

5101 

1510 

7365 

3774 

6203 

2612 

4047 

0456 

5121 

1530 

1100 

5511 

0066 

4477 

2222 

6633 

3344 

7755 

CUBE  1  SQUARE  1 

CUBE  1  SQUARE  5 

1570 

5161 

0416 

4007 

2652 

6243 

3734 

7325 

7715 

3304 

6673 

2262 

4437 

0026 

5551 

1140 

7735 

3324 

6653 

2242 

4417 

0006 

5571 

1160 

1550 

5141

0436 

4027 

2672 

6263 

3714 

7305 

4167 

0576 

5001 

1410 

7245 

3654 

6323 

2732 

2302 

6713 

3264 

7675 

1020 

5431 
ANALYTIC ROOTS OF THE PERIOD THREE QUADRATIC RECURSION POLYNOMIAL


Harry  J.  Auvermann 

U.  S.  Army  Atmospheric  Sciences  Laboratory 
White  Sands  Missile  Range,  New  Mexico  88002-5501 


ABSTRACT. This paper is concerned with stable points of iterates of the
function $F(d,z) = d - z^2$. The number of these stable points changes as the real
parameter d varies from -1/4 to 2. The number of stable points is termed the
period. Period one stable points are roots of the polynomial that results from
substituting z for F(d,z) in the above. Two applications of F(d,z) produce a
fourth order polynomial. Period two stable points are roots of this polynomial
that are easily obtained. Three applications of F(d,z) produce an eighth order
polynomial. Period three stable points are the roots of this polynomial. Two
of these roots are known from analysis of the lower iterates. Solution of a
sixth order polynomial then determines the period three stable points. An
analytic solution to the period four recursion polynomial was reported at last
year's conference. We apply the method of the former paper to the period three
case and show how the application must be changed for a period that is a prime
number.

INTRODUCTION. Motivation for this work has been covered in a previous
paper (Auvermann, 1992a). Briefly, the interest arises because transition from
order to disorder, similar to the transition of a fluid from laminar to turbulent
flow, has been observed in mathematical expressions such as one-dimensional maps,
an example being the recursion expression

(1)  $z_{k+1} = d - z_k^2$.

The parameter d in the mathematical process corresponds to the Reynolds number
in the fluid flow process. Corresponding to the random-like samples of the local
velocity in the flow are the iterates $z_k$ of the mathematical process. If d is
less than a certain value, called the accumulation point, stable points occur
(Feigenbaum, 1978). Stable points are repeating numbers in the sequence $z_k$.
This sequence is termed a limit cycle, and the number of points in the cycle is
termed the period. If d is larger than the accumulation point, there are
isolated intervals wherein stable points occur (Bergé, 1984). Between these
isolated intervals are intervals of chaos similar to fully developed turbulence.
That is, the iteration sequence never repeats itself and the values depend upon
the starting point. This similarity between iterate sequences and random
processes is the reason for the intense interest in the mathematics of
one-dimensional maps and limit cycles.
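
The notion of a limit cycle and its period can be illustrated numerically. The sketch below is not part of the original analysis: it simply iterates recursion (1) from $z_0 = 0$, discards a transient, and reports the smallest number of steps after which the iterate returns to itself. The function names, the starting value, the transient length, and the tolerance are all assumptions chosen for illustration.

```python
def step(d, z):
    """One application of the recursion z -> d - z**2."""
    return d - z * z


def observed_period(d, z0=0.0, transient=20000, max_period=16, tol=1e-9):
    """Smallest p <= max_period with z_{k+p} == z_k (within tol) after a transient.

    Returns None when no such p is found, for example in a chaotic interval.
    """
    z = z0
    for _ in range(transient):
        z = step(d, z)
    start = z
    for p in range(1, max_period + 1):
        z = step(d, z)
        if abs(z - start) < tol:
            return p
    return None


for d in (0.5, 1.0, 1.76):
    print(d, observed_period(d))
```

The sample values of d are chosen inside the period one, period two, and period three intervals, respectively; a value such as d = 1.76 lies just above the period three threshold 7/4.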

The condition where each point is the same stable point is analyzed by
substitution of $z_k$ for $z_{k+1}$ on the left side of equation (1) and solving for
the roots of the resulting polynomial. Bifurcation occurs here in the sense that
for larger values of d, the stable points repeat every second iteration. This
sequence of two is termed a period two limit cycle. The former case is termed a
period one limit cycle. In this paper our attention will be confined to period
three limit cycles. Prior work on period three stable points and the general
applicability of quadratic recursion has been covered in the former paper
(Auvermann, 1992a). Abel (1829) has shown that solutions of polynomials of this
type can be reduced to the solution of polynomials whose order is the same as the
period. Abel also gave a method by which all such lower order polynomials may be
solved. Netto (1898) has shown how the method of Lagrange resolvents can be used
to solve for period three and period four roots.

The  new  results  we  report  are  an  alternate  method  for  obtaining  the  roots  of  a 
period  three  quadratic  recursion  polynomial.  This  method,  having  only  been 
applied  previously  (Auvermann,  1992a)  to  period  four  and  now  to  period  three, 
is  not  as  general  as  the  methods  of  Abel  and  Netto.  However,  it  is  simpler 
than  that  of  Netto  and  serves  to  accomplish  the  reduction  of  the  full  recursion 
polynomial  to  the  polynomials  solved  in  general  by  Abel.  Some  comments  on 
extension of our method to other periods are contained in the conclusions section
below. Much of the algebra necessary to show the results presented here has been
left out. More details are contained in a companion government report
(Auvermann, 1992b).


The following list contains the essential elements of the remaining notation
used. The symbol n is used for the period of a particular limit cycle.

$P_n(d,z)$ = period n recursion polynomial of order $2^n$

$E_n(d,z)$ = a factor of $P_n(d,z)$ such that $P_n(d,z) = P_1(d,z)\,E_n(d,z)$

$Z(d,n,m)$ = mth ($m = 1, 2, \ldots, 2^n$) stable point of $P_n(d,z)$

$D_n$ = threshold of d where the roots of $P_n(d,z)$ become real

$j = (-1)^{1/2}$.

$E_n(d,z)$ is called here the reduced polynomial. For periods that are not prime
numbers, roots of some lower period polynomials are also roots of the higher
period polynomial. When all of these lower order polynomials are factored out,
the remaining polynomial is called the primitive polynomial. Using period eight
as an example, the primitive polynomial is defined as

$H_8(d,z)$ = a factor of $P_8(d,z)$ such that $P_8(d,z) = P_4(d,z)\,H_8(d,z)$.

Primitive  polynomials  are  discussed  briefly  in  the  conclusion  section.  When  the 
period  is  a  prime  number,  the  primitive  and  reduced  polynomials  are  identical. 

EXPRESSIONS ASSOCIATED WITH PERIODS ONE, TWO, AND THREE. In this section,
the expressions for the polynomials, stable points, and the thresholds for
periods one, two, and three will be developed. The first step is to write out
the corresponding stable point polynomials.

For a beginning value of the variable, $z_0$, and a given parameter d, a series of
iterates $z_k$ is produced by repeated application of equation (1). If d is greater
than $D_1$, $z_k$ approaches a fixed point $Z(d,1,m)$ as k increases. This stability
occurs when $z_{k+1}$ is equal to $z_k$ in equation (1). The values of z that satisfy
this condition are the roots of the period one polynomial

(2)  $P_1(d,z) = z^2 + z - d$,
where the serial number k has been dropped for writing economy. The period two
polynomial is obtained by developing the expression for the iterate two later in
the sequence. Hence,

(3)  $P_2(d,z) = (z^2 - d)^2 + z - d$.

From equations (2) and (3), one has

(4)  $P_2(d,z) = P_1(d,z)\,[P_1(d,z) - 2z + 1]$,

(5)  $P_2(d,z) = P_1(d,z)\,E_2(d,z)$.

Equations (4) and (5) serve to define reduced polynomial $E_2(d,z)$. The period
three polynomial is obtained in a similar manner by developing the expression for
the iterate three later in the sequence. Hence,

(6)  $P_3(d,z) = [(z^2 - d)^2 - d]^2 + z - d$.

From equations (2) and (6)

(7)  $P_3(d,z) = P_1(d,z)\,\{[P_1(d,z)]^3 - 2(2z - 1)[P_1(d,z)]^2 + (4z^2 - 6z + 1)\,P_1(d,z) + 2z(2z - 1) + 1\}$,

(8)  $P_3(d,z) = P_1(d,z)\,E_3(d,z)$.

Equations (7) and (8) serve to define reduced polynomial $E_3(d,z)$ and show that
$P_1(d,z)$ is a factor of $P_3(d,z)$.
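
The factorizations of the period two and period three polynomials can be checked with exact rational arithmetic. The sketch below is illustrative only: the function names and the sample grid are assumptions, and the reduced polynomial for period three is written in powers of $P_1$ with the coefficients $-2(2z-1)$, $(4z^2 - 6z + 1)$, and $2z(2z-1) + 1$, which make the identity hold exactly. Because both sides are polynomials of degree at most 8 in z and at most 4 in d, agreement on a grid with more points than these degrees in each variable establishes the identity.

```python
from fractions import Fraction


def P1(d, z):
    return z * z + z - d


def P2(d, z):
    return (z * z - d) ** 2 + z - d


def P3(d, z):
    return ((z * z - d) ** 2 - d) ** 2 + z - d


def E2(d, z):
    """Reduced period two polynomial, written in terms of P1."""
    return P1(d, z) - 2 * z + 1


def E3(d, z):
    """Reduced period three polynomial, written in powers of P1."""
    p = P1(d, z)
    return (p ** 3 - 2 * (2 * z - 1) * p ** 2
            + (4 * z * z - 6 * z + 1) * p + 2 * z * (2 * z - 1) + 1)


# Exact sample grid: 12 values of d and 19 values of z.
samples = [(Fraction(a, 3), Fraction(b, 5))
           for a in range(-4, 8) for b in range(-9, 10)]

period_two_ok = all(P2(d, z) == P1(d, z) * E2(d, z) for d, z in samples)
period_three_ok = all(P3(d, z) == P1(d, z) * E3(d, z) for d, z in samples)
print(period_two_ok, period_three_ok)
```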

Roots of polynomial $P_1(d,z)$ of equation (2) are given by

(9)  $Z(d,1,1) = \tfrac{1}{2}\,[-1 + (1 + 4d)^{1/2}]$,

(10)  $Z(d,1,2) = \tfrac{1}{2}\,[-1 - (1 + 4d)^{1/2}]$.

$Z(d,1,1)$ is a stable fixed point and $Z(d,1,2)$ is an unstable fixed point
(Feigenbaum, 1983). From equation (8) we see that two roots of the period three
polynomial $P_3(d,z)$ are the same as the roots of $P_1(d,z)$. The remaining period
three roots are obtained by solving $E_3(d,z)$ from equation (7). The period three
reduced polynomial fully expanded is

(11)  $E_3(d,z) = z^6 - z^5 + (1 - 3d)\,z^4 - (1 - 2d)\,z^3 + (1 - 3d + 3d^2)\,z^2 - (1 - 2d + d^2)\,z + (1 - d + 2d^2 - d^3)$.

The thresholds will now be considered. For equations (9) and (10) to be real,
the radical must be nonnegative. This condition on d defines the period one
threshold.

(12)  $D_1 = -1/4$.

Similar conditions define the thresholds for period two and period four
(Auvermann, 1992a) as $D_2 = 3/4$ and $D_4 = 5/4$. The accumulation point has been
determined numerically (Bergé, 1984), which is $D_\infty = 1.4011519$ when transformed
to the parameter of recursion relation (1). The period three threshold, not
apparent from equation (11), has been determined numerically to be

(13)  $D_3 = 7/4$.

In the next section we show how this number arises in the algebra of the root
determination process. Thus, the period three interval (where the polynomial
roots are real and are stable) begins above the accumulation point at 7/4.

PERIOD THREE ROOT EXPRESSIONS. The objective is to find analytical
expressions for the six roots of $E_3(d,z)$. It is instructive to investigate the
root behavior when d is zero to determine how the roots are connected to one
another. Under the condition d = 0, equation (6) becomes

(14)  $P_3(0,z) = z^8 + z = 0$.

From equation (14) we see that zero is one root. The other roots are determined
from $z^7 + 1 = 0$ and are

      $Z(0,3,1) = 0$,  $Z(0,3,2) = -1$,

(15)  $Z_\ell(0) = Z(0,3,\ell+2) = e^{j(2\ell - 7)(\pi/7)}$,  $\ell = 1, 2, \ldots, 6$.

In equation (15) we have used a more convenient index $\ell$ and will continue so
doing in the remainder of this paper. By applying equation (1) to $Z_1$ in
succession, we find the sequence is $Z_1, Z_2, Z_4, Z_1$, etc. By applying equation (1)
to $Z_6$ in succession, we find the sequence is $Z_6, Z_5, Z_3, Z_6$, etc. For this case
(d = 0), the two sequences correspond in conjugate pairs. Because the roots are
continuous functions of d, we write the relationships for the roots in general
as


(16)  $Z_2(d) = d - [Z_1(d)]^2$,  $Z_5(d) = d - [Z_6(d)]^2$,

(17)  $Z_4(d) = d - [Z_2(d)]^2$,  $Z_3(d) = d - [Z_5(d)]^2$,

(18)  $Z_1(d) = d - [Z_4(d)]^2$,  $Z_6(d) = d - [Z_3(d)]^2$.
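
The two sequences at d = 0 can be reproduced directly from the root formula $Z_\ell(0) = e^{j(2\ell - 7)(\pi/7)}$. The following Python sketch is an illustration under that formula; the helper names and the matching tolerance are assumptions.

```python
import cmath


def Z(l):
    """Root Z_l(0) = exp(j (2l - 7) pi / 7) of z**7 + 1 = 0, for l = 1..6."""
    return cmath.exp(1j * (2 * l - 7) * cmath.pi / 7)


def index_of(z, tol=1e-12):
    """Index l in 1..6 whose root Z_l(0) matches z, or None if there is none."""
    for l in range(1, 7):
        if abs(Z(l) - z) < tol:
            return l
    return None


def orbit(l, steps=3, d=0.0):
    """Indices visited when the recursion z -> d - z**2 is applied to Z_l(0)."""
    z = Z(l)
    visited = [l]
    for _ in range(steps):
        z = d - z * z
        visited.append(index_of(z))
    return visited


print(orbit(1), orbit(6))
# Conjugate pairing across the two sequences: Z_l and Z_(7-l).
print(all(abs(Z(l).conjugate() - Z(7 - l)) < 1e-12 for l in range(1, 7)))
```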

At this point we apply the method that was successful in solving the period four
polynomial (Auvermann, 1992a), but in a different way. In period four for d = 0,
the roots are in conjugate pairs within the same sequence. Here they are in
conjugate pairs across sequences. The method is essentially a change of
variables. We write

(19)  $Z_1 = A + B$,  $Z_6 = A - B$,

(20)  $Z_2 = C + D$,  $Z_5 = C - D$,

(21)  $Z_4 = E + F$,  $Z_3 = E - F$.

Here, A, B, C, D, E, and F are the new variables, each a function of d. B, D,
and F are imaginary numbers for $0 < d < D_3$, but will change to real above $D_3$.
They were written as a general number because they will have the same functional
form throughout the range of the d parameter. Therefore, the functional
dependence has been dropped to provide additional writing economy. Rewriting
equations (16), (17), and (18) in the new variables, we have

(22)  $C + D = d - A^2 - 2AB - B^2$,  $C - D = d - A^2 + 2AB - B^2$,

(23)  $E + F = d - C^2 - 2CD - D^2$,  $E - F = d - C^2 + 2CD - D^2$,

(24)  $A + B = d - E^2 - 2EF - F^2$,  $A - B = d - E^2 + 2EF - F^2$.

Combining appropriately, we find that

(25)  $C = d - A^2 - B^2$,  $D = -2AB$,

(26)  $E = d - C^2 - D^2$,  $F = -2CD$,

(27)  $A = d - E^2 - F^2$,  $B = -2EF$,

and that

(28)  $1 = -8ACE$.

The next step is to define the polynomials associated with each root set. This
is done as follows.

(29)  $P_+ = (z - Z_1)(z - Z_2)(z - Z_4)$,

(30)  $P_- = (z - Z_6)(z - Z_5)(z - Z_3)$.

Expanding equation (29), we have

      $P_+ = z^3 - (Z_1 + Z_2 + Z_4)\,z^2 + (Z_1 Z_2 + Z_2 Z_4 + Z_4 Z_1)\,z - Z_1 Z_2 Z_4$,

(31)  $P_+ = z^3 - [(A + C + E) + (B + D + F)]\,z^2$
      $\quad + [(AC + CE + EA + BD + DF + FB) + (AD + BC + CF + DE + BE + AF)]\,z$
      $\quad - [(ACE + ADF + BCF + BDE) + (ACF + ADE + BCE + BDF)]$,

(32)  $P_+ = z^3 - (R + S)\,z^2 + (T + U)\,z - (V + W)$.

In equation (32), the symbols R, S, T, U, V, and W represent the corresponding
combinations of A, B, C, D, E, and F in equation (31). These are

(33)  $R = A + C + E$,

(34)  $S = B + D + F$,

(35)  $T = AC + CE + EA + BD + DF + FB$,

(36)  $U = AD + BC + CF + DE + BE + AF$,

(37)  $V = ACE + ADF + BCF + BDE$,

(38)  $W = ACF + ADE + BCE + BDF$.

These particular combinations were chosen because the set S, U, W changes sign in
$P_-$. As a result, we can write

(39)  $P_- = z^3 - (R - S)\,z^2 + (T - U)\,z - (V - W)$.

Since the reduced polynomial is the product of $P_+$ and $P_-$, we can write

(40)  $E_3(d,z) = P_+ P_- = z^6 - 2R\,z^5 + (R^2 - S^2 + 2T)\,z^4 - 2(RT - SU + V)\,z^3$
      $\quad + (2RV - 2SW + T^2 - U^2)\,z^2 - 2(TV - UW)\,z + (V^2 - W^2)$.

Equating coefficients between equation (40) and equation (11), we find

(41)  $R = 1/2$,

(42)  $R^2 - S^2 + 2T = 1 - 3d$,

(43)  $-2(RT - SU + V) = -(1 - 2d)$,

(44)  $2RV - 2SW + T^2 - U^2 = 1 - 3d + 3d^2$,

(45)  $-2(TV - UW) = -(1 - 2d + d^2)$,

(46)  $V^2 - W^2 = 1 - d + 2d^2 - d^3$.

Another relation is obtained from equation (36) above for U by adding and
subtracting the appropriate terms as follows

      $U = AD + AF + AB - AB + CB + CF + CD - CD + ED + EB + EF - EF$,

(47)  $U = (A + C + E)(B + D + F) - (AB + CD + EF)$,

(48)  $U = RS + (1/2)\,S = S$.

The simplification from equation (47) to equation (48) came about from applying
the right members of equations (25), (26), and (27) and equations (33), (34),
and (41).
Substituting equations (41) and (48) into equations (42) through (45) (equation
(46) is not needed), we obtain

(49)  $U^2 = 2T + 3(d - 1/4)$,

(50)  $T - 2U^2 + 2V = 1 - 2d$,

(51)  $V - 2UW + T^2 - U^2 = 1 - 3d + 3d^2$,

(52)  $2UW - 2TV = -(1 - 2d + d^2)$.

Substituting equation (49) into equation (50), equation (52) into equation (51),
and then the result into the new (50), we obtain

(53)  $T^2 + 2Td + d^2 = 1/4$.

Since equation (53) is a quadratic involving only T, it may be solved using the
quadratic formula and then the remaining coefficients of equation (32) are
available directly. A bit of caution is required here, however, to ensure that
the remaining identifications are consistent. Solving equation (49) for T and
substituting into equation (53) and using equation (48), we obtain

(54)  $S^4 - 2(d - 3/4)\,S^2 = 7/16 + (3/2)\,d - d^2$.

Completing the square, we obtain

(55)  $[S^2 - (d - 3/4)]^2 = 1$.

The roots of equation (55) are

      $S = \pm[\pm 1 + (d - 3/4)]^{1/2}$
      $\;\; = \pm(d + 1/4)^{1/2}$  or

(56)  $\;\; = \pm(d - 7/4)^{1/2}$.

We will choose the positive root from equation (56) because of the known
threshold condition (13), but it is interesting that the period one threshold
condition has shown up. The implications of this have not been investigated.

(57)  $S = +(d - 7/4)^{1/2} = U$.

This result may now be used to obtain all the other coefficients by substitution
in sign sensitive expressions.

(58)  $T = (1/2)\,[S^2 - 3(d - 1/4)] = -(d + 1/2)$,

(59)  $V = (3/2)\,T + (2d - 1/4) = (1/2)\,d - 1$,

(60)  $W = [TV - (1/2)(1 - d)^2]/S = -d\,(d - 7/4)^{1/2}$.
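
The closed forms for R, S, T, U, V, and W can be substituted back into the product expansion of $P_+ P_-$ and compared with the expanded reduced polynomial coefficient by coefficient. The sketch below does this numerically with complex arithmetic, so that values of d below 7/4 (where S is imaginary) are covered as well. The function names and the sample values of d are assumptions made for illustration.

```python
import cmath


def coefficients(d):
    """R, S, T, U, V, W as functions of d, from equations (41), (48), (57)-(60)."""
    S = cmath.sqrt(d - 1.75)
    R = 0.5
    U = S
    T = -(d + 0.5)
    V = 0.5 * d - 1
    W = -d * S
    return R, S, T, U, V, W


def product_coefficients(d):
    """Coefficients of P+ P- from equation (40), highest power of z first."""
    R, S, T, U, V, W = coefficients(d)
    return [1,
            -2 * R,
            R * R - S * S + 2 * T,
            -2 * (R * T - S * U + V),
            2 * R * V - 2 * S * W + T * T - U * U,
            -2 * (T * V - U * W),
            V * V - W * W]


def reduced_coefficients(d):
    """Coefficients of E3(d, z) from equation (11), highest power of z first."""
    return [1, -1, 1 - 3 * d, -(1 - 2 * d), 1 - 3 * d + 3 * d * d,
            -(1 - 2 * d + d * d), 1 - d + 2 * d * d - d ** 3]


def max_mismatch(d):
    """Largest absolute difference between the two coefficient lists."""
    pairs = zip(product_coefficients(d), reduced_coefficients(d))
    return max(abs(a - b) for a, b in pairs)


for d in (0.0, 0.5, 1.75, 1.8, 2.0):
    print(d, max_mismatch(d))
```

The mismatch should be at the level of floating point rounding for every d tried, which is a quick consistency check on the sign choices.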
S, U, and W are proportional to the factor $(d - 7/4)^{1/2}$, which is zero at
$d = D_3$, the threshold condition. This threshold condition manifests itself in the
recursion polynomial in a remarkably simple way. Equation (32) may now be
written with its full d dependence as

(61)  $P_+ = z^3 - [(1/2) + (d - 7/4)^{1/2}]\,z^2 + [-(d + 1/2) + (d - 7/4)^{1/2}]\,z - [(1/2)(d - 2) - d\,(d - 7/4)^{1/2}]$.

The roots of equation (61) will be derived below. Since the coefficients of
equation (61) can be complex in the general case, there is not in general a real
root. At threshold the polynomial is

(62)  $P_+(D_3,z) = z^3 - (1/2)\,z^2 - (9/4)\,z + (1/8)$.

The numerical values of these roots are

(63)  $Z_1(D_3) = -1.30193773$,

(64)  $Z_2(D_3) = 0.05495813$,

(65)  $Z_4(D_3) = 1.74697960$.

Repeated application of equation (1) cycles through equations (63), (64), and
(65) as expected. Since the root expressions for general d are so complicated,
only one will be written out. The cubic pattern (Abramowitz, 1970) is

(66)  $z^3 + a_2 z^2 + a_1 z + a_0 = 0$.

In terms of these coefficients, four other auxiliary quantities are defined

(67)  $q = (1/3)\,a_1 - (1/9)\,a_2^2$,

(68)  $r = (1/6)(a_1 a_2 - 3a_0) - (1/27)\,a_2^3$,

(69)  $s_1 = [r + (q^3 + r^2)^{1/2}]^{1/3}$,

(70)  $s_2 = [r - (q^3 + r^2)^{1/2}]^{1/3}$.

The least complicated root is

(71)  $Z = s_1 + s_2 - a_2/3$.

In the case of equation (61), the pattern coefficients become

(72)  $a_2 = -[(1/2) + (d - 7/4)^{1/2}]$,

(73)  $a_1 = +[-(d + 1/2) + (d - 7/4)^{1/2}]$,

(74)  $a_0 = -[(1/2)(d - 2) - d\,(d - 7/4)^{1/2}]$.

The least complicated root, identified with $Z_4(d)$, is then

(75)  $Z_4(d) = \tfrac{1}{6}\,[1 + (4d - 7)^{1/2}]$
      $\quad + \tfrac{1}{3 \cdot 2^{1/3}}\,\{[-14 + 12d - (1 + 8d)(4d - 7)^{1/2}] - 3^{3/2}\,[-(4d - 7) - 16d^2 + 8d\,(4d - 7)^{1/2}]^{1/2}\}^{1/3}$
      $\quad + \tfrac{1}{3 \cdot 2^{1/3}}\,\{[-14 + 12d - (1 + 8d)(4d - 7)^{1/2}] + 3^{3/2}\,[-(4d - 7) - 16d^2 + 8d\,(4d - 7)^{1/2}]^{1/2}\}^{1/3}$.

Substitution of the threshold $D_3$ for d returns equation (65). $Z_4(1.8)$ is
1.74734295. Three applications of equation (1) return this same number,
verifying the analytical choices.
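
The cubic pattern can also be evaluated numerically instead of through the closed form. The sketch below follows equations (66) through (74) with complex arithmetic. One detail is an assumption of this sketch rather than a statement from the paper: $s_2$ is computed as $-q/s_1$, which selects the cube root branch satisfying $s_1 s_2 = -q$, and $s_1$ is taken as the principal cube root. The function names are illustrative.

```python
import cmath


def p_plus_coefficients(d):
    """Pattern coefficients a2, a1, a0 of P+ from equations (72)-(74)."""
    s = cmath.sqrt(d - 1.75)
    a2 = -(0.5 + s)
    a1 = -(d + 0.5) + s
    a0 = -(0.5 * (d - 2) - d * s)
    return a2, a1, a0


def least_complicated_root(a2, a1, a0):
    """Root s1 + s2 - a2/3 of z**3 + a2 z**2 + a1 z + a0 = 0."""
    q = a1 / 3 - a2 * a2 / 9
    r = (a1 * a2 - 3 * a0) / 6 - a2 ** 3 / 27
    s1 = (r + cmath.sqrt(q ** 3 + r * r)) ** (1 / 3)  # principal cube root
    s2 = -q / s1  # branch of the second cube root with s1 * s2 = -q
    return s1 + s2 - a2 / 3


def z4(d):
    return least_complicated_root(*p_plus_coefficients(d))


def three_steps(d, z):
    """Apply the recursion z -> d - z**2 three times."""
    for _ in range(3):
        z = d - z * z
    return z


for d in (1.75, 1.8):
    root = z4(d)
    print(d, root, abs(three_steps(d, root) - root))
```

For the two sample values of d the computed root has a negligible imaginary part, and it can be compared with the values quoted in the text for the threshold and for d = 1.8.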

CONCLUDING REMARKS. The author is indebted to Dr. D. M. Giarrusso, then
a member of the Mathematical Sciences Institute at Cornell, now at St. Lawrence
University, for the location of the early works of Abel (1829) and Netto (1898).
The solution method reported in this paper is independent of Lagrange, but of
course gives the same expressions for the roots. Abel proves that solutions of
polynomials whose roots are connected by a rational expression and repeat after
n applications of this expression (our period n quadratic recursion is a special
case) have the following property. Given the order of the recursion polynomial
is $\nu = m\,n$, with $\nu$, n, m integers, then this polynomial may be factored into m
polynomials of order n. These polynomials I will call root polynomials. Root
polynomials are always solvable for any n because the coefficients in them are
related to each other by rational functions that can be determined. Further, if
m is a product of other integers, say $\nu_1, \nu_2, \ldots, \nu_j$, then root polynomials may
be separated by solution of j other polynomials whose orders are the $\nu$'s. These
polynomials I will call separation polynomials. The following table is
constructed for quadratic recursion to reflect this property.


Table 1. Order Properties of Quadratic Recursion Polynomials

                        Polynomial Type
Period    Recursion    Primitive    Root    Separation

  1           2            2          2       N/A
  2           4            2          2       N/A
  3           8            6          3       2
  4          16           12          4       3
  5          32           30          5       2,3
  6          64           54          6       3,3
  7         128          126          7       2,3,3
  8         256          240          8       2,3,5
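
The entries of table 1 for periods 3 through 8 follow from simple counting, which the following Python sketch reproduces. It assumes that the primitive order is the recursion order $2^n$ minus the primitive orders of all proper divisors of n, and that the separation orders are the prime factors of m, where the primitive order equals m times n. The function names are illustrative.

```python
def primitive_order(n):
    """Order left after factoring out the primitive polynomials of proper divisors."""
    lower = sum(primitive_order(k) for k in range(1, n) if n % k == 0)
    return 2 ** n - lower


def prime_factors(m):
    """Prime factors of m in nondecreasing order, with multiplicity."""
    factors = []
    p = 2
    while m > 1:
        while m % p == 0:
            factors.append(p)
            m //= p
        p += 1
    return factors


def table_row(n):
    """(period, recursion order, primitive order, root order, separation orders)."""
    primitive = primitive_order(n)
    return n, 2 ** n, primitive, n, prime_factors(primitive // n)


rows = [table_row(n) for n in range(3, 9)]
for row in rows:
    print(row)
```

Periods 1 and 2 are left out of the loop because the table lists no separation polynomial for them.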


As  we  have  seen  above  for  period  3,  the  separation  polynomial  (equation  (53))  is 
order  2  in  accordance  with  table  1.  For  period  4  (Auvermann,  1992a,  equation 
(35))  the  separation  polynomial  was  easily  found  by  the  method  used  there  and  was 
indeed order 3. The separation was so straightforward for period 4 that the
root  polynomials  did  not  have  to  be  solved.  Once  the  separation  polynomial  was 
solved,  finding  the  stable  points  involved  only  a  series  of  square  root 
extractions.  This  simple  procedure  was  not  used  above  for  period  3,  but  rather 
the  root  polynomial  itself  was  solved.  The  fact  that  equation  (28)  and  equation 
(80)  were  not  used  in  the  solution  perhaps  indicates  that  some  manipulation  of 
equations (25), (26), (27), and (28) not yet found would result in the
identification of equation (53) without resort to the root polynomials.

There  is  more  than  one  separation  polynomial  for  periods  5  through  8  because  m 
is  not  prime.  The  separation  polynomials  for  periods  5,  6,  and  7,  if  they  can 
be  written  down,  are  all  solvable  as  indicated  in  table  1.  Period  8  is  the 
lowest  with  a  separation  polynomial  of  order  5,  which  is  not  solvable  by 
conventional  techniques.  Experience  with  the  method  of  the  present  paper  has  not 
been  deep  enough  to  determine  if  it  will  aid  solution  of  periods  5,  6,  and  7. 
Certainly,  expressions  similar  to  (25),  ...,(28)  appear  for  all  periods.  These 
facts  coupled  with  the  result  from  Abel  that  the  individual  polynomials  for  each 
root  set  can  be  solved  algebraically  give  substance  to  the  idea  that  our  method 
can be extended further than period four.
Abstract. This paper describes a computational scheme for modeling one-dimensional
flow of an elasto-plastic material. The method is based upon a conservative formulation for
elasto-plasticity in the Eulerian frame; it uses a second-order Godunov scheme. To validate
the method, numerical simulations of loading in plate impact problems are compared with
experimental results.


1.  Introduction 


Metals  and  other  elasto-plastic  materials  exhibit  a  variety  of  interesting  and  complicated 
wave  patterns  when  they  are  subjected  to  high  stresses  and  strains.  For  instance,  a  shock 
wave  can  split  into  a  double  wave  pattern,  with  an  elastic  precursor  followed  by  a  plastic 
compression  wave.  The  precise  structure  of  such  a  wave  reflects  the  constitutive  properties  of 
the  material,  including  the  elastic  response,  yield  criterion,  and  rate  sensitivity.  To  properly 
resolve elasto-plastic phenomena, numerical methods for modeling the flow must be of high
quality. 

In this article, we describe a numerical method for modeling one-dimensional
elasto-plastic flow. The method is based on a formulation of the governing equations as
conservation laws in the Eulerian frame: it employs a high-resolution Godunov method. We
validate our method by applying it to the problem of loading in high-velocity impact of metal
plates. The results are compared with experimental measurements and with other numerical
computations. These comparisons show that the method is successful at resolving the flow, and
suggest that extensions of our approach to loading-unloading and multidimensional flows
will be useful.

Our computational method is based on a fully conservative Eulerian formulation of the
equations of motion for an elasto-plastic medium proposed by Plohr and Sharp [13]. By
contrast, the numerical methods commonly used for such computations are nonconservative
and Lagrangian. Our computational method has several advantages over the more traditional
methods. The Eulerian formulation avoids the problems of poor resolution and numerical
diffusion caused by the spatially nonuniform grids and frequent remeshing of Lagrangian
methods; instead, we are able to use a fixed spatial grid. The conservative formulation
of the flow equations ensures that the speeds of the discontinuous waves produced by the
interaction are correctly computed. Also, the conservation form of the equation makes it
possible to use a high-resolution Godunov method for the computation of the flow. Such
methods have been extremely successful in computations of gas dynamical flows, and they
are beginning to prove themselves as equally successful in the computation of elastic and
plastic waves [18, 2, 5]. In future extensions of this work, we intend to implement front
tracking; we expect that tracking will improve the resolution of these waves considerably.

Figure 1: A schematic representation of the impact problem. A flyer plate moving at high
velocity collides with a stationary target plate.

To  test  our  implementation,  we  have  used  it  to  study  the  high-velocity  impact  of  two 
tantalum  metal  plates.  A  schematic  diagram  of  this  experiment  is  shown  in  figure  1.  A  flyer 
plate  moving  to  the  right  collides  with  a  stationary  target  plate;  the  velocity  of  the  flyer 
is  sufficiently  high  to  produce  plastic  deformation  in  both  plates.  The  outer  edges  of  the 
two  plates,  the  so-called  free  boundaries,  are  not  constrained.  The  results  of  our  simulations 
are  compared  with  the  numerical  computations  of  Steinberg  and  Lund  [16]  and  with  the 
experimental  data  cited  in  this  reference. 

Although the computations presented in this article are only one-dimensional, the conservative Eulerian formulation applies to fully three-dimensional flows. Our implementation
has  been  designed  to  be  extended  to  treat  multidimensional  problems.  This  extension  is  now 
being  developed,  and  will  be  applied  to  study  important  problems  in  elasto-plastic  flow,  such 
as  the  production  of  shear  bands  and  the  multidimensional  stability  of  elastic  and  plastic 
waves. 
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2.  Elasto-Plastic  Dynamics 

This  section  briefly  describes  the  system  of  nonlinear  hyperbolic  equations  governing  the 
finite  deformation  of  an  elastic-plastic  material  in  the  Eulerian  frame.  We  use  here  the  fully 
conservative  formulation  of  these  equations  recently  proposed  by  Plohr  and  Sharp  [13]. 

The  conservation  laws  are  supplemented  by  constitutive  relations  for  the  elasto-plastic 
material.  In  this  work,  we  take  the  material  to  be  hyperelastic  [8,  14],  with  strain  energy  of 
the  form  appropriate  for  deformations  with  small  anisotropy  [20,  6];  thus  the  internal  energy 
function is the sum of a hydrostatic term and a small shear term. The plastic flow rule we
adopt  is  that  of  Levy-St.  Venant  (or  Prandtl-Reuss):  the  plastic  strain  rate  is  parallel  to 
the  deviatoric  stress.  We  impose  this  rule  in  the  intermediate  configuration,  so  that  plastic 
deformation  preserves  volume. 

The  main  restriction  in  the  current  numerical  calculations  is  that  the  physical  variables 
depend  upon  only  one  spatial  variable.  In  our  application  to  normal  impact  problems,  we 
also  make  the  simplifying  assumption  of  uniaxial  deformations  (cf.  references  [17]  and  [7]). 

Let  us  now  summarize  the  conservation  laws  and  the  constitutive  relations.  For  more  on 
the  derivation  of  the  equations,  see  reference  [13]. 

2.1 Conservation Laws. The deformation of a continuous medium is represented, in the Eulerian picture, by the inverse deformation map $\psi^\alpha$ carrying the medium from its current configuration to its undeformed configuration: $X^\alpha = \psi^\alpha(x^i, t)$, where $\alpha, i = 1, 2, 3$. The evolution of this map is governed by partial differential equations of second order, but these equations can be reduced to first order by introducing the derivatives of $\psi^\alpha$. The gradient $g^\alpha{}_i := \partial\psi^\alpha/\partial x^i$ is called the inverse deformation gradient, whereas the time derivative is related to the particle velocity $v^i$ through $\partial\psi^\alpha/\partial t = -g^\alpha{}_k v^k$. We use $g^\alpha{}_i$ and $v^i$ as flow variables in place of $\psi^\alpha$.

The governing conservation laws involve several other quantities that characterize the state of the material and its dynamic response: the Cauchy stress tensor $\sigma^{ij}$, the specific internal energy $\mathcal{E}$, the plastic strain tensor $E^p_{\alpha\beta}$ (measured relative to the undeformed configuration), the plastic source term $\Lambda_{\alpha\beta}$, the strain-hardening parameter $\kappa$, and the hardening source term $h$. These quantities are discussed further below. It is also convenient to introduce the deformation gradient tensor $F^i{}_\alpha := (g^{-1})^i{}_\alpha$, the Jacobian $J := \det F$, the mass density $\rho := \rho_0 J^{-1}$ (with $\rho_0$ being the mass density of the undeformed material), and the specific volume $\tau := 1/\rho$.

Throughout this paper, we assume that the flow variables depend upon only one space variable, say $x := x^1$. In this instance, the principle of continuity (i.e., equality of mixed partial derivatives of $\psi^\alpha$), the conservation laws of momentum and energy, and the flow rules for the internal plastic variables can be expressed as follows:

$$\frac{\partial}{\partial t}(\rho\kappa) + \frac{\partial}{\partial x}(\rho\kappa v^1) = \rho h, \tag{2.5}$$

$$\frac{\partial}{\partial t}(\rho E^p_{\alpha\beta}) + \frac{\partial}{\partial x}(\rho E^p_{\alpha\beta} v^1) = \rho\Lambda_{\alpha\beta}. \tag{2.6}$$

The principle of continuity implies that the components $g^\alpha{}_i$ for $\alpha = 1, 2, 3$ and $i = 2, 3$ are independent of time. For simplicity, we assume that the initial state is unstrained, i.e., $g|_{t=0}$ is the identity matrix. Therefore, for all time,

$$g^1{}_2 = g^1{}_3 = g^2{}_3 = g^3{}_2 = 0, \qquad g^2{}_2 = g^3{}_3 = 1. \tag{2.7}$$

In particular, $g^1{}_1 = J^{-1}$.

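Let us denote by $U$ the 14-component vector of conserved quantities

$$U := \left(g^\alpha{}_1,\ \rho v^i,\ \rho(\tfrac12 v_i v^i + \mathcal{E}),\ \rho\kappa,\ \rho E^p_{(a)}\right)^T, \tag{2.8}$$

with $\alpha, i = 1, 2, 3$. Here $E^p_{(a)}$ represents Voigt indexing of the six independent components of $E^p_{\alpha\beta}$ (the indices $(a) = 1, \ldots, 6$ corresponding to $\alpha\beta = 11, 22, 33, 23, 31, 12$). Then Eqs. (2.1) and (2.3)-(2.6) form a system of conservation laws

$$\frac{\partial U}{\partial t} + \frac{\partial}{\partial x} H(U) = S(U). \tag{2.9}$$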

2.2 Constitutive Relations. Since we assume that the response of the material to deformation is hyperelastic, the thermodynamic variables determine the specific internal energy $\mathcal{E}$ of the material through an equation of state [8] of the form

$$\mathcal{E} = \mathcal{E}(E_{\alpha\beta}, E^p_{\alpha\beta}, \kappa, \eta), \tag{2.10}$$

where

$$E_{\alpha\beta} := \tfrac12\left[F^k{}_\alpha F^k{}_\beta - \delta_{\alpha\beta}\right] \tag{2.11}$$

is the (Lagrangian) strain tensor and $\eta$ the specific entropy. The Clausius-Duhem thermodynamic inequality leads to the identification

(2.12)

The motivation for the specific choice of energy function in Eq. (2.10) derives from the following microscopic picture. At each point of the material, we imagine cutting out a small neighborhood and relaxing the forces on its surface, recovering an intermediate stress-free configuration. Because of plastic working, this configuration differs from the original Lagrangian configuration. The linear map $(F^p)^a{}_\alpha$ carrying the Lagrangian configuration to the intermediate configuration is therefore regarded as a measure of local, irrecoverable deformation. This suggests decomposing the deformation gradient as a product [10] of elastic and plastic parts:

$$F^i{}_\alpha = (F^e)^i{}_a\,(F^p)^a{}_\alpha. \tag{2.13}$$

The elastic deformation gradient $(F^e)^i{}_a$ can be regarded as mapping the intermediate configuration to the Eulerian configuration. For more on the Lagrangian, intermediate, and Eulerian configurations, see reference [14].

In these terms, we identify the Lagrangian plastic strain tensor as

$$E^p_{\alpha\beta} = \tfrac12\left[(F^p)^a{}_\alpha (F^p)^a{}_\beta - \delta_{\alpha\beta}\right]. \tag{2.14}$$

We also define the elastic strain, measured with respect to the intermediate configuration,

$$E^e_{ab} := \tfrac12\left[(F^e)^i{}_a (F^e)^i{}_b - \delta_{ab}\right]. \tag{2.15}$$

Since we are modeling metals at high strain rates, we assume the material to be isotropic in the intermediate configuration. Therefore the internal energy depends on the principal isotropic invariants of $E^e_{ab}$ alone [10]. These isotropic invariants, in fact, can be written as functions solely of $E_{\alpha\beta}$ and $E^p_{\alpha\beta}$ [13]; therefore isotropic energy functions have the form of Eq. (2.10). (From this perspective, the plastic and elastic parts of the deformation gradient appearing in the decomposition (2.13) need never be introduced.)

Of particular importance is an invariant measure of shear strain. In analogy with the definition used in reference [6], we define such a measure $\epsilon^e$ by

$$\epsilon^e := \|\operatorname{dev} E^e\|, \tag{2.16}$$

where the norm $\|A\|$ and deviator $\operatorname{dev} A$ of a $3 \times 3$ matrix $A$ are defined by $\|A\|^2 = \operatorname{tr} A^2$ and $\operatorname{dev} A := A - \tfrac13 (\operatorname{tr} A)\, I$, with $I$ being the $3 \times 3$ identity matrix. Equivalently, $\epsilon^e$ satisfies

$$(\epsilon^e)^2 = \tfrac14\|\operatorname{dev} b^e\|^2, \tag{2.17}$$

where $b^e$ is the elastic Finger tensor

$$b^e := F^e (F^e)^T = F\,(I + 2E^p)^{-1} F^T. \tag{2.18}$$


Following Wallace [20] and Garaizar [6], we take the specific internal energy $\mathcal{E}$ to be

$$\mathcal{E} = \mathcal{E}_h(\tau,\eta) + \tau_0\, G(\tau,\eta)\,(\epsilon^e)^2, \tag{2.19}$$

where the first term represents a hydrostatic contribution to the energy and the second accounts for small shear deformations. Models for the hydrostatic energy $\mathcal{E}_h$ and the shear modulus $G$ are given below. Using the small-shear equation of state (2.19), the identification (2.12) leads to the stress-strain relation

$$\sigma = \left[-p + \tau_0 \frac{\partial G}{\partial \tau}(\epsilon^e)^2\right] I + J^{-1} G\, b^e \operatorname{dev} b^e. \tag{2.20}$$


For the hydrostatic component of the energy, $\mathcal{E}_h$, our computations use a stiffened polytropic equation of state [4, 12]:

$$\mathcal{E}_h(\tau,\eta) := \frac{1}{\gamma-1}\left(\frac{\tau}{\tau_0}\right)^{-(\gamma-1)} e^{(\gamma-1)\eta/R}\,(p_0 + p_\infty)\,\tau_0 + p_\infty \tau. \tag{2.21}$$

Here $\tau_0 = 1/\rho_0$ and $p_0$ are the specific volume and pressure in the undeformed state, while $\gamma$, $p_\infty$, and $R$ are prescribed constants that characterize the material, $\gamma$ being dimensionless, $p_\infty$ being a pressure, and $R$ having units of entropy.

In elastodynamics, the pressure $p$, temperature $T$, adiabatic bulk modulus $K$, and Gruneisen coefficient $\Gamma$ are related to first and second derivatives of the internal energy (see, e.g., reference [20]). In this work, we make the same identification with respect to the hydrostatic energy:

$$p = -\frac{\partial \mathcal{E}_h}{\partial \tau}, \qquad T = \frac{\partial \mathcal{E}_h}{\partial \eta}, \qquad K = \tau\,\frac{\partial^2 \mathcal{E}_h}{\partial \tau^2}, \qquad \Gamma = -\frac{\tau}{T}\,\frac{\partial^2 \mathcal{E}_h}{\partial \tau\,\partial \eta}. \tag{2.22}$$

By straightforward computations, we obtain the following relations in the undeformed state:

$$R\,T_0 = (p_0 + p_\infty)\,\tau_0, \qquad K = \gamma\,(p_0 + p_\infty), \qquad \Gamma = \gamma - 1. \tag{2.23}$$

The bulk modulus $K$ and the Gruneisen coefficient $\Gamma$ are given in the literature; therefore we can use Eq. (2.23) to calculate $R$, $p_\infty$, and $\gamma$. Table 1a shows the resulting values for tantalum.
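The relations (2.23) can be checked numerically against the energy function (2.21). The following is a minimal sketch, not the authors' code: it assumes the exponent $-(\gamma-1)$ on $\tau/\tau_0$ in (2.21), takes the tantalum constants from table 1a, and assumes $p_0 = 0$ (ambient pressure is negligible on the Mbar scale). The function names and the finite-difference step are illustrative choices.

```python
import math

# Tantalum constants as listed in table 1a (g/cm^3, Mbar, Mbar cm^3/(g kK)).
RHO0 = 16.6
GAMMA = 2.67
P_INF = 0.72
R_CONST = 0.145
P0 = 0.0  # assumption: ambient pressure neglected
TAU0 = 1.0 / RHO0


def hydrostatic_energy(tau, eta):
    """Stiffened polytropic energy, Eq. (2.21), with exponent -(gamma - 1)."""
    cold = (tau / TAU0) ** (1.0 - GAMMA)
    thermal = math.exp((GAMMA - 1.0) * eta / R_CONST)
    return cold * thermal * (P0 + P_INF) * TAU0 / (GAMMA - 1.0) + P_INF * tau


def thermodynamic_derivatives(tau=TAU0, eta=0.0, rel_step=1e-4):
    """Central-difference estimates of p, T, K, Gamma as defined in Eq. (2.22)."""
    e = hydrostatic_energy
    dt = rel_step * TAU0      # step in specific volume
    de = rel_step * R_CONST   # step in specific entropy
    p = -(e(tau + dt, eta) - e(tau - dt, eta)) / (2.0 * dt)
    temp = (e(tau, eta + de) - e(tau, eta - de)) / (2.0 * de)
    bulk = tau * (e(tau + dt, eta) - 2.0 * e(tau, eta) + e(tau - dt, eta)) / dt**2
    mixed = (e(tau + dt, eta + de) - e(tau + dt, eta - de)
             - e(tau - dt, eta + de) + e(tau - dt, eta - de)) / (4.0 * dt * de)
    gruneisen = -tau / temp * mixed
    return p, temp, bulk, gruneisen
```

Calling `thermodynamic_derivatives()` at the undeformed state should return a pressure close to $p_0$, a temperature satisfying $R\,T_0 = (p_0 + p_\infty)\tau_0$, a bulk modulus close to $\gamma(p_0 + p_\infty)$, and a Gruneisen coefficient close to $\gamma - 1$, up to finite-difference error.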

The shear modulus $G$ in our calculations is taken from Steinberg et al. [15]:

$$G(\tau,\eta) := G_0\left[1 + G_P\left(\frac{\tau}{\tau_0}\right)^{1/3} p + G_T\,(T - T_0)\right], \tag{2.24}$$

with $G_0$, $G_P$, and $G_T$ being material constants and $T_0$ the temperature in the undeformed state. The constant $G_0$ is the shear modulus of the material in the unstrained state. See table 1a for the values for tantalum given in reference [15].

2.3 Plastic Flow Rule. To complete the specification of the governing equations, we must define the plastic and hardening source terms. These terms are zero unless the shear stress exceeds a certain threshold, the yield strength. Rather than using the Cauchy stress in the yield criterion, however, we use the stress tensor $S := J\,(F^e)^{-1}\sigma\,(F^e)^{-T}$ corresponding to the intermediate configuration. This is consistent with measuring the shear strain $\epsilon^e$ in the intermediate configuration, and it guarantees that plastic deformation does not change the volume (see the discussion below).

One easily sees that $\|\operatorname{dev} S\| = J\,\|\operatorname{dev}(\sigma (b^e)^{-1})\|$. Therefore we interpret the von Mises criterion for elastic flow as

$$\|\operatorname{dev}(\sigma (b^e)^{-1})\| \le \sqrt{\tfrac23}\,Y_0. \tag{2.25}$$

For the static yield strength $Y_0$ we adopt the model of Steinberg et al. [15]:

$$Y_0 := Y_S(\kappa)\,\frac{G}{G_0}, \tag{2.26}$$

with the strain-hardening part of the yield strength being

$$Y_S(\kappa) := \min\left\{\,Y_A\,(1 + \beta\kappa)^n,\ Y_{\max}\,\right\}. \tag{2.27}$$



Table 1a

    ρ_0 (g/cm^3)   γ (unitless)   R (Mbar cm^3/(g kK))   p_∞ (Mbar)   G_0 (Mbar)   G_P (Mbar^-1)   G_T (kK^-1)
    16.6           2.67           0.145                  0.72         0.69         1.45            -0.13

Table 1b

    Y_A (Mbar)   β (unitless)   n (unitless)   Y_max (Mbar)   Y_P (Mbar)   U_K (eV)   k (eV/kK)
    0.00375      10             0.1            0.011          0.01         0.31       0.086171

Table 1c

    model   C_1 (μs^-1)   C_2 (Mbar μs)
    1       0.71          0.12
    2       0.71          0.012
    3       7.1           0.12
    4       7.1           0.012

Table 1: Values of material parameters for tantalum, as used in the numerical simulations. The constants $R$, $p_\infty$, and $\gamma$ are computed by using Eq. (2.23), with $K$ and $\Gamma$ taken from reference [15]. The values for $\rho_0$ and $k$ can be found in standard physics tables, such as reference [11]. The values for $Y_A$, $\beta$, $n$, $Y_{\max}$, $Y_P$, $U_K$, $G_0$, $G_P$, and $G_T$ are those used in references [15, 16]. Four sets of values are used for $C_1$ and $C_2$, as indicated in table 1c.


Here $Y_A$, $Y_{\max}$, $\beta$, and $n$ are material constants. See table 1b for the values corresponding to tantalum.
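As a small illustration of the yield model, the sketch below evaluates the strain-hardening law (2.27) and the static yield strength (2.26) with the tantalum constants of table 1b. It is only a sketch: the function names are hypothetical, and the ratio $G/G_0$ is passed in as a plain number (`shear_ratio`) rather than computed from the shear modulus model.

```python
# Tantalum constants from table 1b (Mbar, unitless, unitless, Mbar).
Y_A = 0.00375
BETA = 10.0
N_EXP = 0.1
Y_MAX = 0.011


def strain_hardening_yield(kappa):
    """Y_S(kappa) = min{Y_A (1 + beta kappa)^n, Y_max}, Eq. (2.27)."""
    return min(Y_A * (1.0 + BETA * kappa) ** N_EXP, Y_MAX)


def static_yield(kappa, shear_ratio=1.0):
    """Y_0 = Y_S(kappa) G/G_0, Eq. (2.26); shear_ratio stands for G/G_0."""
    return strain_hardening_yield(kappa) * shear_ratio
```

With no accumulated plastic strain (`kappa = 0`) and `shear_ratio = 1`, the static yield strength reduces to $Y_A$; for very large `kappa` it is capped at $Y_{\max}$.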

During plastic loading, where inequality (2.25) is violated, the plastic strain rate tensor is assumed to be parallel to the deviatoric stress, as in the flow rules of Levy-St. Venant and Prandtl-Reuss. It is suggestive to write the plastic strain rate in the intermediate frame [10, 14] as

$$(L_v E^p)_{ab} := \left((F^p)^{-T}\,\dot{E}^p\,(F^p)^{-1}\right)_{ab} = \lambda\,\frac{(\operatorname{dev} S)_{ab}}{\|\operatorname{dev} S\|}, \tag{2.28}$$

where $\lambda$ is a nonnegative scalar function that vanishes when inequality (2.25) holds. Such a flow rule can be expressed equivalently as

$$\Lambda_{\alpha\beta} = \lambda\,\frac{\left[F^T (b^e)^{-1} \operatorname{dev}\!\left(\sigma (b^e)^{-1}\right) F\right]_{\alpha\beta}}{\|\operatorname{dev}(\sigma (b^e)^{-1})\|}. \tag{2.29}$$

The flow rule (2.29) (or (2.28)) has the feature that plastic deformation causes no volume change [13], as is usually assumed in metal plasticity. Indeed, the time derivative of $\det F^p = [\det(I + 2E^p)]^{1/2}$ is proportional to the trace of the right-hand side of Eq. (2.28), which is zero.

To define $\lambda$, we adopt the rate-dependent model of Steinberg and Lund [16], making the identification of $\sqrt{2/3}\,\lambda$ with the quantity $\dot\epsilon_p$ of this reference:

$$\sqrt{\tfrac23}\,\lambda = \left[\frac{1}{C_1}\exp\!\left(\frac{2U_K}{kT}\left(1 - \frac{Y_T}{Y_P}\right)^2\right) + \frac{C_2}{Y_T}\right]^{-1}, \tag{2.30}$$

where

$$Y_T := \frac{\|\operatorname{dev}(\sigma (b^e)^{-1})\| - \sqrt{2/3}\,Y_0}{\sqrt{2/3}\,G/G_0} \tag{2.31}$$

is the thermally-activated part of the dynamic yield strength, which measures the extent to which the yield criterion has been exceeded. (Strictly speaking, $Y_T$ is set to zero if it is negative, and to $Y_P$ if it exceeds $Y_P$.) In the definition of $\lambda$, the quantities $C_1$, $C_2$, $Y_P$, and $U_K/k$ are material constants; values for tantalum are shown in tables 1b and 1c.

Finally, the source term for the hardening law is

$$h := \sqrt{\tfrac23}\,\lambda; \tag{2.32}$$

thus $\kappa$ represents the accumulated plastic strain.

Hoge and Mukherjee [9] have reported extensive experimental data on tantalum, including plots of $Y_T$ vs. $\dot\epsilon_p$ at $T = 0.3$ kK and $Y_T$ vs. $T$ at $\dot\epsilon_p = 10^{-10}\ \mu\mathrm{s}^{-1}$. Figure 2 compares these data with the predictions of the models corresponding to the different sets of values for $C_1$ and $C_2$ shown in table 1c.

As seen in figure 2, model 4 seems to agree best with the experimental data, particularly in the range of $\dot\epsilon_p$ from $10^{-9}\ \mu\mathrm{s}^{-1}$ to $10^{-2}\ \mu\mathrm{s}^{-1}$. This observation is consistent with the results of numerical simulations discussed in section 4 below. Notice that $\dot\epsilon_p$ has a maximal, or saturation, value. For model 1 the saturation value is 0.0746 $\mu\mathrm{s}^{-1}$, whereas for model 4 it is 0.746 $\mu\mathrm{s}^{-1}$, ten times larger. These values are to be contrasted with the nominal strain rates in the impact problems discussed below, which range from about 0.01 $\mu\mathrm{s}^{-1}$ to 0.1 $\mu\mathrm{s}^{-1}$.
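The saturation values quoted above can be reproduced with a short calculation. The sketch below assumes the commonly quoted form of the Steinberg-Lund rate law, $\dot\epsilon_p = \left[C_1^{-1}\exp\!\left(\tfrac{2U_K}{kT}(1 - Y_T/Y_P)^2\right) + C_2/Y_T\right]^{-1}$, together with the constants of tables 1b and 1c; the function names are illustrative and are not taken from the authors' code. Saturation corresponds to $Y_T = Y_P$, where the exponential equals one.

```python
import math

# Constants from table 1b (Mbar, eV, eV/kK).
Y_P = 0.01
U_K = 0.31
K_BOLTZMANN = 0.086171

# (C_1 in 1/microsecond, C_2 in Mbar microsecond) for the four models of table 1c.
MODELS = {1: (0.71, 0.12), 2: (0.71, 0.012), 3: (7.1, 0.12), 4: (7.1, 0.012)}


def plastic_strain_rate(y_t, temperature, c1, c2):
    """Assumed Steinberg-Lund rate law; y_t is clipped to the interval [0, Y_P]."""
    y_t = min(max(y_t, 0.0), Y_P)
    if y_t == 0.0:
        return 0.0  # below yield there is no plastic flow
    exponent = 2.0 * U_K / (K_BOLTZMANN * temperature) * (1.0 - y_t / Y_P) ** 2
    return 1.0 / (math.exp(exponent) / c1 + c2 / y_t)


def saturation_rate(model, temperature=0.3):
    """Maximal plastic strain rate, attained when Y_T reaches Y_P."""
    c1, c2 = MODELS[model]
    return plastic_strain_rate(Y_P, temperature, c1, c2)
```

Under these assumptions `saturation_rate(1)` evaluates to about 0.0746 and `saturation_rate(4)` to about 0.746 (both in $\mu\mathrm{s}^{-1}$), in agreement with the values stated in the text.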

2.4 Uniaxial Flow. For the normal impact problem we consider, the governing equations can be simplified because the flow is uniaxial (cf. references [17] and [7]). The inverse deformation gradient has the form $g = \operatorname{diag}(J^{-1}, 1, 1)$ and the velocity vector is $v = (v^1, 0, 0)^T$. Furthermore, the plastic strain is diagonal, the 22 and 33 entries are equal by symmetry, and $\det(I + 2E^p) = 1$ (there being no plastic volume change); therefore we can use a single variable $\phi$ [10, 20] to describe the plastic strain tensor:

$$E^p = \operatorname{diag}\left(\tfrac12(e^{-2\phi} - 1),\ \tfrac12(e^{\phi} - 1),\ \tfrac12(e^{\phi} - 1)\right). \tag{2.33}$$

In these terms,

$$b^e = \operatorname{diag}\left(J^2 e^{2\phi},\ e^{-\phi},\ e^{-\phi}\right), \tag{2.34}$$

and

$$J = \frac{\tau}{\tau_0}. \tag{2.35}$$

It proves convenient to ascribe a sign to $\epsilon^e$, defining

$$\epsilon^e := \tfrac{1}{\sqrt6}\left(e^{-\phi} - J^2 e^{2\phi}\right). \tag{2.36}$$

Then

$$\sigma^{11} = -p + \tau_0 \frac{\partial G}{\partial \tau}(\epsilon^e)^2 - 2\sqrt{\tfrac23}\,J e^{2\phi}\,G\,\epsilon^e, \tag{2.37}$$

and

$$\Lambda_{11} = -\sqrt{\tfrac23}\,\operatorname{sgn}(\epsilon^e)\,e^{-2\phi}\lambda. \tag{2.38}$$

Figure 2: A comparison of constitutive models with experiment. (a) Thermal yield stress $Y_T$ [Mbar] vs. plastic strain rate $\dot\epsilon_p$ [$\mu\mathrm{s}^{-1}$] at $T = 0.3$ kK. (b) Thermal yield stress $Y_T$ [Mbar] vs. temperature $T$ [kK] at $\dot\epsilon_p = 10^{-10}\ \mu\mathrm{s}^{-1}$; models 1 and 2 are nearly overlapping, as are models 3 and 4.
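The uniaxial relations can be spot-checked numerically. The sketch below assumes the forms $E^p = \operatorname{diag}(\tfrac12(e^{-2\phi}-1), \tfrac12(e^{\phi}-1), \tfrac12(e^{\phi}-1))$, $b^e = \operatorname{diag}(J^2e^{2\phi}, e^{-\phi}, e^{-\phi})$, and $\epsilon^e = (e^{-\phi} - J^2e^{2\phi})/\sqrt6$ for Eqs. (2.33), (2.34), and (2.36). It verifies that the plastic strain is volume preserving, that $|\epsilon^e| = \tfrac12\|\operatorname{dev} b^e\|$ as in Eq. (2.17), and that the 11 component of the shear stress $J^{-1}G\,b^e\operatorname{dev} b^e$ from Eq. (2.20) equals $-2\sqrt{2/3}\,Je^{2\phi}G\,\epsilon^e$ as in Eq. (2.37). All function names are hypothetical.

```python
import math


def uniaxial_state(jacobian, phi):
    """Diagonals of E^p and b^e, and the signed shear strain, for uniaxial flow."""
    e_plastic = [0.5 * (math.exp(-2.0 * phi) - 1.0),
                 0.5 * (math.exp(phi) - 1.0),
                 0.5 * (math.exp(phi) - 1.0)]
    finger = [jacobian**2 * math.exp(2.0 * phi), math.exp(-phi), math.exp(-phi)]
    eps_e = (math.exp(-phi) - jacobian**2 * math.exp(2.0 * phi)) / math.sqrt(6.0)
    return e_plastic, finger, eps_e


def deviator(diagonal):
    """Deviatoric part of a diagonal 3x3 matrix, stored as a list of 3 entries."""
    mean = sum(diagonal) / 3.0
    return [entry - mean for entry in diagonal]


def frobenius_norm(diagonal):
    """Norm defined by ||A||^2 = tr A^2, for a diagonal matrix."""
    return math.sqrt(sum(entry * entry for entry in diagonal))


def shear_stress_11(jacobian, phi, shear_modulus):
    """11 component of J^{-1} G b^e dev(b^e), the shear part of Eq. (2.20)."""
    _, finger, _ = uniaxial_state(jacobian, phi)
    return shear_modulus / jacobian * finger[0] * deviator(finger)[0]
```

For example, with the arbitrary sample state `jacobian = 0.95` and `phi = 0.01`, the product of the entries of $I + 2E^p$ is 1, and `shear_stress_11` agrees with the closed-form shear term of Eq. (2.37) to rounding error.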

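The conservative system (2.9) can now be simplified to be

$$\frac{\partial}{\partial t}\left(J^{-1}\right) + \frac{\partial}{\partial x}\left(J^{-1} v^1\right) = 0, \tag{2.39}$$

$$\frac{\partial}{\partial t}\left(\rho v^1\right) + \frac{\partial}{\partial x}\left(\rho (v^1)^2 - \sigma^{11}\right) = 0, \tag{2.40}$$

$$\frac{\partial}{\partial t}\left[\rho\left(\tfrac12 (v^1)^2 + \mathcal{E}\right)\right] + \frac{\partial}{\partial x}\left[\rho\left(\tfrac12 (v^1)^2 + \mathcal{E}\right) v^1 - v^1 \sigma^{11}\right] = 0, \tag{2.41}$$

$$\frac{\partial}{\partial t}(\rho\kappa) + \frac{\partial}{\partial x}(\rho\kappa v^1) = \rho h, \tag{2.42}$$

$$\frac{\partial}{\partial t}(\rho\phi) + \frac{\partial}{\partial x}(\rho\phi v^1) = \rho\,\operatorname{sgn}(\epsilon^e)\,h, \tag{2.43}$$

together with the constitutive relations (2.19), (2.37), and (2.32).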
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3.  Numerical  Method 

Our computations use a second-order variant of Godunov's method. The basic algorithm for this method is described in references [19, 1, 3, 18]. Since the numerical calculations of this paper are restricted to one space dimension, we describe the method for this case only.

Godunov-type methods require three basic steps: reconstructing the discrete solution from cell averages, computing the interaction of the incoming waves at cell edges using the method of characteristics, and updating cell averages by means of conservative finite differencing.

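3.1 Characteristic Analysis. The equations governing smooth elastic-plastic flow can be written in the following characteristic form:

$$\frac{\partial V}{\partial t} + A(V)\,\frac{\partial V}{\partial x} = S(V), \tag{3.1}$$

where

$$V := \left(g^\alpha{}_1,\ v^i,\ \eta,\ \kappa,\ E^p_{(a)}\right)^T. \tag{3.2}$$

The Jacobian matrix appearing in system (3.1) is

$$A(V) := \begin{pmatrix}
v^1 I_{3\times3} & g & 0 & 0 & 0 \\
-\frac1\rho A_g & v^1 I_{3\times3} & -\frac1\rho A_\eta & -\frac1\rho A_\kappa & -\frac1\rho A_{E^p} \\
0 & 0 & v^1 & 0 & 0 \\
0 & 0 & 0 & v^1 & 0 \\
0 & 0 & 0 & 0 & v^1 I_{6\times6}
\end{pmatrix},$$

where

$$(A_g)^i{}_\alpha := \frac{\partial \sigma^{i1}}{\partial g^\alpha{}_1}, \qquad (A_\eta)^i := \frac{\partial \sigma^{i1}}{\partial \eta}, \qquad (A_\kappa)^i := \frac{\partial \sigma^{i1}}{\partial \kappa}, \qquad (A_{E^p})^{i\,(a)} := \frac{\partial \sigma^{i1}}{\partial E^p_{(a)}}.$$

The source term in system (3.1) is

$$S(V) := \left(0_{1\times6},\ \frac{d}{\rho T},\ h,\ \Lambda_{(a)}\right)^T,$$

with $d$ denoting the plastic dissipation term

$$d := -\rho\,\frac{\partial \mathcal{E}}{\partial E^p_{\alpha\beta}}\,\Lambda_{\alpha\beta} - \rho\,\frac{\partial \mathcal{E}}{\partial \kappa}\,h. \tag{3.6}$$

In order for the system (3.1) to be hyperbolic, $A(V)$ must have real eigenvalues. Evidently, eight of the eigenvalues of $A(V)$ are $v^1$, with the corresponding left eigenvectors being appropriate rows of the identity matrix. The remaining eigenvalues are those of the $6 \times 6$ matrix

$$B := \begin{pmatrix} v^1 I_{3\times3} & g \\ -\frac1\rho A_g & v^1 I_{3\times3} \end{pmatrix}. \tag{3.7}$$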
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Since 

$$\det(B - \lambda I) = -\det\!\left(-\tfrac1\rho A_g\,g - (v^1 - \lambda)^2 I_{3\times3}\right), \tag{3.8}$$

the system (3.1) is hyperbolic if and only if the matrix

$$C := -\frac1\rho A_g\,g \tag{3.9}$$

has nonnegative eigenvalues. One can verify that $\rho\,C^{ik}$ coincides with the adiabatic acoustic matrix corresponding to the $x^1$-direction, i.e., with $a^{i1k1} = c^{i1k1} + \sigma^{11}\delta^{ik}$, so that $C$ is a symmetric matrix. Therefore there are eight characteristic speeds equal to $v^1$, three equal to $v^1$ plus the positive square roots of the eigenvalues of $C$, and three equal to $v^1$ minus these square roots.
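For completeness, the determinant identity behind this hyperbolicity criterion can be derived in a few lines. The sketch below assumes the block form $B = \begin{pmatrix} v^1 I & g \\ -\frac1\rho A_g & v^1 I \end{pmatrix}$ and $C := -\frac1\rho A_g\,g$, and uses two standard facts: the block-determinant formula for a matrix whose lower blocks commute, and the equality $\det(aI - XY) = \det(aI - YX)$ for square $X$, $Y$.

```latex
% Write a := v^1 - \lambda. The lower-right block a I commutes with every
% 3x3 matrix, so the block-determinant formula applies:
\det(B - \lambda I)
  = \det\begin{pmatrix} a I & g \\ -\tfrac{1}{\rho} A_g & a I \end{pmatrix}
  = \det\!\left(a^2 I + \tfrac{1}{\rho}\, g A_g\right)
  = \det\!\left(a^2 I + \tfrac{1}{\rho}\, A_g\, g\right)
  = \det\!\left(a^2 I - C\right).
% Since the matrices are 3x3, \det(a^2 I - C) = -\det(C - a^2 I), which is
% the form \det(B - \lambda I) = -\det(C - (v^1 - \lambda)^2 I).
% Hence \lambda is an eigenvalue of B exactly when (v^1 - \lambda)^2 is an
% eigenvalue of C, i.e. \lambda = v^1 \pm \sqrt{c} for each eigenvalue c of C,
% and these \lambda are real precisely when every c is nonnegative.
```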

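Assume that the system (3.1) is hyperbolic. Then we can find a nonsingular matrix $L$, the rows of which are the left eigenvectors of the matrix $C$, and a diagonal matrix $\Pi$, whose diagonal entries are the square roots of the eigenvalues of $C$, such that

$$L\,C = \Pi^2 L. \tag{3.10}$$

We choose $L$ such that $L L^T = I$. The left eigenvectors of $B$ are then determined by the following relation:

$$\begin{pmatrix} -\Pi L g^{-1} & L \\ \Pi L g^{-1} & L \end{pmatrix}
\begin{pmatrix} v^1 I_{3\times3} & g \\ -\frac1\rho A_g & v^1 I_{3\times3} \end{pmatrix}
= \begin{pmatrix} v^1 I_{3\times3} - \Pi & 0 \\ 0 & v^1 I_{3\times3} + \Pi \end{pmatrix}
\begin{pmatrix} -\Pi L g^{-1} & L \\ \Pi L g^{-1} & L \end{pmatrix}. \tag{3.11}$$

The right eigenvectors can be obtained in a similar manner. Finally, in terms of the $3 \times 8$ matrix

$$D := -\frac1\rho\left(A_\eta,\ A_\kappa,\ A_{E^p}\right), \tag{3.12}$$

the left and right eigenvectors $l_1, \ldots, l_{14}$ and $r_1, \ldots, r_{14}$ of the Jacobian matrix $A(V)$ are given by

$$(l_1, \ldots, l_{14})^T = \begin{pmatrix} -\Pi L g^{-1} & L & -\Pi^{-1} L D \\ \Pi L g^{-1} & L & \Pi^{-1} L D \\ 0 & 0 & I_{8\times8} \end{pmatrix} \tag{3.13}$$

and, with $R := L^T$,

$$(r_1, \ldots, r_{14}) = \begin{pmatrix} -\tfrac12\, g R\, \Pi^{-1} & \tfrac12\, g R\, \Pi^{-1} & -g\,C^{-1} D \\ \tfrac12 R & \tfrac12 R & 0_{3\times8} \\ 0_{8\times3} & 0_{8\times3} & I_{8\times8} \end{pmatrix}. \tag{3.14}$$

These eigenvectors satisfy the normalized biorthogonality condition, i.e., $l_i \cdot r_j = \delta_{ij}$.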
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3.2  Godunov  Scheme.  In  the  numerical  scheme,  the  flow  is  represented  by  associating 
states  with  points  on  a  grid;  each  state  represents  the  average  of  the  flow  state  over  the  mesh 
cell  centered  at  the  grid  point.  We  choose  a  fixed  Eulerian  spatial  grid,  indexed  by  i,  with 
spatial increment $\Delta x$. To advance the solution from time $t_n$ to $t_{n+1}$, we use a time step $\Delta t$ that satisfies the Courant-Friedrichs-Lewy (CFL) stability condition

$$\Delta t = c\,\frac{\Delta x}{\mu_{\max}}, \tag{3.15}$$

where $\mu_{\max}$ is the largest absolute value of the eigenvalues of $A(V)$ and $c < 1$ is a positive constant called the CFL number. In our numerical experiments, $c < 0.5$.
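To give a feel for the magnitudes involved, the sketch below estimates the 14 characteristic speeds in the unstrained state and the resulting CFL time step. It rests on an assumption not stated in the text: that in the unstrained isotropic state the eigenvalues of $C$ reduce to the linear-elastic values $(K + \tfrac43 G_0)/\rho_0$ (longitudinal) and $G_0/\rho_0$ (two shear modes). The constants come from table 1a, with $K = \gamma(p_0 + p_\infty)$ and $p_0 = 0$ assumed; the grid spacing in the usage note is hypothetical.

```python
import math

# Tantalum constants from table 1a (g/cm^3, unitless, Mbar, Mbar).
RHO0 = 16.6
GAMMA = 2.67
P_INF = 0.72
G0 = 0.69
P0 = 0.0  # assumption: ambient pressure neglected


def unstrained_characteristic_speeds(v1=0.0):
    """The 14 speeds: v1 (eight times) and v1 +/- the three acoustic speeds."""
    bulk = GAMMA * (P0 + P_INF)                       # Eq. (2.23)
    c_long = math.sqrt((bulk + 4.0 * G0 / 3.0) / RHO0)  # cm/microsecond
    c_shear = math.sqrt(G0 / RHO0)
    roots = [c_long, c_shear, c_shear]
    return [v1] * 8 + [v1 + r for r in roots] + [v1 - r for r in roots]


def cfl_time_step(speeds, dx, cfl=0.5):
    """Time step dt = c dx / mu_max, Eq. (3.15)."""
    mu_max = max(abs(s) for s in speeds)
    return cfl * dx / mu_max
```

For instance, `cfl_time_step(unstrained_characteristic_speeds(0.039), dx=0.005)` gives the step for a hypothetical 0.005 cm cell in material moving at the test 390 impact velocity. Since Mbar divided by g/cm^3 has units of (cm/μs)^2, the speeds come out directly in cm/μs.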

Given the increments $\Delta x$ and $\Delta t$ above, the method consists of four steps:

(1) Reconstruction of the flow profile from the cell averages. This step constructs a piecewise linear approximation to $V(x, t_n)$ by determining a slope $\Delta V/\Delta x$, which approximates $\partial V/\partial x$, in each cell. A slope limiter is employed to avoid introducing extraneous local minima and maxima. In our computations, we use the standard van Leer slope limiter [19, 1, 3].

(2) Computation of the half-step left and right states at the edges of each cell. These states are found by freezing coefficients in system (3.1) and integrating along characteristics.

(3) Resolution of wave interactions. We solve the Riemann problem for system (2.9) at each cell edge, with left and right states obtained from step (2). We currently use an approximate Riemann solver based on freezing coefficients in system (3.1).

(4) Conservative finite differencing. Using the states $V^{n+1/2}_{i\pm1/2}$ of step (3) to approximate the flux and source terms in system (2.9), the states at time $t_{n+1}$ are obtained from the flux balance relation:

$$U_i^{n+1} = U_i^n - \frac{\Delta t}{\Delta x}\left[H\!\left(U(V^{n+1/2}_{i+1/2})\right) - H\!\left(U(V^{n+1/2}_{i-1/2})\right)\right] \tag{3.16}$$

$$\qquad\qquad + \frac{\Delta t}{2}\left[S\!\left(U(V^{n+1/2}_{i+1/2})\right) + S\!\left(U(V^{n+1/2}_{i-1/2})\right)\right], \tag{3.17}$$

where $U(V)$ is the invertible map carrying $V$ to $U$.
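The four steps above can be illustrated on a much simpler model problem. The following sketch applies them to scalar linear advection $u_t + a u_x = 0$ with $a > 0$ on a periodic grid; it is not the elasto-plastic solver described here. It assumes one common form of the van Leer limiter (the harmonic-mean form), since the exact variant used by the authors is not specified in the text, and for this scalar problem the Riemann solution at a cell edge is simply the upwind (left) state. All names are illustrative.

```python
def van_leer_slope(u_left, u_mid, u_right):
    """Limited cell difference (slope times dx): harmonic mean of one-sided
    differences, or zero at a local extremum."""
    d_left, d_right = u_mid - u_left, u_right - u_mid
    if d_left * d_right <= 0.0:
        return 0.0
    return 2.0 * d_left * d_right / (d_left + d_right)


def godunov_step(u, speed, dx, dt):
    """One second-order Godunov step for u_t + speed * u_x = 0, speed > 0."""
    if speed <= 0.0:
        raise ValueError("this sketch assumes a positive advection speed")
    n = len(u)
    courant = speed * dt / dx
    # Step 1: piecewise-linear reconstruction with limited slopes (periodic).
    slopes = [van_leer_slope(u[i - 1], u[i], u[(i + 1) % n]) for i in range(n)]
    # Step 2: half-step state at the right edge of each cell, traced back
    # along the characteristic from the edge.
    edge = [u[i] + 0.5 * (1.0 - courant) * slopes[i] for i in range(n)]
    # Step 3: Riemann problem at edge i+1/2; for speed > 0 the solution is
    # the state arriving from the left cell.
    flux = [speed * state for state in edge]
    # Step 4: conservative flux differencing (flux[i - 1] is the left edge).
    return [u[i] - dt / dx * (flux[i] - flux[i - 1]) for i in range(n)]


def advect(u0, speed, dx, cfl, n_steps):
    """Advance the initial cell averages u0 by n_steps at a fixed CFL number."""
    dt = cfl * dx / abs(speed)
    u = list(u0)
    for _ in range(n_steps):
        u = godunov_step(u, speed, dx, dt)
    return u
```

Advecting a square pulse with `advect(u0, 1.0, 0.01, 0.5, 40)` should conserve the sum of the cell averages and create no new extrema, which are the two properties the conservative differencing and the slope limiter are meant to provide.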
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Table  2a 

    initial state for flyer             initial state for target
    g = I_{3x3}                         g = I_{3x3}
    v^1 = v_F,  v^2 = v^3 = 0           v^1 = v^2 = v^3 = 0
    η = 0                               η = 0
    κ = 0                               κ = 0
    E^p = 0_{3x3}                       E^p = 0_{3x3}

Table 2b

    test                        161      232      310      390
    impact velocity (cm/μs)     0.0161   0.0232   0.0310   0.0390

Table 2: Initial and parameter values used for the numerical simulations.


4.  Numerical  Results 

We  apply  our  code  to  the  problem  of  the  collision  of  two  tantalum  plates.  This  problem 
is  similar  to  those  presented  in  references  [21,  16,  18].  The  initial  data  and  values  for  the 
parameters for the constitutive laws are given in tables 2 and 1. The simulation begins as the
flyer  plate  strikes  the  target  plate  at  x  =  0.  The  collision  initiates  several  waves,  two  moving 
right  and  two  moving  left.  The  faster  wave  in  each  group  is  called  the  elastic  precursor, 
and  the  slower  is  called  the  plastic  compression  wave.  When  the  waves  moving  right  hit 
the  free  surface  of  the  target  plate,  the  surface  is  accelerated.  The  waves  moving  left  reflect 
from  the  free  surface  of  the  flyer  plate;  when  they  subsequently  hit  the  free  surface  of  the 
target  plate,  this  surface  is  decelerated.  While  the  free  boundary  of  the  target  plate  is  being 
accelerated,  it  is  said  to  be  undergoing  loading;  similarly,  the  deceleration  of  this  boundary 
is  called  unloading.  A  schematic  diagram  of  the  wave  structure  of  the  impact  problem  is 
shown  in  figure  3a. 

In  physical  experiments,  one  typically  measures  the  velocity  of  the  free  boundary  of 
the  target  plate.  The  velocity  profile  for  the  complete  loading-unloading  process  resembles 
figure  3b.  When  the  elastic-plastic  wave  group  first  arrives  at  the  boundary,  the  elastic 
precursor  raises  the  velocity  to  a  level  that  is  roughly  independent  of  the  impact  velocity. 
As  the  material  yields,  the  strong  plastic  wave  loads  the  material  to  a  peak  velocity.  After  a 
plateau  at  the  peak  velocity,  which  is  approximately  the  same  as  the  impact  velocity,  waves 
reflected  from  the  free  surface  of  the  flyer  plate  arrive,  causing  unloading. 

At  present,  our  numerical  code  is  capable  only  of  resolving  the  loading  portion  of  the 
collision  process.  In  place  of  the  velocity  of  the  free  boundary  of  the  target  plate,  we  cal¬ 
culate  the  velocity  at  several  fixed  points  in  the  target  plate.  The  resulting  velocity  profiles 
correspond  directly  to  the  free-surface  velocity  prior  to  the  time  of  arrival  of  waves  reflected 
from  the  free  surface  of  the  flyer  plate.  Thus  we  can  monitor  how  the  shock  wave  in  the 
target  plate  splits  into  a  double  wave  structure. 

Figures 4 and 5 show the velocity at three different points plotted vs. time; superimposed in each plot are the results for either different values for $C_1$ and $C_2$, different grid sizes, or different impact velocities. In all of these figures except figure 5b, the velocity is measured at the locations x = 0.1388 cm, x = 0.6003 cm, and x = 0.96 cm in the target plate; the corresponding plots appear on the left, in the middle, and on the right.

Figure 3: (a) Wave structure of the impact problem. (b) A schematic profile of free surface velocity vs. time. (Annotations in the figure: the shock splits into elastic and plastic waves; $v_F$ = impact velocity, $\rho$ = density, $G$ = shear modulus, $K$ = bulk modulus.)

Since the values of the parameters $C_1$ and $C_2$ used in our constitutive model are difficult to determine from fundamental principles, we seek to determine the influence of these parameters on the computed solution. For test 390, figure 4a shows the superposition of the velocity plots for four separate simulations, using the values of $C_1$ and $C_2$ indicated in table 1c. Models 1 and 3 produce a similar profile, which shows almost no distinct elastic wave. Models 2 and 4, for which $\dot\epsilon_p$ is larger for large $Y_T$, have more distinct elastic and plastic waves. Model 2 shows an elastic precursor wave with an amplitude about 10% larger than that of model 4. Of the four models used in test 390, model 4 seems to be closest to the experimental results at x = 0.96 cm shown in figure 6 of the work of Steinberg and Lund [16]. Figure 4b shows a refinement study for test 390. A grid of 400 cells resolves the problem successfully.

Figure 4: (a) Effect of different values of the parameters $C_1$ and $C_2$ on the computed solutions in test 390. (b) Grid refinement study in test 390. (Vertical axes: $v^1$ [cm/μs].)

In comparing the results of the four different tests in figure 5a, we observe that the effect of an increased impact velocity is to raise the plateau velocity at the target points. In test 161, the plateau velocity is approximately $1.6 \times 10^{-2}$ cm/μs. Test 232, which uses an impact velocity that is 44% faster, shows a peak velocity at the target points of nearly $2.3 \times 10^{-2}$ cm/μs. Test 310, which uses an impact velocity that is 34% faster than test 232, shows a peak velocity at the target points of almost $3.1 \times 10^{-2}$ cm/μs. Test 390 uses an impact velocity that is 26% higher than test 310; the peak velocity is about $3.9 \times 10^{-2}$ cm/μs. For each of the tests, nearly the same peak velocity is obtained at the three points x = 0.1388 cm, x = 0.6003 cm, and x = 0.96 cm. Most importantly, different impact velocities have little influence on the amplitude of the elastic precursor wave.
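The percentage increases quoted for successive tests follow directly from the impact velocities in table 2b, as the short calculation below shows. The dictionary and function names are illustrative only.

```python
# Impact velocities from table 2b, in cm/microsecond, keyed by test number.
IMPACT_VELOCITY = {161: 0.0161, 232: 0.0232, 310: 0.0310, 390: 0.0390}


def percent_increase(slower_test, faster_test):
    """Relative increase in impact velocity between two tests, in percent."""
    slow = IMPACT_VELOCITY[slower_test]
    fast = IMPACT_VELOCITY[faster_test]
    return 100.0 * (fast / slow - 1.0)
```

Rounded to the nearest percent, `percent_increase(161, 232)`, `percent_increase(232, 310)`, and `percent_increase(310, 390)` give 44, 34, and 26, matching the figures in the text.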

Finally, figure 5b shows the velocity plots for tests 161, 232, and 390, with parameters $C_1$ and $C_2$ coming from model 4. These plots most closely resemble the experimental results shown in figures 4-6 of the work of Steinberg and Lund [16].


5.  Conclusion 


We  have  described  a  numerical  method  for  the  computation  of  elasto-plastic  flows  in 
one  space  dimension.  The  key  feature  of  this  numerical  method  is  the  use  of  a  conservative 
Eulerian  form  of  the  equations  of  motion.  The  equations  of  motion  are  solved  using  a 
second-order  Godunov  method.  The  advantages  are  the  reduction  of  numerical  diffusion 
in  the  computed  solution  and  a  more  uniform  distribution  of  numerical  resolution  in  the 
computation. 

We applied this numerical method to the problem of the high-velocity impact of two metal plates. The results of these computations are in good agreement with computations using other methods and with experimental results. We investigated the effect of varying certain material parameters and observed significant differences in the flow behavior. Our conclusion is that the choice of material parameters corresponding to model 4 of table 1c most closely reproduces the results of experiment.

Figure 5: (a) Effect of different impact velocities on the computed solutions. (b) Velocity vs. time for tests 161, 232, and 390. (Axes: $v^1$ [cm/μs] vs. $t$ [μs].)

Future directions of development of our numerical code will include the use of front tracking for the elastic and shear waves in the solid medium, and, more importantly, the extension of the method to flows in two and three space dimensions.

the Phase Transition Problems *

Abstract

In  this  paper  we  first  summarize  the  earlier  results  on  the  slow  motion  in  the 
Korteweg theory of capillarity in the one-dimensional case and show some numerical
results.  In  the  multidimensional  case  we  discuss  the  existence  of  local  solutions  to 
the  system  of  equations  for  compressible  fluids  of  Korteweg  type. 

1  Introduction 

In order to model the capillarity effect of materials, Korteweg [12] formulated a constitutive
equation  for  the  Cauchy  stress  that  includes  density  gradients.  It  turns  out  that  his  theory 
is  useful  to  discuss  phase  transition  problems. 

First,  we  discuss  the  one-dimensional  isothermal  motion.  In  this  case  the  equation  we 
discuss  is  given  by 

(1.1)  $u_{tt} = \sigma(u_x)_x + \nu u_{xxt} - \epsilon^2 u_{xxxx}$,  0 < x < 1,  t > 0,

where u is the displacement and the $u_{xxt}$ and $u_{xxxx}$ terms represent the viscosity and the capillarity
effects, respectively. Typical boundary conditions come from either a soft loading
device  or  a  hard  loading  device.  Although  the  slow  motion  occurs  in  both  cases,  in  this 
note  we  discuss  the  soft  loading  case  only  for  simplicity.  The  boundary  conditions  in  this 
case  are  given  by 

(1.2a)  $u(0,t) = 0$,  $(\sigma(u_x) + \nu u_{xt} - \epsilon^2 u_{xxx})|_{x=1} = P$,
(1.2b)  $u_{xx}(0,t) = 0$,  $u_{xx}(1,t) = 0$.

The  initial  conditions  are  given  by 

(1.3)  $u(x,0) = f(x)$,  $u_t(x,0) = g(x)$,

where  f,g  €  f/^O,  1).  The  boundary  conditions  (1.2a)  show  that  the  stress  P  is  applied 
at  x  =  1.  The  boundary  conditions  (1.2b)  are  the  natural  boundary  conditions  for  the 
corresponding  variational  problem. 

* The author was supported in part by Army Grant DAAL 03-89-G-0088.

In what follows, we assume that $\sigma$ is given by Fig. 1.1. In this figure $(0, \alpha^*]$ and
$[\beta^*, \infty)$ are called the $\alpha$-phase and the $\beta$-phase, respectively. They correspond to the
different phases of the material. The interval $(\alpha^*, \beta^*)$ is called the spinodal region and
is physically unstable. We denote by $\alpha$, $\delta$, and $\beta$ the values of $u_x$ at the intersections of
$y = P$ and $y = \sigma(u_x)$ in the $\alpha$-phase, the spinodal region, and the $\beta$-phase, respectively.
The value of P for which areas A and B are equal is called the Maxwell line. We denote
by $\alpha_M$, $\delta_M$, and $\beta_M$ the values of $\alpha$, $\delta$, and $\beta$, respectively, for which we have the Maxwell
line construction.




The capillarity term was first introduced by Korteweg [12]. Recently, various effects of
this term have been discussed. For example, Serrin [15], [16] reconsidered the Korteweg
theory and showed the existence of a steady profile connecting the $\alpha$-phase and the $\beta$-phase.
Slemrod [17] and Hagan and Slemrod [9] considered the existence of travelling wave
solutions. The static problems concerning the soft loading case and the hard loading case
have been discussed in [3] and [4], respectively. The dynamical aspects of these loading
cases are discussed in Hattori and Mischaikow [10] and Andrews and Ball [1].

In Section 2 we summarize the result in [11] about a slow motion of (1.1) resembling
the dynamics of (2.3) discussed in [7], [5], [2], and [8]. In Section 3 we show some numerical
examples of slow motions. In Section 4 we discuss the existence of local solutions to
the system of equations for two-dimensional isothermal motion of compressible fluids of
Korteweg type. The higher order terms of density (or the deformation gradient) in the
Cauchy stress tensor are not in general compatible with the classical theory of thermodynamics.
Dunn and Serrin [6] introduced the concept of interstitial working and derived
the Cauchy stress tensor compatible with thermodynamics. First, we summarize their
results and derive the system of equations. Then, we state the theorem for the existence
of local solutions.

2  Slow motions: one-dimensional case

In this section we summarize the results in [2], [8], and [11]. Multiply (1.1) by $u_t$, integrate
in x and t, and then integrate by parts using (1.2). After dividing by $\epsilon$, we have

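(2.1)  $E_c[u](t) + \frac{1}{\epsilon} \int_0^t \int_0^1 \nu u_{xt}^2(x,s)\,dx\,ds = E_c[u](0)$,

where

(2.2)  $E_c[u](t) = \int_0^1 \{ \frac{1}{2\epsilon} u_t^2 + \frac{1}{\epsilon}(W(u_x) - P u_x) + \frac{\epsilon}{2} u_{xx}^2 \}(x,t)\,dx$.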

In (2.2) $W(u_x)$ is a primitive of $\sigma$. For the remainder of the paper we shall assume that
$P = \sigma(\alpha_M)$. This implies that $W(u_x) - P u_x$ is a double-well potential with wells of equal depth.
For the sake of simplicity we shall also assume that $W(u_x) - P u_x$ is given by $\frac{1}{4}(u_x^2 - 1)^2$.
The same conclusions will hold for more general nonlinearities.

Observe  that  (2.1)  is  similar  to  that  for  the  parabolic  equation 

(2.3)  $\nu v_t = \epsilon^2 v_{xx} - (v^3 - v)$,

with either the homogeneous Neumann boundary condition

$v_x(0,t) = 0$,  $v_x(1,t) = 0$

or a Dirichlet condition

$v(0,t) = a$,  $v(1,t) = b$,  $a, b = \pm 1$.

In  particular  the  energy  relation  for  (2.3)  is  given  by 

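(2.4)  $E_p[v](t) + \frac{1}{\epsilon} \int_0^t \int_0^1 \nu v_t^2(x,s)\,dx\,ds = E_p[v](0)$,

where

(2.5)  $E_p[v](t) = \int_0^1 \{ \frac{1}{4\epsilon}(v^2 - 1)^2 + \frac{\epsilon}{2} v_x^2 \}(x,t)\,dx$.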
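
The energy relation (2.4) says that the energy of (2.3) can only decrease along solutions, and slow motion means that a profile with well-separated interfaces barely moves. A minimal numerical sketch of both statements follows. It assumes the energy $E_p[v] = \int_0^1 \{ (v^2-1)^2/(4\epsilon) + \epsilon v_x^2/2 \}\,dx$ and the Neumann condition; the explicit Euler scheme, the cell-centered grid, the tanh initial profile, and all parameter values (`eps`, `nu`, `n`, `dt`, `steps`, `x0`) are choices made for this illustration, not taken from the paper.

```python
import math


def energy(v, eps, dx):
    """Discrete E_p[v]: double-well term plus gradient term."""
    well = sum((vi * vi - 1.0) ** 2 for vi in v) * dx / (4.0 * eps)
    grad = sum((v[i + 1] - v[i]) ** 2 for i in range(len(v) - 1)) * eps / (2.0 * dx)
    return well + grad


def step(v, eps, nu, dx, dt):
    """One explicit Euler step of nu*v_t = eps^2*v_xx - (v^3 - v)."""
    n = len(v)
    new = [0.0] * n
    for i in range(n):
        # Mirrored ghost cells impose the Neumann condition v_x = 0.
        left = v[i - 1] if i > 0 else v[i]
        right = v[i + 1] if i < n - 1 else v[i]
        lap = (left - 2.0 * v[i] + right) / (dx * dx)
        new[i] = v[i] + dt / nu * (eps * eps * lap - (v[i] ** 3 - v[i]))
    return new


def run(eps=0.05, nu=1.0, n=100, dt=0.002, steps=500, x0=0.4):
    """Evolve a single-interface profile; return energies and interface position."""
    dx = 1.0 / n
    x = [(i + 0.5) * dx for i in range(n)]
    v = [math.tanh((xi - x0) / (math.sqrt(2.0) * eps)) for xi in x]
    energies = [energy(v, eps, dx)]
    for _ in range(steps):
        v = step(v, eps, nu, dx, dt)
        energies.append(energy(v, eps, dx))
    # Left edge of the first cell where v is positive.
    interface = next(i for i in range(n) if v[i] > 0.0) * dx
    return energies, interface


if __name__ == "__main__":
    energies, interface = run()
    print("energy at start and end:", energies[0], energies[-1])
    print("interface position at t = 1:", interface)
```

With these parameters the time step is well inside the stability range of the explicit scheme, so the recorded energies should be non-increasing and the interface should still sit at x = 0.4 at t = 1, which is the discrete counterpart of the metastable behavior described in this section.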

Now  we  summarize  the  results  of  the  slow  motions  for  the  parabolic  equation.  We 
assume  for  the  initial  data  of  (2.3)  that 

(2.6)  $w(x) = \lim_{\epsilon \to 0} v^\epsilon(x,0)$

exists as a limit in the $L^1$ norm, where w is a piecewise constant function taking only the values
$\pm 1$, with exactly N discontinuities at $\{x_1, \dots, x_N\}$, and we also assume that the initial data
satisfy

(2.7)  $E_p[v^\epsilon](0) < N c_0 + K_2 \exp(-K/\epsilon)$.

Then,  we  have 

Lemma 2.1  Suppose the initial data for (2.3) satisfy (2.6) and (2.7). Then, for any T
satisfying $0 < T < F \nu \epsilon^s \exp(K/\epsilon)$, we have

(2.8)  $\sup_{0<t<T} \int_0^1 |v^\epsilon(x,t) - v^\epsilon(x,0)|\,dx < (FG)^{1/2} \epsilon^{(s+1)/2}$.


Next, we summarize the results concerning the slow motions of (1.1). As the form of the
energy relation (2.1) resembles (2.4), we can expect to draw the same kind of conclusions
for (1.1). For this purpose we rewrite the energy $E_c[u]$ as

$E_c[u] = E_s[u] + E_p[u_x]$,  $E_s[u] = \int_0^1 \frac{1}{2\epsilon} u_t^2(x,t)\,dx$.

We  assume  that  the  initial  data  for  (1.1)  satisfy 

(2.9)  $u_x^\epsilon(x,0) = v^\epsilon(x,0)$
and

(2.10)  $E_s[u^\epsilon](0) < C \exp(-K/\epsilon)$.

The condition (2.9) is imposed for the sake of simplicity. As long as $u_x(x,0)$ satisfies (2.6)
and (2.7), with v being replaced by $u_x$, the same conclusion should be obtained.

Lemma 2.2  Suppose (2.9) and (2.10) are satisfied. Then, for any T satisfying
$0 < T < F_c \nu \epsilon^s \exp(K/\epsilon)$, the solution to (1.1) satisfies

(2.11)  $\sup_{0<t<T} \int_0^1 |u_x^\epsilon(x,t) - u_x^\epsilon(x,0)|\,dx < (F_c G_c)^{1/2} \epsilon^{(s+1)/2}$.


Using  Nirenberg’s  inequality  and  Lemmas  2.1  and  2.2,  we  can  show 

Theorem 2.3  If s > 1, then the difference in the $L^\infty$ norm between $u_x^\epsilon$ and $v^\epsilon$ is $O(\epsilon^{(s-1)/6})$
for at least $0 < t < F_m \nu \epsilon^s \exp(K/\epsilon)$, where $F_m = \min\{F, F_c\}$.

One of the reasons why we change the magnitude of the initial data is to see how this
influences the metastable states. We should note that for either choice of C above, the
conditions (2.9) and (2.10) are not satisfied. Nevertheless, when C = 1.0, 0.5, 0.1, v and q
have reached the same metastable state in each case. Here, we show the numerical results
for C = 0.1 and 0.01 only. In Figures 3.1 and 3.2 we show how v and q evolve for 0 < t < 10
and then in Figures 3.3 and 3.4 we show the profiles of v and q at t = 1000. We use the
solid lines for v and the gray lines for q. When these lines overlap, we see only the gray
lines. When C = 0.1, they agree at least to $10^{-7}$ at t = 1000 and this agreement continues
at least until t = 10000. In these figures, the values of x should be multiplied by 0.01.

Figure 3.2.  v and q for C = 0.01, 0 < t < 10.

4  Local existence in the multidimensional case


Dunn and Serrin [6] modified the Korteweg theory and derived the following set of equations
for  the  conservation  of  mass,  the  balance  of  linear  momentum,  the  balance  of  energy,  and 
the  Clausius-Duhem  inequality: 


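(4.1a)  $\rho_t + \mathrm{div}(\rho \mathbf{u}) = 0$,

(4.1b)  $\rho \frac{D\mathbf{u}}{Dt} = \mathrm{div}\,T$,

(4.1c)  $\rho \frac{D\varepsilon}{Dt} = T \cdot L - \mathrm{div}\,\mathbf{q} + \mathrm{div}\,\mathbf{w}$,

(4.1d)  $\rho \theta \frac{D\eta}{Dt} + \mathrm{div}\,\mathbf{q} - \frac{\mathbf{q} \cdot (\mathrm{grad}\,\theta)}{\theta} \ge 0$,

where $\frac{Df}{Dt} = f_t + \mathbf{u} \cdot \nabla f$ and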

1.  $\rho = \rho(x,t)$ is the density of the fluid at the point x at time t,

2.  $\mathbf{u} = \mathbf{u}(x,t)$ is the velocity of the fluid,

3.  $\theta = \theta(x,t)$ (> 0) is the absolute temperature,

4.  $\varepsilon = \varepsilon(x,t)$ is the specific internal energy per unit mass,

5.  $\eta = \eta(x,t)$ is the specific entropy per unit mass,

6.  $T = T(x,t)$ is the Cauchy stress tensor,

7.  $\mathbf{q} = \mathbf{q}(x,t)$ is the heat flux vector,

8.  $L = \mathrm{grad}\,\mathbf{u}$.


The main difference from classical thermodynamics is the div w term; w is called the
interstitial work flux and represents spatial interactions of longer range. They have proved
that for a given Helmholtz free energy $\psi(\rho, \theta, d)$, with $d = \mathrm{grad}\,\rho$, the following forms of w and T


$\mathbf{w} = \rho \dot{\rho} \psi_d + \omega$,

$T = (-\rho^2 \psi_\rho + \rho\, d \cdot \psi_d + \rho^2 \nabla \cdot \psi_d) I - \rho\, d \otimes \psi_d$


are compatible with (4.1d). Here, $\rho^2 \psi_\rho(\rho, \theta, d)$ is the pressure and $\omega$ is the "static" portion
of the interstitial work flux w. They have shown that if the material possesses a center of
symmetry, $\omega = 0$. In what follows, we consider materials which possess a center of
symmetry. They also have observed that the classical forms of viscosity and conductivity
tensors are compatible.

In this note we state a result concerning the existence of a unique local smooth solution
in the two-dimensional isothermal motion of Korteweg type materials where the viscous
effect is also included. The three-dimensional case can be discussed similarly. In what follows,
we state the assumptions on the Helmholtz free energy and derive the system that we shall
discuss. We assume that the Helmholtz free energy is given by

(4.3)  $\psi = F(\rho) + \frac{\nu}{2\rho}(\rho_x^2 + \rho_y^2)$,


where F is a smooth function of $\rho$ and $\nu$ is a positive constant. This choice is to make the
terms appearing in (4.4) as simple as possible, yet to reflect the effect of the higher order
terms of $\rho$.¹

With the choice of the Helmholtz free energy given in (4.3) and with $\lambda = -\mu$, the system
then becomes

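(4.4)  $\rho_t + (\rho u)_x + (\rho v)_y = 0$,
       $(\rho u)_t + (\rho u^2)_x + (\rho u v)_y = (T_{11})_x + (T_{12})_y$,
       $(\rho v)_t + (\rho u v)_x + (\rho v^2)_y = (T_{21})_x + (T_{22})_y$,

where u and v are the x and y components of velocity and

(4.5)  $\begin{pmatrix} T_{11} & T_{12} \\ T_{21} & T_{22} \end{pmatrix} = \left( -p + \nu \rho \Delta\rho + \frac{\nu}{2}(\rho_x^2 + \rho_y^2) \right) I - \nu \begin{pmatrix} \rho_x^2 & \rho_x \rho_y \\ \rho_x \rho_y & \rho_y^2 \end{pmatrix} + V$,

(4.6)  $p = \rho^2 F'(\rho)$,

and

(4.7)  $V = \begin{pmatrix} V_{11} & V_{12} \\ V_{21} & V_{22} \end{pmatrix} = \mu \{ (\mathrm{grad}\,\mathbf{u}) + (\mathrm{grad}\,\mathbf{u})^T - (\mathrm{div}\,\mathbf{u}) I \}$.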

Here, I is the unit rank-two tensor and a superscript T denotes the transpose of a tensor.
Since we discuss the existence of a local solution, we do not need the monotonicity of the
pressure in $\rho$. Further computation simplifies the div T term to


(4.8)  $\mathrm{div}\,T = -\nabla p + \nu \rho \nabla(\Delta \rho) + \mathrm{div}\,V$.
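
The simplification leading to (4.8) can be checked directly. The following is a sketch under stated assumptions: the free energy is taken to be $\psi = F(\rho) + \frac{\nu}{2\rho}|\nabla\rho|^2$, the stress is taken in the Dunn and Serrin form $T = (-\rho^2\psi_\rho + \rho\, d \cdot \psi_d + \rho^2 \nabla \cdot \psi_d) I - \rho\, d \otimes \psi_d + V$, and the shorthand $d = \nabla\rho$ and the index notation are introduced here for convenience. The viscous part V is carried along unchanged.

```latex
\begin{aligned}
\psi_d &= \frac{\nu}{\rho}\, d, \qquad
\rho^2 \psi_\rho = p - \frac{\nu}{2}|d|^2, \qquad p = \rho^2 F'(\rho),\\
T - V &= \Bigl(-p + \nu\rho\,\Delta\rho + \frac{\nu}{2}|d|^2\Bigr) I - \nu\, d \otimes d,\\
\partial_j (d_i d_j) &= d_j\,\partial_j d_i + d_i\,\Delta\rho
   = \tfrac12\,\partial_i |d|^2 + d_i\,\Delta\rho
   \quad (\text{since } \partial_j d_i = \partial_i d_j),\\
\bigl[\operatorname{div}(T - V)\bigr]_i
  &= -\partial_i p + \nu\bigl(d_i\,\Delta\rho + \rho\,\partial_i \Delta\rho\bigr)
     + \tfrac{\nu}{2}\,\partial_i |d|^2
     - \nu\bigl(\tfrac12\,\partial_i |d|^2 + d_i\,\Delta\rho\bigr)\\
  &= -\partial_i p + \nu\rho\,\partial_i(\Delta\rho).
\end{aligned}
```

The quadratic gradient terms cancel exactly, which is why this choice of free energy keeps the momentum equations comparatively simple.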

We discuss the local existence for the pure initial value problem of (4.4) with the initial
data given by

(4.9)  $(\rho, u, v)(x, y, 0) = (\rho_0, u_0, v_0)(x, y)$.

¹ Another reasonable choice is to replace the last term in (4.3) with $\frac{\nu}{2}(\rho_x^2 + \rho_y^2)$. Although this choice
may be physically more realistic, mathematically it is more cumbersome to handle. For example, the
expression for div T is very complicated. Therefore, we do not discuss this case (see (4.8)).

We assume that the initial data satisfy


(4.10)  $(\rho_0 - \bar\rho_0, u_0, v_0) \in H^k(R^2)$,  $\rho_0 > \delta > 0$,

where k > 4 and $\bar\rho_0 > 0$ is a positive constant. Denote by $\|\cdot\| = \|\cdot\|_0$ the $L^2$ norm and
by $\|\cdot\|_k$ the k-th order Sobolev norm. Set

Mir  =  ojupt  (|M<)IIJ  +  II Vp(i)f)  +  £  (II Vu(()||J  +  II' VvtiJII2)  it 

and 

MJ  =  E 

iji<fc 

where $w = (\rho, u, v)$. The main result is stated as follows.

Theorem 4.1  For any initial data $(\rho_0, u_0, v_0)$ such that $\rho_0 > \delta > 0$ and $(\rho_0 - \bar\rho_0, u_0, v_0) \in H^k(R^2)$
(k > 4), where $\bar\rho_0 > 0$ is a constant, there exists a T > 0 such that for $t \in [0, T]$
the Cauchy problem (4.4), (4.9) has a unique solution $(\rho, u, v)$ such that
$\rho - \bar\rho_0 \in L^\infty([0,T]; H^{k+1}(R^2))$ and $(u, v) \in L^\infty([0,T]; H^k(R^2))$ and

MJ<c*iw2  +  lM2+l. 

Since the linearized problem of (4.4), (4.9) is not of any classical type, the existence of
solutions is not known even for the linearized problem. We prove the existence of solutions
for the linearized problem by establishing an energy estimate for the dual problem and
then using a duality argument.

Abstract


The Singular Value Decomposition (SVD) is a matrix tool that plays a critical role in many
applications: for example, in signal processing, it is often necessary to calculate the SVD in real
time. We present here a new technique for computing the SVD on a parallel architecture whose
processors are connected via a fat-tree. We tested our idea on the Connection Machine CM-5, and
achieved efficiency up to 40% even for moderately sized matrices.

1  Introduction

Let A be a real m x n matrix. Its singular value decomposition (SVD) is given by

$A = U \Sigma V^T$,

where U and V are respectively m x m and n x n orthogonal matrices and $\Sigma$ is an m x n diagonal
matrix. The best approach to parallel SVD computation is apparently one of the Jacobi type; see,
e.g., [1], [2], [4], [5], [7], [11], [12]. In this paper, we will discuss the efficient implementation of a
Jacobi method on a parallel computer with a fat-tree interconnection network. We will propose a
new Jacobi ordering for a fat-tree and analyze its behavior both theoretically and experimentally
(on a Connection Machine CM-5).

This paper is organized as follows. In the next two subsections, we present the fat-tree
architecture and Jacobi algorithm. Section 2 introduces a new fat-tree ordering, and provides some
kernel programs. We analyze communication costs on a fat-tree network in Section 3, and discuss
implementation results on the CM-5 in Section 4.

1.1  Fat-Tree  Architecture 

The  fat-tree  was  introduced  by  Leiserson  [10]  as  a  novel  approach  to  interconnect  the  processors  of 
a  general-purpose  parallel  supercomputer.  This  communication  structure  can  also  be  seen  in  the 
distributed  computing  environment,  such  as  a  network  of  workstations. 

The routing network of the Connection Machine CM-5 [14] is based on the fat-tree. This parallel
machine consists of up to 544 (= 512 + 32) nodes for the model at the Army High Performance
Computing Research Center (AHPCRC) at the University of Minnesota, and 32 nodes at the
Northeast Parallel Architectures Center (NPAC) at Syracuse University. Each node of the CM-5
is a SPARC chip which runs at 32 MHz and delivers 22 Mips and 5 Mflops. There is a 64 Kbyte
instruction and data cache and a 16 Mbyte memory in each node. All the nodes are synchronized.
In October of 1992, two vector units will be installed in each processing node; each vector unit
is capable of 64 Mflops peak and 40 Mflops sustained [9]. The control and data networks are
connected via a skinny fat-tree structure. By skinny, we mean that the bandwidth does not
increase proportionately to the number of nodes; in particular, the bandwidth is 20 Mbyte/sec per
node in a group of four processors, 10 Mbyte/sec per node in a group of sixteen, and 5 Mbyte/sec
overall. So data contention may severely degrade performance when all nodes need to access a large
set of data from other nodes through the top level of the tree.

1.2  Jacobi  Algorithm 

The one-sided Jacobi method [8] generates an orthogonal matrix V such that the columns of the
matrix W, given by W = AV, are mutually orthogonal. The matrix V can be generated by a
sequence of plane rotations $V^{(1)} V^{(2)} \cdots$, where each $V^{(k)}$ is an identity matrix except for four
entries: $v_{ii} = \cos\theta$, $v_{ij} = -\sin\theta$, $v_{ji} = \sin\theta$ and $v_{jj} = \cos\theta$, where (i, j) represents the
index pair of the columns of A that $V^{(k)}$ orthogonalizes. The SVD computation requires $O(nm^2)$
operations for an m x n matrix A. For a limited number of processors, i.e., up to n/2 processors,
an efficient way is to configure them as a linear array along the horizontal dimension. Columns can
be distributed either in blocks or in a wraparound fashion. Note from the above derivation that
each column-pair can be orthogonalized independently, so that we may transform up to p pairs
concurrently, where p denotes the number of processors. This method was used for computing the
SVD on special machines, e.g., parallel computers such as the Illiac IV [11] and vector processors
such as the CYBER 205 [3]. The one-sided Jacobi method is composed of these major steps:

1.  Compute  the  norm  of  each  column. 

2.  Compute  plane  rotations  to  orthogonalize  paired  columns. 

3.  Apply  the  plane  rotations  to  update  the  columns  and  the  column  norms. 

4.  Permute  the  columns  in  a  pre-chosen  order  to  generate  the  next  column  pairs,  and  repeat  the 
process from step 2.
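
The four steps above can be sketched serially in a few lines. The fragment below is an illustration only, not the authors' implementation: it uses the standard Hestenes rotation formulas, visits the column pairs in cyclic-by-rows order on a single processor (so the permutation of step 4 becomes a double loop), and recomputes the column norms for each pair instead of updating them as in step 3. The function name `one_sided_jacobi`, the tolerance `tol`, and the limit `max_sweeps` are assumptions of this sketch.

```python
import math


def one_sided_jacobi(columns, tol=1e-12, max_sweeps=30):
    """Orthogonalize the columns of A by plane rotations.

    `columns` is a list of columns, each a list of floats. Returns the
    rotated columns (the columns of W = A V) and the singular values,
    which are the norms of those columns, sorted in decreasing order.
    """
    cols = [list(c) for c in columns]
    n = len(cols)
    for _ in range(max_sweeps):
        rotated = False  # set whenever some pair still needed a rotation
        for i in range(n - 1):
            for j in range(i + 1, n):
                alpha = sum(x * x for x in cols[i])
                beta = sum(y * y for y in cols[j])
                gamma = sum(x * y for x, y in zip(cols[i], cols[j]))
                if abs(gamma) <= tol * math.sqrt(alpha * beta):
                    continue  # this pair is already orthogonal
                rotated = True
                # Choose the smaller rotation angle that zeroes the inner product.
                zeta = (beta - alpha) / (2.0 * gamma)
                t = math.copysign(1.0, zeta) / (abs(zeta) + math.sqrt(1.0 + zeta * zeta))
                c = 1.0 / math.sqrt(1.0 + t * t)
                s = c * t
                cols[i], cols[j] = (
                    [c * x - s * y for x, y in zip(cols[i], cols[j])],
                    [s * x + c * y for x, y in zip(cols[i], cols[j])],
                )
        if not rotated:
            break  # a full sweep without rotations: converged
    sigma = sorted((math.sqrt(sum(x * x for x in c)) for c in cols), reverse=True)
    return cols, sigma


if __name__ == "__main__":
    # A = [[3, 0], [4, 5]] stored by columns.
    w, sigma = one_sided_jacobi([[3.0, 4.0], [0.0, 5.0]])
    print(sigma)
```

For the 2 x 2 example, the matrix $A^T A$ has eigenvalues 45 and 5, so the computed singular values should be close to $\sqrt{45}$ and $\sqrt{5}$.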

If the column pairs are distributed to different processors, then step 4 requires communication.
In the case of a two-dimensional mesh (as in the ILLIAC IV), each column is itself distributed
among different processors and step 3 requires that the rotation parameters be transmitted to all
the processors containing each given column pair. In the case of a one-dimensional array, each
column pair is stored entirely in one processor and significant speedup is possible if vector units
are present within each processor.

In this paper, we use the one-dimensional array, with each processor storing two blocks of
columns. That is, we use a block Jacobi algorithm, in which the column blocks are circulated
according to a given ordering to be defined, and the cyclic-by-rows ordering [6] is used within each
block.

2  New  SVD  Algorithm 

In  the  past,  when  the  hypercube  interconnection  topology  was  in  vogue,  several  Jacobi  ordering 
schemes were proposed [1], [4], [7] to utilize the hypercube structure. Here, for a one-dimensional
array  of  processors  with  no  wraparound,  a  chess-tournament  ordering  [2]  may  be  chosen  because 
it  does  not  waste  processing  power  or  memory  space.  However,  communication  requires  a  two-way 
transmission  of  columns  between  adjacent  processors.  An  alternative  is  a  ring  ordering  [4]  which 
uses  only  one-way  transmission,  but  it  requires  a  wraparound  connection.  To  develop  an  ideal 
ordering  for  a  fat-tree,  we  aim  to  minimize  the  total  path  length  by  using  the  extra  bandwidth  of 
a  fat-tree. 

2.1  Fat-Tree  Ordering 

It is easiest to describe this ordering by an example. In Figure 1 we show the case for sixteen
columns and eight processors. For pedagogic reasons, we use a base 8 numbering of the indices
and so A=8, B=9, ..., H=15. The XOR (exclusive-or) column is the binary XOR of the column
indices: at each step, the XOR value of each index pair is the same, and from one step to the next
this quantity follows the Gray code. The cost-to-this-step column denotes the maximum number
of levels up the tree the messages must travel to reach their destinations from the previous step.
In general, if there are p processors and two columns per processor, then a sweep requires 2p - 2
steps. We save one step per sweep because the last step of sweep i can be included as the first step
for sweep i + 1.

Figure 1.  Fat-tree ordering based on the Gray code
(eight processors and sixteen columns).

2.2  Kernel  Programs 

To see how to write a simple node program to generate the fat-tree ordering, we use the following
observations from the example in Figure 1. To simplify the presentation, we consider only the
forward sweep. At each step, each processor must communicate with a remote processor whose
label differs in one bit. The basis for our kernel presented here is to compute a mask such that the
exclusive-or of the mask with the current processor label yields the remote processor label. When
using the Gray code, this mask can be computed using only the step number; it is independent of
the processor label.

We also use the following observations. First, we use the fact that the XORs follow the Gray
code. Second, we observe that during the second half of the forward sweep (steps 7-14), the lower
half of the columns (numbers 0, ..., 7 in Figure 1) remain fixed in the processor with the same
number. Hence the location of the remaining columns is fixed entirely by the Gray code. Third,
we observe that the first half of the steps (steps 0-6) amount to doing a Gray code fat-tree ordering
on each half of the processor array separately. The only remaining step is the transition from the
first half to the second half (step 6 to step 7). Hence we can define the ordering for these steps
recursively from the smaller cases.

We  can  summarize  the  steps  for  the  forward  sweep  in  the  following  procedure,  in  a  pseudo- 
MATLAB  notation  assuming  for  the  sake  of  simplicity  of  the  presentation  that  the  sends  do  not 
block. 

% Node program for processor ProcNo for one forward sweep using an array of
% NProcs processors. Assume Column(1) and Column(2) are the head and tail
% columns, respectively, in the local memory.

Orthogonalize_Individual_Column_Blocks;   % (within each block)

for StepNo = 1 : 2*NProcs-2,
    Pairwise_Orthogonalize_Column_Blocks;

    % For each processor, figure out where the data goes and send it.
    [Mask, ColumnSwitch] = MakeMask(StepNo, ProcNo, NProcs);
    RemoteProcNo = XOR(ProcNo, Mask);
    Send Column(2) to remote processor RemoteProcNo;
    if ColumnSwitch == rotate,
        Column(2) = Column(1);
        Column(1) = receive_from(RemoteProcNo);
    else
        Column(2) = receive_from(RemoteProcNo);
    end;
end;

function [Mask, ColumnSwitch] = MakeMask(StepNo, ProcNo, NoProcs);
% ColumnSwitch indicates which column of the pair is to be sent/received.
% Mask is the XOR mask so that RemoteProcNo = XOR(ProcNo, Mask).
% The Mask is computed independent of the processor label ProcNo.

% Handle first 2 steps as special cases to start recursion.
if StepNo <= 2,
    Mask = 1;
    ColumnSwitch = tail;
    if rem(ProcNo,2) == 1 & StepNo == 1, ColumnSwitch = rotate; end;

% First half of sweep: pretend this is a separate fat-tree sweep on each
% half of the processor array.
elseif StepNo < NoProcs-1,
    [Mask, ColumnSwitch] = MakeMask(StepNo, rem(ProcNo,NoProcs/2), NoProcs/2);

% Middle of sweep: here is first exchange through top of tree.
elseif StepNo == NoProcs-1,
    Mask = NoProcs/2;
    ColumnSwitch = tail;
    if ProcNo >= NoProcs/2, ColumnSwitch = rotate; end;

% Last half of sweep: only tail columns move, figure Mask using Gray codes.
elseif StepNo > NoProcs-1,
    Mask = xor(gray(StepNo), gray(StepNo+1));
    ColumnSwitch = tail;
end;
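
The kernel can be exercised with a small serial simulation. The sketch below transcribes MakeMask into Python and moves column labels between simulated processors. It assumes 0-based processor labels, a power-of-two processor count, the usual binary-reflected Gray code `gray(n) = n XOR (n >> 1)`, and an initial layout with columns 2p and 2p+1 on processor p; the layout used in Figure 1 may differ, and the names `make_mask` and `forward_sweep` are introduced here for illustration.

```python
from itertools import combinations


def gray(n):
    """Binary-reflected Gray code of n."""
    return n ^ (n >> 1)


def make_mask(step, proc, nprocs):
    """Return (mask, rotate) for one exchange, following the MakeMask kernel."""
    if step <= 2:                        # first two steps start the recursion
        return 1, (proc % 2 == 1 and step == 1)
    if step < nprocs - 1:                # first half: a sweep on each half-array
        return make_mask(step, proc % (nprocs // 2), nprocs // 2)
    if step == nprocs - 1:               # first exchange through the top of the tree
        return nprocs // 2, proc >= nprocs // 2
    return gray(step) ^ gray(step + 1), False    # last half: only tails move


def forward_sweep(nprocs):
    """Simulate one forward sweep; return the column pairs met at each step."""
    head = [2 * p for p in range(nprocs)]        # assumed initial layout
    tail = [2 * p + 1 for p in range(nprocs)]
    schedule = []
    for step in range(1, 2 * nprocs - 1):
        schedule.append([tuple(sorted(pair)) for pair in zip(head, tail)])
        new_head, new_tail = head[:], tail[:]
        for p in range(nprocs):
            mask, rotate = make_mask(step, p, nprocs)
            received = tail[p ^ mask]            # the partner always sends its tail
            if rotate:
                new_head[p], new_tail[p] = received, head[p]
            else:
                new_tail[p] = received
        head, tail = new_head, new_tail
    # The layout left after the last exchange is the pairing that the text
    # folds into the first step of the next sweep.
    schedule.append([tuple(sorted(pair)) for pair in zip(head, tail)])
    return schedule


if __name__ == "__main__":
    for step_no, pairs in enumerate(forward_sweep(4), start=1):
        print(step_no, pairs)
```

Collecting the pairs from the 2p - 2 loop steps plus the final layout gives 2p - 1 pairings of p pairs each; under the assumptions above these should be exactly the p(2p - 1) distinct column pairs, which is what a complete sweep requires.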

2.3  Test  of  Convergence 

For a fat-tree ordering, any consecutive 2p - 2 (or even 2p - 1) steps may not constitute one sweep.
We must complete a sweep, either forward or backward, to ensure that all column pairs have been
orthogonalized. The convergence test is simple. We maintain a one-bit counter in every processor.
The counter is reset at the beginning of every sweep, and is set whenever a column pair needs to
be orthogonalized. At the end of the sweep, a global OR operation is performed and convergence is
achieved if no bit has been set.




3  Analysis  on  a  Binary  Fat-Tree  Network 

We consider a binary fat-tree with p processors, and assume that the communication time from
one processor to another is determined by the number of links a message has to traverse and the
capacity of these links. Our assumption is supported by experimental results reported in [13].
Define a channel to be the communication link between any two adjacent nodes; here a node can
be a processor or an internal switching element. The capacity of a channel equals the number of
parallel wires in the channel, and thus the maximum number of simultaneous bit-serial messages
it can support [10]. Label the levels from bottom up as level 1, 2, ..., so that the capacity of the
channels at level l is $2^{l-1}$ times the capacity of the channels at the bottom level.
Let us ignore start-up and latency costs. Within a single problem, all the messages have the same
size and thus we measure the cost of multiple message transmission using path length.

For the ring ordering, at each step a message always goes through the top level and the maximum
path length equals 2 log p (unless otherwise stated, we use base 2 logarithms). Since there is at
most one message at each channel, congestion never occurs and it takes 2p - 1 steps to complete
one sweep. The total path length equals (4p - 2) log p.

The fat-tree ordering does not cause congestion on a fat-tree network. Hence it suffices to count
the number of times that each level is used. Denote that count by c(p, l). Consider the forward
sweep. We see from Figure 1 that with p = 8 processors, the top level is used in two transition
steps, the middle level in six steps and the bottom in fourteen steps. The first six steps correspond
to the fat-tree ordering for the first four processors, and also for the second four processors. In the
general case of p processors, there are 2p - 2 steps using log p levels, of which the first p - 2 steps
amount to the ordering for p/2 processors. When the number of processors doubles to 2p, we add
a new top level and the first 2p - 2 steps correspond exactly to the p processor ordering. There are
an extra 2p steps, of which two use the new top level, four use the next level (the old top level),
eight use the following level, etc. Formally, we get the recurrence

$c(2p, l) = c(p, l) + 4(p/2^l)$  for l = 1, ..., log p,

starting with c(p, log p) = 2 and c(p, l) = 0 for l > log p. Therefore, $c(p, l) = 4p/2^l - 2$, and the
total path length is given by

$2 \sum_{l=1}^{\log p} c(p, l) = 2[(2p - 2) + (p - 2) + \cdots + 14 + 6 + 2] = 8p - 4 \log p - 8$.


For a large p, the path length ratio of the two orderings grows like (log p)/2, a very attractive result
for our new ordering.
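
The recurrence and its closed form are easy to check numerically. The short sketch below builds c(p, l) by repeated doubling from p = 2 and compares the resulting path length with that of the ring ordering; the function names are chosen here for illustration and p is assumed to be a power of two.

```python
def level_counts(p):
    """c(p, l) for l = 1..log2(p), built from c(2p, l) = c(p, l) + 4p/2^l."""
    counts = {1: 2}                 # p = 2: the single level is used in both steps
    q = 2
    while q < p:
        top = q.bit_length() - 1    # log2(q), the current number of levels
        counts = {l: counts[l] + 4 * q // 2 ** l for l in range(1, top + 1)}
        counts[top + 1] = 2         # the new top level of the tree with 2q leaves
        q *= 2
    return counts


def fat_tree_path_length(p):
    """Total path length of one fat-tree sweep: twice the sum of the level counts."""
    return 2 * sum(level_counts(p).values())


def ring_path_length(p):
    """Total path length of one ring sweep: (4p - 2) log2(p)."""
    return (4 * p - 2) * (p.bit_length() - 1)


if __name__ == "__main__":
    for p in (8, 32, 1024):
        print(p, level_counts(p), fat_tree_path_length(p), ring_path_length(p))
```

For p = 8 this gives the counts 14, 6 and 2 quoted from Figure 1 and a path length of 44 against 90 for the ring ordering; for p = 32 the two totals are 228 and 630.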

4  Connection Machine CM-5

Although the CM-5 network is a 4-way tree, the analysis on 2-way trees is applicable. We take
a 4-way tree and expand every interior node into a binary tree consisting of that node with two
new children each connected to two of the four former children. The number of levels as well as
the path length are doubled. However, the CM-5 is skinny and the capacity only doubles at every
level. Hence it becomes a skinny 2-way tree in which the capacity goes up by $\sqrt{2}$ at each level.

To simplify our analysis, we concentrate on the 32-processor model. So p = 32 and there are
three tree levels because $\lceil \log_4 p \rceil = 3$. The dominating communication cost for the CM-5 is the
overhead time that is spent on address calculation, buffer space management, and so on. Let $t_{or}$
and $t_{of}$ represent the cost of such overhead in each step for the ring and the fat-tree ordering,
respectively. Let $t_{cf}$ be the overhead cost for resolving contention in the channels of the CM-5
network when applying the fat-tree ordering, and let $t_e$ be the time for traversing an edge in the
network. We note that $t_e < t_{cf} < t_{oh}$, where $t_{oh} \in \{t_{or}, t_{of}\}$, $t_{cf} \approx t_{oh}$, and $t_e \in (t_{oh}/10^3, t_{oh}/10^2)$.
The overheads $t_{or}$ and $t_{of}$ depend on the data size and are of equal magnitude.

We proceed to compute the coefficient for $t_e$, which we assume to equal the number of messages that traverse the channels in one sweep. For the ring ordering, there is no congestion in the network. So the coefficient for $t_e$ is $2 \cdot 63 \cdot 3$ (= 378), and the total cost equals $63\,t_{or} + 378\,t_e$. For the fat-tree ordering, we observe that level 1 is visited 62 times, level 2 fourteen times, and level 3 two times. We model the resolution of the contention by sending messages in batches. Messages through level 2 must be sent in two batches and messages through level 3 in four batches, in order to avoid contention. Hence we account for the thinness of the CM-5 network by assigning a weight of two to level 2 and a weight of four to level 3. The total path length is $2(62 + 2 \cdot 14 + 4 \cdot 2) = 196$ and the total cost equals $62\,t_{of} + 196\,t_e + t_{cf}$. Thus, on the CM-5 the fat-tree ordering may not outperform the ring ordering because of the extra cost associated with message contention.
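The arithmetic of this cost model can be sketched in a few lines of Python. The visit counts and batch weights are those quoted in the text; the function names and the numeric values chosen for the overhead, edge, and contention times are assumptions made only for illustration.

```python
# Cost model sketch for p = 32 processors (three tree levels).
# The counts 63, 62, 14, 2 and the batch weights 1, 2, 4 come from the text;
# the numeric time values used below are illustrative assumptions.

def ring_cost(t_or, t_e, steps=63, levels=3):
    """Return (total cost, coefficient of t_e) for the ring ordering."""
    path_length = 2 * steps * levels              # 2 * 63 * 3 = 378
    return steps * t_or + path_length * t_e, path_length


def fat_tree_cost(t_of, t_e, t_cf, steps=62,
                  visits=(62, 14, 2), weights=(1, 2, 4)):
    """Return (total cost, weighted path length) for the fat-tree ordering."""
    # visits[k] is how often level k+1 is used; weights[k] is the number of
    # batches assumed necessary at that level to avoid contention.
    path_length = 2 * sum(v * w for v, w in zip(visits, weights))   # 196
    return steps * t_of + path_length * t_e + t_cf, path_length


if __name__ == "__main__":
    t_oh = 1.0            # assumed overhead per step (arbitrary time unit)
    t_e = t_oh / 500.0    # assumed, inside the stated range (t_oh/1000, t_oh/100)
    t_cf = 0.9 * t_oh     # assumed, slightly below t_oh
    print(ring_cost(t_oh, t_e))
    print(fat_tree_cost(t_oh, t_e, t_cf))
```

Because the edge-traversal term is small compared with the per-step overhead, the two totals in this sketch stay close; which ordering wins then depends on the contention term, as the text observes.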

4.1  Experimental  Results 

In Table 1 we present implementation results on a 32-node CM-5 for random n × n matrices with n ranging from 64 to 1024. The program was written in Fortran and each experiment was repeated ten times. We measured the overall and computation (by disabling communication) costs for one sweep, and estimated the communication cost by subtracting the latter from the former. Our results show that, despite the message congestion that it causes on the CM-5, the fat-tree ordering gets more competitive as n grows, justifying our effort to minimize the total message path length (see also [13]). The mflops (million floating-point operations per second) figures in Table 2 are computed based on the count that $8n^3$ flops are required for one sweep. We conjecture that the compute performance deteriorates when n gets beyond 512 because the cache is no longer large enough to hold the huge column blocks. Nonetheless, our implementation results show how, as the message size increases (hence $t_e$ increases [13]), the fat-tree ordering quickly becomes competitive.
                         n = 64       n = 128      n = 256      n = 512      n = 1024
Overall      Ring        [illegible]  [illegible]  2.628        [illegible]  [illegible]
             Fat-tree    [illegible]  [illegible]  2.237        [illegible]  [illegible]
Compute      Ring        [illegible]  [illegible]  1.871        [illegible]  1.309e2
             Fat-tree    [illegible]  [illegible]  1.878        [illegible]  [illegible]
Communicate  Ring        [illegible]  [illegible]  [illegible]  3.010        7.110
             Fat-tree    [illegible]  [illegible]  0.359        3.020        5.140

Table 1. CPU Time (seconds) of Ring and Fat-Tree Orderings


                     n = 64       n = 128  n = 256  n = 512  n = 1024
Overall  Ring        27.61        51.96    51.07    59.85    62.25
         Fat-tree    25.78        48.20    60.00    59.82    63.11
Compute  Ring        [illegible]  69.60    71.74    71.92    65.62
         Fat-tree    61.03        69.33    71.47    71.92    65.57

Table 2. Mflops Rates of Ring and Fat-Tree Orderings
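The relation between the two tables can be checked with a short Python sketch. It assumes only the flop count of $8n^3$ per sweep stated in the text; the function name `mflops` is ours. The inputs are the legible n = 256 timings.

```python
def mflops(n, seconds):
    """Mflops rate for one sweep, assuming 8 * n**3 flops per sweep."""
    return 8 * n**3 / (seconds * 1e6)


if __name__ == "__main__":
    # Legible n = 256 timings (seconds): overall and compute, ring and fat-tree.
    for label, t in [("overall ring", 2.628), ("overall fat-tree", 2.237),
                     ("compute ring", 1.871), ("compute fat-tree", 1.878)]:
        print(label, round(mflops(256, t), 2))
```

The four rates printed for n = 256 agree with the corresponding Table 2 entries (51.07, 60.00, 71.74, 71.47) to two decimals.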
Abstract

We introduce a framework for exploring array detection problems in a reduced dimensional space by exploiting the theory of invariance in hypothesis testing. This involves calculating a low dimensional basis set of functions called the maximal invariant, the statistics of which are often tractable to obtain, thereby making analysis feasible and facilitating the search for tests with some optimality property. Using this approach, we obtain a locally most powerful test for the unstructured covariance case and show that the Kelly and AMF detectors form an algebraic span for any invariant detector. Applying the same framework to structured covariance matrices, we gain some insights and propose several new detectors which are shown to perform as well as or better than existing detectors.¹

Introduction 

The problem of detecting a signal vector of known direction but unknown strength in Gaussian noise whose covariance matrix is unknown has received much attention lately. In [7], Reed et al. used the sample covariance estimate from secondary (signal free) data vectors to derive a weight vector for use in an adaptive matched filter (AMF) detector. Kelly [2] used the Generalized Likelihood Ratio (GLR) procedure to derive a constant false alarm rate (CFAR) test. Both methods assume that the covariance matrix is completely unknown (unstructured). In many applications, however, the array geometry and partial information about the noise environment (number of interferers, rough bearing estimates, etc.) impose a structure on the covariance matrix. It has been shown in [1] and [4] that the use of structured covariance estimates results in a significant improvement in performance in terms of gain in $P_D$ and reduction in the number of secondary data vectors required.

In this research, we introduce a framework for studying the optimality properties of these tests. We consider the following structure for the covariance matrix:

$$R = \Psi B \Psi^H + \lambda R_0 \qquad (1)$$

where $R$ ($N \times N$) is the covariance matrix, $\Psi$ ($N \times d$) spans a rank-$d$ subspace and $R_0$ is a known covariance matrix. For this research, we assume that $\Psi$ is known while $B$ and $\lambda$ are not. This structure not only corresponds to the case of a low rank interference component in a dominant subspace (which frequently arises in narrow-band processing when the noise has an interference component due to a small number of sources superimposed on the receiver noise, which is usually white), but also, as a special case, reduces to the unstructured matrix when $d$ equals $N$. We shall therefore work with this model to obtain general results which can then be applied to specific instances.

¹ An earlier version of this material was presented at the IEEE ASSP conference, March 1992, San Francisco, CA.

Another case is the block diagonal form for the covariance, which may be used to model a non-stationary environment.

Unfortunately,  it  turns  out  that  for  these  covariance  structures,  with  the  signal  bearing  and 
waveform  known,  it  becomes  intractable  to  use  the  GLR  procedure  to  obtain  a  test  statistic. 

Consequently, we approach signal detection from the viewpoint of the general theory of hypothesis testing. We model the signal strength $\mu$ as deterministic but unknown. Together with the unknown covariance matrix, it forms the set of parameters describing the distribution of the observed data vectors. The problem of signal detection becomes one of choosing between two disjoint parameter sets based on the observations. Thus we have the following hypothesis testing problem:

Given

$$X_{N \times L} \sim \mathcal{N}(\mu a e_1^H, R \otimes I) \qquad (2)$$

where the columns of $X$ are independent data vectors, each normally distributed with covariance $R$ as in (1), $a$ is the signal vector (known), possibly present only in the first column, and $\mu$ is its strength (unknown).

Test

$$H_0 : \mu = 0 \quad \text{versus} \quad H_1 : \mu \neq 0$$

Note that the covariance is a nuisance parameter which should not affect the decision statistic. This motivates us to reduce the problem as follows. Transformations on the data that induce transformations on the parameters to which the parameter sets are invariant leave the decision problem unchanged. Therefore, the decision statistic should also be invariant to all such transformations. More concretely, this can be formulated as follows:

Let $X$ be the data characterized by the probability distribution $P_\theta$, $\theta \in \Omega$, and let $g$ be a 1 : 1 onto transformation on the sample space such that $gX$ is distributed as $P_{\theta'}$, $\theta' \in \Omega$. This transformation thereby induces a transformation $\bar{g}$ on the parameter space. It is shown in [5] that the set of all transformations $g$ such that the corresponding induced transformation $\bar{g}$ is a 1 : 1 map of $\Omega$ onto itself forms a group.

The decision problem $H_0 : \theta \in \Omega_0$ vs $H_1 : \theta \in \Omega_1$ is invariant to the group of transformations, $G$, if $\bar{g}\,\Omega_i = \Omega_i$, $i = 0, 1$ for all $g \in G$. In that case we require the decision statistic to be invariant to all transformations in $G$.

This principle of invariance (Lehmann) greatly reduces the class of detectors to be considered and, frequently, it may become possible to find a uniformly most powerful test within this smaller invariant class (UMPI), even though no general UMP test may exist. Often, the GLR procedure leads to such a test. In our case, since the GLR is unavailable, we proceed by deriving the group of transformations that leave the problem invariant. From this, we obtain the maximal invariant, which is the algebraic basis for the largest set of independent functions of the data that are invariant to the transformations. These functions separate the sample space into orbits or invariant subsets. Thus, $M(X)$ is a maximal invariant iff

$$M(X) = M(g(X)), \quad \forall g \in G \qquad (3)$$

$$M(X_1) = M(X_2) \;\Rightarrow\; X_1 = g(X_2) \text{ for some } g \in G$$

It  is  shown  in  [5]  that  all  invariant  test  statistics  are  functions  of  the  maximal  invariant,  whose 
distribution  depends  on  a  reduced  parameter  set  (this  may  eliminate  the  nuisance  parameters 
from  the  problem,  which  is  a  very  desirable  feature).  The  maximal  invariant  turns  out  to  be  a 
small  set  and  it  is  feasible  to  come  up  with  a  reasonable  test  statistic. 
A Maximal Invariant Framework

To begin, consider as a special case of (1) the following structure

$$R = \begin{pmatrix} R_\Psi & 0 \\ 0 & \sigma^2 I_{N-d} \end{pmatrix} \qquad (4)$$

This is completely equivalent to equation (1) with $R_\Psi = B + \sigma^2 I_d$, since it can be obtained by a known linear transformation on the data. Again, for the same reason, we can assume the following form for the signal vector, $a$, and partition the data matrix accordingly:

$$a = \begin{pmatrix} a_1 \\ 0 \\ a_2 \\ 0 \end{pmatrix}, \qquad X = \begin{pmatrix} x_{11} & x_{12} \\ x_{21} & X_{22} \\ x_{31} & x_{32} \\ x_{41} & X_{42} \end{pmatrix} \qquad (5)$$

where $x_{11}$ is $1 \times 1$, $x_{12}$ is $1 \times (L-1)$, $x_{21}$ is $(d-1) \times 1$, $x_{31}$ is $1 \times 1$ and $x_{41}$ is $(N-d-1) \times 1$.


We  can  represent  this  matrix  as  a  length-NL  vector 


(6) 


where $\mathrm{vec}(A) = [A_1^H \; A_2^H \; \ldots \; A_r^H]^H$ for $A = [A_1 : A_2 : \cdots : A_r]$.

This is distributed as a Gaussian vector with mean $\mu a$ and covariance $\mathrm{diag}[R_\Psi \otimes I_L, \; \sigma^2 I_{(N-d)L}]$, where $\otimes$ denotes the Kronecker product.

This decision problem is invariant to all transformations which preserve the Gaussian nature of the distribution, the mean vector to a scale factor and the structure of the covariance matrix. The largest group of such linear transformations is given by $T(x) = Gx$ where


'l 

0 

\ 

n 

f  1  0  N 

V 

u 

V  0  u*  j 

/ 

(7) 


where  U\(L  -  1)  and  U?((N  -  d)L  -  1)  are  unitary  matrices,  (31*  is  (1  x  L  -  1)  and  Y  is 
(tV  —  1  x  L  —  1), 

We  show  in  [8]  that  the  maximal  invariant  to  this  group  of  transformations  is  given  by 


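$$m_1 = \frac{\left| x_{11} - x_{12} X_{22}^H (X_{22} X_{22}^H)^{-1} x_{21} \right|^2}{x_{12} \left( I - X_{22}^H (X_{22} X_{22}^H)^{-1} X_{22} \right) x_{12}^H}$$

$$m_2 = x_{21}^H (X_{22} X_{22}^H)^{-1} x_{21}$$

$$m_3 = \frac{\| x_{31} \|^2}{\| x_{32} \|^2 + \| x_{41} \|^2 + \| X_{42} \|^2}$$

$$m_4 = \frac{x_{11} - x_{12} X_{22}^H (X_{22} X_{22}^H)^{-1} x_{21}}{x_{31}} \qquad (8)$$

A corresponding maximal invariant in the parameter set is given by

$$\theta_1 = |\mu|^2 |a_1|^2 (R_\Psi^{-1})_{11}$$

$$\theta_2 = \sigma^2 (R_\Psi^{-1})_{11} \qquad (9)$$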


Thus we have greatly reduced the dimensionality of the problem. We obtain the density function for the maximal invariant [8], which is now parameterized by $\theta_1$ and $\theta_2$ above. We show there that no UMP test exists for this problem. Further, since $\theta_2$ is a free parameter even under $H_0$, the distribution function of the maximal invariant is not completely specified thereunder and hence an invariant decision statistic will, in general, not have the CFAR property. Approximate CFARness is all one can hope for.

For the unstructured case, the maximal invariant reduces even further, to $m_1$ and $m_2$, and the corresponding parameter set to a single parameter $\theta_1$ (which is the SNR). The distribution function under $H_0$ only depends on the dimension of the data set, and so in this case, any invariant decision rule will be CFAR. Again, in this case no UMP test exists. However, in many applications the performance is critical only for low SNR and a locally most powerful invariant (LMPI) test in the limit of zero SNR is of interest. Since the parameter space is one dimensional, it becomes feasible to obtain the LMPI test statistic following the theory in [6]. The LMPI decision rule in the limit of $\theta_0$ is given by:


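$$\frac{\left. \partial f_\theta(x) / \partial \theta \right|_{\theta = \theta_0}}{f_{\theta_0}(x)} \;\underset{H_0}{\overset{H_1}{\gtrless}}\; \tau \qquad (10)$$

where $f_\theta$ is the density function of $x$ with parameter $\theta$.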

In [8] we derive the following density function for $m_1$ and $m_2$:


f(mi,m2) 


*  1 


(1  +  m2)~;Vm*  2 
(1  +  171)+  m2)i-,v 


_ Li _ 

1  +  m  1 


L-N  a 

y'  k  / _ mi 

“  (1  +  m2)(l  +  mi  +  m2) 


)‘ 


(ID 


where  k\  =  N-V.L-N-U  and  =  TTtfzk ! 

Applying the rule in (10), we obtain the following LMPI test:

$$\frac{(L-N)\,t_K - 1}{(1 + m_2)(t_K + 1)} \;\underset{H_0}{\overset{H_1}{\gtrless}}\; \tau$$

where $t_K = m_1/(1 + m_2)$ is the Kelly statistic.

The  probability  of  false  alarm  for  this  test  is  given  by 


PF  A  =  (1  - 


T  \L- 1/  L-N  L_N 
L-N1  '  L  -  N  +  l 


(12) 


(13) 
for $\tau > 0$. The detection probability is calculated numerically as a finite sum of simple integrals. Preliminary comparisons with the Kelly statistic indicate a slightly better performance at very low SNR (a gain of 0.1 dB for N = 4, L = 9, p/a = .1 at -5 dB SNR) at the expense of a degradation in the higher SNR region (0.3 dB loss at 10 dB SNR). Further simulations are in progress.

Finally, we note that $m_1$ is exactly the AMF statistic and the Kelly statistic is $m_1/(1 + m_2)$. Thus these two form an equivalent basis set to $m_1$ and $m_2$. This implies that they form an algebraic basis for all invariant detectors and, in searching for viable detectors, it is sufficient to look at compositions of them. It is not necessary to explore alternative ways of projecting down the initial raw high-dimensional data.
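The relation between the maximal invariant and the two classical detectors can be checked numerically. The sketch below is an illustration under stated assumptions, not the authors' code. It uses real-valued data instead of complex data, takes the steering vector to be $a = e_1$, uses the unnormalized sample covariance $S = X_2 X_2^T$ of the secondary data, and writes the Kelly statistic in its bounded form $\eta = \mathrm{AMF}/(1 + x^T S^{-1} x)$, for which $\eta/(1-\eta)$ should equal $m_1/(1+m_2)$. All function names are hypothetical.

```python
import random


def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * z[k] for k in range(r + 1, n))
        z[r] = (M[r][n] - s) / M[r][r]
    return z


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def gram(X):
    """Return X X^T for a matrix stored as a list of rows."""
    return [[dot(r, s) for s in X] for r in X]


def maximal_invariant(x11, x12, x21, X22):
    """m1 and m2 for real data partitioned as [[x11, x12], [x21, X22]]."""
    S22 = gram(X22)
    z = solve(S22, x21)                       # S22^{-1} x21
    s21 = [dot(row, x12) for row in X22]      # X22 x12^T
    w = solve(S22, s21)                       # S22^{-1} X22 x12^T
    m2 = dot(x21, z)
    m1 = (x11 - dot(s21, z)) ** 2 / (dot(x12, x12) - dot(s21, w))
    return m1, m2


def amf_and_kelly(x, X2):
    """AMF and bounded Kelly statistics for steering vector a = e1."""
    S = gram(X2)
    u = solve(S, x)                                   # S^{-1} x
    v = solve(S, [1.0] + [0.0] * (len(x) - 1))        # S^{-1} e1
    amf = u[0] ** 2 / v[0]
    return amf, amf / (1.0 + dot(x, u))


random.seed(0)
N, L = 3, 8                                   # assumed small test dimensions
X = [[random.gauss(0.0, 1.0) for _ in range(L)] for _ in range(N)]
x = [row[0] for row in X]                     # primary data vector
X2 = [row[1:] for row in X]                   # secondary data vectors
m1, m2 = maximal_invariant(X[0][0], X[0][1:], x[1:], [r[1:] for r in X[1:]])
amf, kelly = amf_and_kelly(x, X2)
t_k = m1 / (1.0 + m2)
print(abs(amf - m1), abs(kelly / (1.0 - kelly) - t_k))
```

Both printed differences should be at rounding-error level, which is the numerical counterpart of the statement that the AMF and Kelly statistics carry the same information as $m_1$ and $m_2$.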


Detectors  for  subspace  covariance  structures 

For the structure in (4), the UMPI test does not exist and the notion of an LMPI test is not directly applicable either. However, based on considerations of the maximum likelihood estimates of the covariance from the signal free data, we obtain an invariant test which reduces to the Kelly statistic for the unstructured case:


Where 


|| 5~*z ||2 

aHS~la  Ha 


S  —  d  iag 


(  1 

*12 

\  tH  tH  1  \ 

*22 

1  x12  x22  J 

\  (N-d)L- 

-1  / 

(14) 


(15) 


with the partitioning of the data and the signal vector as in (5). This is shown to be approximately CFAR, and the simulation results in [8] show that it outperforms the Kelly test applied to the data truncated to the span of the interference and signal spaces. In fact, it does nearly as well as the clairvoyant colored noise matched filter, whose weights are based on the true covariance and which therefore bounds the best achievable performance.

In  some  cases,  the  noise  level  is  known  to  be  of  the  same  order  of  magnitude  in  each  of  the 
subspaces.  This  situation  may  arise  for  block  diagonal  covariance  matrices  modelling  certain 
kinds  of  non-stationary  environments.  In  this  case,  a  CFAR  test  is  shown  to  perform  almost  as 
well  as  the  test  proposed  by  Kelly  in  [2]. 


Conclusion  and  Comments 

Detection in an array environment involves projecting down the multivariate data to a scalar statistic. Since any reasonable statistic must satisfy the invariance criterion, the maximal invariant set specifies all the functions one need consider in devising the detector. Since this set is often small, it is feasible to do analysis and search for a detector with some optimality property with the confidence that the search is over the whole class of reasonable detectors. This framework provides an alternate route to conventional methods like the GLRT for arriving at decision rules and further enables the study of their optimality properties. Thus, for the unstructured covariance case studied by Kelly and others, we show that the Kelly and the AMF statistics form a maximal invariant set. Further, we show that a UMPI test does not exist and obtain an LMPI test around 0 SNR. The Kelly detector is seen to perform nearly as well, which is a good endorsement for its use. For the structured covariance case where the GLRT breaks down, we again obtain a small invariant set whose statistics can be analyzed. For this case, the UMP test does not exist. We propose several new tests and show via simulation that they are equivalent to or better than existing ones.
Abstract

With a constant need to solve scientific and engineering problems of ever growing complexity, there is a corresponding need for software tools that assist in generating solutions with minimal user involvement. Parallel computation is becoming indispensable in solving the large-scale problems that arise in science and engineering applications. Adaptivity is at the center of efficient methods for solving partial differential equations often used in such applications. Yet the use of parallel computation and adaptive techniques is limited by the high cost of developing the needed software. To overcome this difficulty, we advocate a comprehensive approach to the development of scalable architecture-independent software for adaptive solutions of partial differential equations.

Our approach is based on program decomposition, parallel computation synthesis and run-time support for adaptive computations. Parallel program decomposition is guided by the source program annotations provided by the user. A family of annotation languages has been designed for this purpose.

The synthesis of a parallel application is based on configurations that describe the overall computation and the interaction of its components. Run-time support is responsible for redistributing data and computation during program execution in response to the changing computational needs of different subregions during adaptive solution. Adaptive finite difference and finite element procedures tuned to a specific size and type of parallel architecture will be synthesized from components of decomposed source programs. In this paper, we discuss annotations and configurations suitable for parallel programs written in FORTRAN or in the functional parallel programming language called EPL.

*This work was partially supported by the National Science Foundation under grants CCR-8920694 and CDA-8805910.
1 Introduction

Several problems listed as “grand challenges” of the Federal High-Performance Computing Program [16] involve the solution of complex multi-dimensional steady and transient partial differential equations. As the mathematical models include more realistic effects, all of these problems exceed the capabilities of current computer systems. We believe, as others do, that computer performance in the needed range of teraflops can be attained only through massive parallelism. However, raw computing power alone is not sufficient to solve a complex problem. We must ensure that (i) adequate mathematical models are used, (ii) reliable numerical methods are employed to approximate these models, (iii) accurate parallel implementations of the methods are executed, (iv) results are within prescribed numerical accuracy, and (v) parallel implementations use the available computational power efficiently.

Adaptivity, with its associated error estimation and shrewd use of computation only in regions where accuracy requirements are not satisfied, provides the needed numerical reliability and efficiency. Adaptive solutions often converge at rates that are much higher than those obtained by conventional methods using a single grid. At the same time, adaptive methods are challenging from the point of view of programming complexity because they use sophisticated data structures, recursion, run-time domain and method selection, etc. Parallelism adds to this challenge because software development for parallel architectures is more complex than for sequential machines due to the increased complexity of assuring parallel program correctness and efficiency. Parallel program correctness requires the results to be independent of the number and speed of the processors. This scalability requirement can be satisfied only if the parallel tasks are independent of each other or properly synchronized when a dependence exists. Synchronization design and verification are the major source of difficulty in assessing parallel program correctness. Different categories of parallel architectures have led to a proliferation of dialects of standard computer languages. Varying parallel programming primitives for different parallel language dialects greatly limit parallel software portability. Clearly, the large efforts required to develop and implement parallel adaptive solution techniques have hampered their widespread application by scientists and engineers. In addition, poor portability of parallel programs has resulted in duplication of effort and has limited the use of developed systems.

The aims of scientific computation are to further understanding of natural phenomena by implementing and executing mathematical models when experiments would be impractical and/or to supplement experiments when direct measurements are not possible. Large-scale computation requires high performance parallel architectures and efficient program implementation to attain acceptable execution times. To facilitate scientific parallel program development, there is a need for software tools that will support efficiency as well as:

scalability - the same mathematical models and numerical algorithms are often used in computations with different accuracy and size and executed with a variable number of processors; hence, the cost of the algorithms used in the software tools should increase slowly with the increase in the number of processors used (e.g. the cost function is poly-logarithmic in the number of processors used),

reusability - basic numerical algorithms frequently appear in different models and different computations,

extensibility - interactive development and stepwise refinement of mathematical models describe an implementation of the new models in terms of changes to the old model.

Design methodology of software tools with the above properties is currently a research goal of great importance. Our approach to such design methodology is based on decomposition and scalable synthesis of parallel programs for scientific and engineering computation. The goal is to enable the users to describe high-level features of a parallel computation and to synthesize computation from numerical algorithms, program fragments, and data structures that are separately implemented. Such decomposition and synthesis can support (i) parallel task formulation and allocation, (ii) data distribution, (iii) run-time optimization, and (iv) rapid prototyping of different parallel implementations.
Figure 1: Software tools and their use

The summary view of our approach is given in Figure 1. Program components are created by annotating source programs in FORTRAN or in the functional parallel programming language EPL [14]. FORTRAN programs are transformed into an equational form before decomposition. The configuration definition guides the synthesis of the components into a parallel computation. The synthesized computation together with the architecture description is used by the code generator to produce an object code customized for the target architecture. In the future, we will add a scalable library and an associated librarian to increase versatility of the system. In Figure 1, continuous lines describe implemented paths of the system, broken lines represent paths currently under construction, and dotted lines correspond to paths at an early stage of investigation.

This paper is intended as an overview of the research done towards implementing software tools as envisioned in Figure 1. More technical discussion can be found elsewhere [4, 10, 13, 14, 15].

The paper is organized as follows. Annotations and program decomposition are discussed in Section 2. Program synthesis and the design of the configurator are presented in Section 3. A dynamic load management strategy for adaptive scientific computation on SIMD architectures is the topic of Section 4. Finally, conclusions are outlined in Section 5.

2  Annotations 

Annotations provide an efficient way of introducing user directives for assisting the compiler in parallelization. To be effective, annotations have to be carefully limited to a few constructs. They should also preserve the semantics of the original program. In our approach, annotations are introduced solely to limit the allocation of computations to processors. Hence, programs decorated with annotations produce the same results as the unannotated program. Consequently, sequential programs that have manifested their correctness over many years of usage are good candidates for parallelization through annotations. By being orthogonal to the program description, annotations support rapid prototyping of different parallel solutions.

2.1 Annotations in EPL

In  EPL,  each  equation  can  be  annotated  with  the  name  of  the  virtual  processor  on 
which  it  is  to  be  executed.  Virtual  processors  can  be  indexed  by  the  equation's 
subscripts  to  identify  instances  of  equations  assigned  to  individual  virtual  processors. 
Equation  instances  annotated  by  the  same  virtual  processor  constitute  the  smallest 
granule  of  parallel  computation.  An  example  of  the  use  of  EPL  annotations  in  a 
program  for  the  LU  decomposition  of  a  matrix  is  shown  in  Figure  2. 
int: n; /* array size */
real: Ain[V],U[V],L[V];
subscript: i,j;

range.Ain=n; range(2).Ain=n; range.U[j]=j-1; range.L[i]=i;

T[i,j]: A[k,i,j] = if k==1 then Ain[i,j]
                   else if i==Piv[k,k] then A[k-1,Piv[k,k],j]-L[i,k-1]*U[k-1,j]
                   else A[k-1,i,j]-L[i,k-1]*U[k-1,j];

D[j]: L[j,k] = if j==k then 1
               else A[k,j,k]/U[k,k];

D[j]: U[k,j] = A[k,Piv[k,k],j];

D[i]: Piv[k,i] = submax(abs(A[k,i,k]),i:i>=k);


Figure  2:  LU  decomposition  of  a  matrix  A  in  EPL 

2.2  Annotations  in  FORTRAN 

As in EPL, the notion of a virtual processor has been introduced in annotations of FORTRAN programs. FORTRAN annotations define blocks of statements associated with a virtual processor, each virtual processor defining a parallel task. Such tasks may include synchronization statements, if they encompass disjoint blocks. FORTRAN virtual processors can have subscripts associated with them to indicate repetition. An example of an annotated FORTRAN segment for the LU decomposition of a matrix is shown in Figure 3. The scope of the block extends from the point of definition in the program to the statement labeled 10. In this example, a vector of virtual processors main, each associated with a single loop body, is defined. Blocks can also be nested in each other. Such nesting defines a hierarchy of blocks and helps in global program optimization.
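To make the block structure of the FORTRAN example easier to follow, here is a sequential Python sketch of the same elimination, with comments naming the virtual processor each loop would belong to. This is an illustration, not output of the annotation processor. One deliberate difference from Figure 3 is assumed here: whole rows are swapped, rather than only columns K to N, so that the identity PA = LU can be checked directly. The helper `residual` is hypothetical.

```python
def lu_partial_pivot(A):
    """In-place LU factorization with partial pivoting; returns pivot rows."""
    n = len(A)
    ipiv = list(range(n))
    for k in range(n - 1):                        # block "main"
        for l in range(k + 1, n):                 # block "pivot"
            if abs(A[ipiv[k]][k]) < abs(A[l][k]):
                ipiv[k] = l
        A[k], A[ipiv[k]] = A[ipiv[k]], A[k]       # block "swap" (whole rows here)
        for l in range(k + 1, n):                 # block "lower"
            A[l][k] /= A[k][k]
        for l in range(k + 1, n):                 # block "up.update"
            for m in range(k + 1, n):
                A[m][l] -= A[m][k] * A[k][l]
    return ipiv


def residual(A0):
    """Largest entry of |P*A0 - L*U| for the factorization of a copy of A0."""
    n = len(A0)
    A = [row[:] for row in A0]
    ipiv = lu_partial_pivot(A)
    PA = [row[:] for row in A0]
    for k in range(n - 1):                        # replay the row interchanges
        PA[k], PA[ipiv[k]] = PA[ipiv[k]], PA[k]
    worst = 0.0
    for i in range(n):
        for j in range(n):
            # L is unit lower triangular, U is upper triangular, both in A.
            lu = sum((1.0 if t == i else A[i][t]) * A[t][j]
                     for t in range(min(i, j) + 1))
            worst = max(worst, abs(PA[i][j] - lu))
    return worst


A0 = [[2.0, 1.0, 1.0], [4.0, 3.0, 3.0], [8.0, 7.0, 9.0]]
print(lu_partial_pivot([row[:] for row in A0]), residual(A0))
```

The iterations of the inner loops are independent of one another for a fixed k, which is what allows each of them to be assigned to a separate virtual processor.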

Each virtual processor produces data, typically used by other virtual processors, and in turn consumes data produced by others. Performing data-dependence analysis in the style of PTRAN [12], the annotation processor can find the dependencies local to each block and the data structures produced and consumed by the block. All data produced by the block are placed in the memory of the corresponding virtual processor. The created parallel tasks are extended by communication statements needed to move data. Parallel tasks associated with virtual processors at the bottom of the block hierarchy are the smallest components used in the program synthesis. An important step towards an efficient parallelization of FORTRAN programs involves an equational transformation during which the equational equivalent of the program is generated. The transformed programs obey the single assignment rule and do not
      PARAMETER (N = 50)
      REAL A(N,N), TEMP
      INTEGER IPIV(N)

      DO :: main 10 K = 1, N-1
         IPIV(K) = K
         DO :: pivot 20 L = K+1, N
   20       IF (ABS(A(IPIV(K), K)) .LT. ABS(A(L, K))) IPIV(K) = L
         DO :: swap 30 L = K, N
            TEMP = A(K, L)
            A(K, L) = A(IPIV(K), L)
   30       A(IPIV(K), L) = TEMP
         DO :: lower 40 L = K+1, N
   40       A(L, K) = A(L, K) / A(K, K)
         DO :: up.update 10 L = K+1, N
            DO 10 M = K+1, N
   10          A(M, L) = A(M, L) - A(M, K) * A(K, L)
      IPIV(N) = N
      STOP
      END


Figure  3:  LU  Decomposition  of  a  matrix  A  in  FORTRAN 

contain  any  control  statements  [5].  The  transformation  is  done  in  the  following  steps: 

Reassignments  Elimination:  The  reassigned  variables  are  replaced  by: 

•  vector  (additional  dimension)  -  inside  loops, 

•  variants  -  in  “if”  branches  and  basic  blocks. 

Condition  Analysis:  Conditions  in  the  transformed  program  are  analyzed  using  a 
Sup-Inf  inequality  prover  [4]  and  the  Kaufl  variable  elimination  method  [8]  to 
find  pairwise  equivalent  or  exclusive  conditions. 

Variable’s Variants Elimination: Variable variants created in equivalent and
exclusive conditions are merged into a single variable.

Additional Dimension Elimination: Memory optimization is performed to replace
entire dimensions by windows of a few elements for multidimensional variables
[15].
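The first and last of these steps can be illustrated on a very small loop. The sketch
below is written in Python rather than FORTRAN or EPL, and its function names are
hypothetical; it only shows how a reassigned scalar gains a dimension inside a loop and
how that dimension can later shrink to a window of two elements, under the assumption
that each iteration reads only the value from the previous iteration.

```python
def running_sum_reassigned(a):
    """Original form: the scalar s is reassigned on every iteration."""
    s = 0.0
    for x in a:
        s = s + x
    return s


def running_sum_single_assignment(a):
    """Reassignment elimination: s gains a dimension indexed by the loop,
    so every element s[i] is assigned exactly once."""
    n = len(a)
    s = [None] * (n + 1)
    s[0] = 0.0
    for i in range(1, n + 1):
        s[i] = s[i - 1] + a[i - 1]
    return s[n]


def running_sum_windowed(a):
    """Additional-dimension elimination: only s[i-1] is ever read, so the
    full dimension can be replaced by a window of two elements."""
    window = [0.0, 0.0]
    for i in range(1, len(a) + 1):
        window[i % 2] = window[(i - 1) % 2] + a[i - 1]
    return window[len(a) % 2]


data = [1.0, 2.0, 3.5]
results = (
    running_sum_reassigned(data),
    running_sum_single_assignment(data),
    running_sum_windowed(data),
)
```

All three functions compute the same value; the second uses storage proportional to
the loop length, while the third uses constant storage again.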

The transformed FORTRAN program is then compatible with the programs
produced by annotating EPL programs.

2.3 Annotation Processing

Annotation  processing  includes: 

•  creating  parallel  tasks  defined  by  annotated  fragments  of  an  original  program, 

•  declaring  ports  needed  to  interconnect  created  tasks  into  a  network, 

• building a task communication graph that shows data dependences between
created tasks.

To translate the annotated program into an efficient collection of parallel tasks, it
is necessary to embed a spanning tree into the task communication graph [11]. The
following three criteria are used in selecting such an embedding:

• Dimension nesting: If two tasks with different dimensionalities are connected
in the task communication graph, the task with more dimensions should be
located lower in the spanning tree. If, for example, tasks T[i][j] were located
above the tasks D[j] in the spanning tree, the addressing and creation of child
tasks in T would involve executing an if-then statement in all i * j tasks of T.

•  Range  nesting:  Whenever  possible,  tasks  sharing  the  same  range  should  be 
clustered  together  in  the  spanning  tree.  Variables  that  share  ranges  tend  to 
appear  in  the  same  equations.  Thus,  clustering  such  variables  together  decreases 
the  number  of  cross-process  references  to  distributed  variables. 

•  Data  flow:  The  total  communication  cost  of  the  selected  spanning  tree  should 
be  the  smallest  among  all  spanning  trees  satisfying  the  above  two  criteria. 

Let $G(V, E)$ be a task communication graph with a set of nodes $V$ (representing
processors) and a set of edges $E \subseteq V \times V$ representing communication. With each
edge $e_{i,j} \in E$ we associate the cost $c(e_{i,j})$ that represents the volume of data being
sent from processor $i$ to processor $j$. With each spanning tree $T$, we also
associate the distance $d_T(e_{i,j})$ that defines the minimum number of tree edges that
have to be traversed on the path from task $i$ to task $j$. The cost of the spanning tree
$T$ can then be defined as:

$$C(T) = \sum_{e_{i,j} \in E} c(e_{i,j}) \, d_T(e_{i,j})$$

To minimize the total communication cost we need to find a proper cut-tree, which
can be done by solving $|V|$ maximal flow problems. Each maximal flow problem
requires $O(|V|^3)$ applications of the Ford-Fulkerson labeling procedure. Hence, finding
the solution takes $O(|V|^4)$ steps.
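To make the cost function concrete, the following Python sketch evaluates C(T) for a
given candidate spanning tree. It does not construct the cut-tree itself; it only
compares trees that are supplied by hand. The data layout (a dictionary of directed
edge volumes and a list of undirected tree edges) and the function names are
assumptions made for illustration.

```python
from collections import deque


def tree_distances(num_nodes, tree_edges, source):
    """Breadth-first search: number of tree edges from source to each node."""
    adjacency = {v: [] for v in range(num_nodes)}
    for u, v in tree_edges:
        adjacency[u].append(v)
        adjacency[v].append(u)
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adjacency[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def spanning_tree_cost(num_nodes, comm_cost, tree_edges):
    """C(T): sum over graph edges (i, j) of volume c(e_ij) times the
    tree distance d_T(e_ij) between tasks i and j."""
    total = 0
    for i in range(num_nodes):
        dist = tree_distances(num_nodes, tree_edges, i)
        for (src, dst), volume in comm_cost.items():
            if src == i:
                total += volume * dist[dst]
    return total


# Four tasks; comm_cost[(i, j)] is the data volume sent from task i to task j.
comm_cost = {(0, 1): 5, (1, 2): 1, (0, 3): 2, (2, 3): 4}
chain_tree = [(0, 1), (1, 2), (2, 3)]
star_like_tree = [(0, 1), (0, 3), (3, 2)]
chain_cost = spanning_tree_cost(4, comm_cost, chain_tree)
star_like_cost = spanning_tree_cost(4, comm_cost, star_like_tree)
```

In this small example the second tree is cheaper because it keeps the heavier
communication between tasks 2 and 3 on a single tree edge without stretching the
path between tasks 0 and 3.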

Trees created from annotations of LU decomposition programs are shown in Figure
4 (for EPL and FORTRAN programs).

Figure 4: Communication tree for EPL and FORTRAN programs

3  Program  Synthesis 

In our approach a parallel computation is viewed as a collection of cooperating
components. The components are defined during the program decomposition. Their
cooperation requires an additional description, called a configuration. The
configuration guides the process of synthesis. For example, components of the
configuration that communicate frequently can be synthesized into a single task. The
ratio of physical processors to virtual processors dictates how virtual tasks are to be
mapped onto the target architecture. Usually, different annotations result in different
configurations and, hence, cause different code to be generated. The user can,
therefore, experiment with various annotations to find the one that results in the most
efficient code. The configurator uses the dependence graph created during
configuration analysis to generate an architecture-independent parallel description,
which is fed to the code generator.

Configurations define tasks (and their aggregates) and ports. Statements of the
configuration represent relations between ports in different tasks. Some of these
statements are generated during decomposition (at the subprogram level); others can
be supplied by the user (when the programs are integrated into a computation).

Tasks  created  dynamically  can  communicate  with  ports  located  at  parent,  child, 
and  sibling  tasks  (each  of  those  tasks  is  just  a  copy  of  the  same  program  or  program 
fragment,  except  that  a  parent  task  can  be  arbitrary). 

The goal of configuration processing is to establish scheduling constraints for the
overall computation. In a parallel computation, individual process correctness is a
necessary but not sufficient condition for the correctness of the entire computation. If
a task has input/output ports that belong to a cycle in the configuration graph, then
this task's input messages are dependent on its output messages. Such dependences
(in addition to dependences imposed by the statements of a task) have to be taken
into account in generating the object program for individual tasks; otherwise, loss of
messages, process blocking, or even a deadlock can arise.
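As a rough illustration of the condition described above, the Python sketch below
marks the tasks that lie on a directed cycle of a configuration graph. It is not the
algorithm of [13]; it assumes a simplified graph in which each task is a node and an
edge means that some output port of one task feeds an input port of another. The
function name and the graph layout are hypothetical.

```python
def tasks_on_cycles(config_graph):
    """Return the set of tasks that lie on at least one directed cycle.

    config_graph maps each task to the list of tasks that receive its output.
    """
    def reaches(start, target):
        # Iterative depth-first search from start, looking for target.
        stack, seen = [start], set()
        while stack:
            node = stack.pop()
            if node == target:
                return True
            if node in seen:
                continue
            seen.add(node)
            stack.extend(config_graph.get(node, []))
        return False

    return {
        task
        for task, successors in config_graph.items()
        if any(reaches(nxt, task) for nxt in successors)
    }


# A feeds B, B and C exchange messages, and C also feeds D.
config_graph = {"A": ["B"], "B": ["C"], "C": ["B", "D"], "D": []}
cyclic_tasks = tasks_on_cycles(config_graph)
```

Here only B and C have inputs that depend on their own outputs, so only those two
tasks would need the additional externally imposed dependences.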

The algorithm for finding external data dependences has been presented in [13]. It
produces a configuration dependence file used by the synthesizer and the code
generator. This file contains a list of the additional, externally imposed data
dependences (edges and their dimension types) that need to be added to the task array
graph. One task may have several such files, each associated with a different
configuration in which this task participates.

4  Run-Time  Task  Distribution 

One of the most challenging problems encountered while implementing adaptive
scientific computations on distributed memory machines is the run-time mapping of a
dynamically changing computational load onto the parallel processors. The published
solutions to this problem focus mostly on MIMD architectures and coarse-grain
parallelism [3]. Recently the following Rectilinear Partitioning Problem (RPP) has been
considered in [9]: Partition the given $n \times m$ workload matrix into $(N + 1) \times (M + 1)$
rectangles with $N + M$ rectilinear cuts in such a way that the maximum workload
among rectangles is minimized. Such optimization is appropriate for adaptive finite
element computations on architectures with local communication that is faster than
global. Since balanced partitions tend to increase the volume of local vs. global
communication, a solution to RPP decreases the overall communication costs.
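The RPP objective can be stated in a few lines of code. The Python sketch below only
evaluates the maximum rectangle workload for a given set of cuts; it does not search
for the optimal cuts, and the function name and the convention that a cut is the index
at which a new band starts are assumptions made for illustration.

```python
def rectilinear_max_load(workload, row_cuts, col_cuts):
    """Maximum rectangle workload for a rectilinear partition.

    workload : n x m list of lists of non-negative numbers
    row_cuts : sorted row indices at which a new band of rows starts
    col_cuts : sorted column indices at which a new band of columns starts
    N row cuts and M column cuts give (N + 1) x (M + 1) rectangles.
    """
    n, m = len(workload), len(workload[0])
    row_bounds = [0] + list(row_cuts) + [n]
    col_bounds = [0] + list(col_cuts) + [m]
    worst = 0
    for r0, r1 in zip(row_bounds, row_bounds[1:]):
        for c0, c1 in zip(col_bounds, col_bounds[1:]):
            load = sum(sum(row[c0:c1]) for row in workload[r0:r1])
            worst = max(worst, load)
    return worst


workload = [
    [1, 1, 1, 1],
    [1, 9, 9, 1],
    [1, 9, 9, 1],
    [1, 1, 1, 1],
]
balanced = rectilinear_max_load(workload, row_cuts=[2], col_cuts=[2])
unbalanced = rectilinear_max_load(workload, row_cuts=[1], col_cuts=[1])
```

With one cut in each direction, cutting through the middle of the heavy block gives
four rectangles of equal load, whereas cutting along its edge leaves almost all of the
work in a single rectangle.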

In [10] we investigated adaptive scientific computations on SIMD machines, a
problem with similar motivation and applications to RPP [9]. Unlike RPP, however,
in which the sum of the weights is taken as the cost of a rectangle, we measure the
cost of a rectangle as the ratio of its workload to its area, where the area represents
the number of processors active in that rectangle. Our approach is motivated by the
mesh refinement techniques of the considered adaptive methods and the newly
introduced coordinated parallelism on the CM-5 computer. In coordinated parallelism a
machine can be partitioned into several parts, each running SIMD code. The workload
redistribution results in regions that have different time-step and/or grid size;
therefore, the same computation is nested in loops with different boundaries. That
means that each region either has to be done on the whole machine (sequentially, one
after the other on the CM-2) or in a separate partition (in parallel on the CM-5).
Each entry in the workload matrix represents the error in the solution obtained by an
error estimation procedure [2]. The high-error regions need recomputing to an extent
that is proportional to the magnitude of the error. Hence, the number of processors
reassigned to each solution region should be proportional to the refinement factor.

Consider a load balancing problem as illustrated in Figure 5 for a one-dimensional
problem. The uniform mesh yields a solution with a high error in the interval
$b < x < c$ and within the required accuracy in the intervals $a < x < b$ and $c < x < d$.
Taking the magnitude of the error as an estimate of the work $w_i$ for each element
$i = 1, \ldots, n$, we assign a small weight $\epsilon \ll \max_i \{w_i\}$ to the work estimate in the
regions $a < x < b$ and $c < x < d$. To balance the workload, the majority of the
processors should be assigned to the interval $b < x < c$.

Figure 5: Example of partitioning in one-dimension
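One simple way to turn such work estimates into processor counts is a proportional
split. The Python sketch below is only an illustration of that idea and is not taken
from [10]: it assumes strictly positive work estimates, gives every region at least one
processor, and distributes the rest by the largest-remainder rule. The function name
and the sample numbers are hypothetical.

```python
def allocate_processors(region_work, num_processors):
    """Split num_processors among regions in proportion to their work.

    Every region receives at least one processor; the remaining processors
    are shared out proportionally, with leftovers going to the regions that
    have the largest fractional remainders.
    """
    k = len(region_work)
    if num_processors < k:
        raise ValueError("need at least one processor per region")
    total = sum(region_work)
    spare = num_processors - k  # processors left after one per region
    shares = [spare * w / total for w in region_work]
    counts = [1 + int(s) for s in shares]
    leftover = num_processors - sum(counts)
    by_remainder = sorted(
        range(k), key=lambda r: shares[r] - int(shares[r]), reverse=True
    )
    for r in by_remainder[:leftover]:
        counts[r] += 1
    return counts


# Work estimates for the intervals (a, b), (b, c), (c, d) on 16 processors.
counts = allocate_processors([1, 30, 1], 16)
```

As expected, nearly all processors go to the high-error middle interval, while the
two accurate intervals keep a minimal share.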

In adaptive solutions of partial differential equations parallel tasks perform
basically the same computation over different spatial subdomains (intervals for
one-dimensional problems) and with a different discretization parameter $\Delta x$. Let $K$
denote the number of such tasks. It is important to keep this number small for the
following reasons. The subdomain interactions are proportional to the number of
existing subdomains, and in higher dimensions such interactions require
time-consuming global communications. In each time step of the subdomain
computation, a fraction of the executed code is subdomain specific (e.g., in hyperbolic
equations the time step has to be set differently in each subdomain). For purely SIMD
machines, execution of this code fraction has to be done in $K$ consecutive stages. In
each stage, processors in one subdomain are executing while processors belonging to
the remaining $K - 1$ subdomains remain idle¹. Therefore, each subdomain associated
with a parallel task should represent a localized structure in the solution domain.

Figure  6(a)  shows  an  example  of  the  more  difficult  two-dimensional  case  in  which 
a  coarse  mesh  is  trivially  mapped  to  the  processor  mesh.  In  regions  A  and  B,  the 
mesh  must  be  refined  due  to  the  presence  of  high  errors.  Hence,  we  have  to  spread 
sub-domains  A  and  B  over  bigger  rectangular  sub-sets  of  processors  to  improve  load 
balancing  as  in  Figures  6(b)  and  (c). 

If we are employing mesh-movement or static rezone techniques, the mesh elements
are moved into high-error regions. A global solution strategy will refine the
high-error regions and repeat the entire step of the iteration. Consequently, we will
need a re-assignment of processors. A local solution strategy, on the other hand,
repeats the iteration only where it is needed. Hence, local refinement results in less
direct computation and enables more processors to be assigned to regions A and B.
However, local refinement requires more interactions between the local and global
solutions. Such interactions involve global communication that can outweigh the
benefits of an adaptive procedure. Global solutions and mesh-movement techniques
require fewer interactions of this kind. Careful buffering of the high-error regions can
increase the number of iterations executed before regridding or mesh movement is
needed. This will in turn decrease the frequency of the needed load balancing. It is
these global mesh-refinement and mesh-movement techniques, executed on
mesh-connected architectures, that motivated us to develop density-type partitioning.

¹ For more general architectures capable of the coordinated parallelism mode of
execution (e.g., CM-5), all K subdomains will be able to execute this fraction of code
in parallel.

Figure 6: (a) Coarse mesh with high-error regions A and B. (b) Repartitioning with
global refinement. (c) Repartitioning with local refinement. (d) Nicol's partitioning.

It should be noted that applying Nicol's [9] partitioning methodology for RPP to the
example shown in Figure 6(d) results in assigning unnecessary processors to regions
C and D. To avoid such waste, we did not restrict our partitioning methodology to
rectilinear cuts extending across the whole domain in both dimensions. Instead, in
our problem definition and solution [10], we require that $K$ selected rectangles cover
the whole domain. The heuristic for the two-dimensional case projects the weights
to one dimension and results in rectilinear cuts extending across the whole dimension
in one direction. Figure 6(b) shows an example of this kind of partition.

Let $P_K$ be the set of partitions of a one-dimensional workload array $w_i$, $i = 1, \ldots, n$,
into $K$ subintervals $(x_{1k}, x_{2k})$, where $1 \le x_{1k} \le x_{2k} \le n$, $k = 1, \ldots, K$. The
one-dimensional workload partitioning problem can then be stated as:


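$$\bigoplus \left\{ \bigotimes_{k=1}^{K} \frac{\sum_{i=x_{1k}}^{x_{2k}} w_i}{f(x_{1k}, x_{2k})} \;:\; (x_{1k}, x_{2k}),\ k = 1, \ldots, K \in P_K \right\} \qquad (1)$$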


As shown in Table 1, by selecting different meanings for the operations $\oplus$ and $\otimes$ we
can obtain different optimization problems from this formulation. For $\oplus = \min$,
$\otimes = \max$ and $f(x_{1k}, x_{2k}) = 1$ we obtain Nicol's problem, which has solutions of
complexity $O(Kn)$ and $O(n + (K \log n)^2)$ [9].

Table 1: Instances of the problem represented by equation (1)

The problem involving load balancing for adaptive PDE solvers discussed in this
section is obtained for $\oplus = \min$, $\otimes = \max$ and $f(x_{1k}, x_{2k}) = x_{2k} - x_{1k} + 1$, i.e.,
we divide the sum of the workloads in each partition by the interval length (i.e., the
number of processors). There is a similarity between the weighted independent set
problem for interval graphs and our problem [7]. The interval graph for our problem
can be created by having a node represent each of the possible subintervals
$(x_{1k}, x_{2k})$ with the weight $\sum_{i=x_{1k}}^{x_{2k}} w_i / f(x_{1k}, x_{2k})$ and edges represent the
intersections between the subintervals. In such a graph, the independent set of size $K$
which covers the whole interval $1, \ldots, n$ gives the solution to the original problem.
We convert that interval graph to a directed acyclic graph (DAG) and apply the
shortest path algorithm to find the minimum weight dominating set [10]. This approach
results in an optimal algorithm for the one-dimensional case and also leads to a
heuristic algorithm that can be easily generalized to two dimensions (by projecting
the workloads to one dimension).
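The min-max instances of the one-dimensional problem can be explored with a plain
dynamic program. The Python sketch below is neither the DAG shortest-path algorithm
of [10] nor one of the faster algorithms cited from [9]; it is a straightforward
O(K n^2) recurrence, written under the assumption that the objective is the minimum,
over partitions into K contiguous intervals, of the maximum interval cost, where the
cost is the interval workload divided by f. The function name and the sample data are
hypothetical.

```python
def min_max_partition(w, K, f):
    """Minimize, over partitions of w into K contiguous intervals, the
    maximum interval cost sum(w[lo..hi]) / f(lo, hi), with 1-based,
    inclusive interval bounds. Requires 1 <= K <= len(w).

    Returns (best_cost, intervals).
    """
    n = len(w)
    prefix = [0]
    for x in w:
        prefix.append(prefix[-1] + x)

    def cost(lo, hi):
        return (prefix[hi] - prefix[lo - 1]) / f(lo, hi)

    inf = float("inf")
    # best[k][j]: optimal value for covering elements 1..j with k intervals
    best = [[inf] * (n + 1) for _ in range(K + 1)]
    cut = [[0] * (n + 1) for _ in range(K + 1)]
    best[0][0] = 0.0
    for k in range(1, K + 1):
        for j in range(k, n + 1):
            for i in range(k, j + 1):  # the last interval is i..j
                value = max(best[k - 1][i - 1], cost(i, j))
                if value < best[k][j]:
                    best[k][j], cut[k][j] = value, i
    # Walk the recorded cut positions backwards to recover the intervals.
    intervals, j = [], n
    for k in range(K, 0, -1):
        i = cut[k][j]
        intervals.append((i, j))
        j = i - 1
    return best[K][n], intervals[::-1]


work = [1, 1, 10, 10, 1, 1]
# f = 1: cost of an interval is its total workload.
sum_cost, sum_intervals = min_max_partition(work, 3, lambda lo, hi: 1)
# f = interval length: cost of an interval is workload per processor.
density_cost, density_intervals = min_max_partition(
    work, 3, lambda lo, hi: hi - lo + 1
)
```

Passing a different f changes only the cost function, which mirrors the way the
formulation above obtains different problems from the same template.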

5  Conclusion 

Our approach is based on the following presumptions:

• Adaptivity is at the center of efficient methods for solving partial differential
equations.

• Annotations provide an easy and efficient way to parallelize existing
codes.

• Absence of control statements simplifies program analysis and increases the
compiler's ability to produce efficient parallel code.

•  Most  parallel  code  optimization  problems  are  NP-hard;  hence,  development  of 
proper  heuristics  is  important. 

•  A  hierarchical  view  of  parallel  computation  is  helpful  in  extracting  functional 
parallelism. 

Program decomposition through annotations and computation synthesis through
configuration can support efficient parallel code generation for domain-specific
computation. Adaptivity, with its associated error estimates and shrewd use of
computation only in regions where accuracy requirements are not satisfied, can provide
the needed numerical reliability and efficiency to parallel computation. Massive
parallelism combined with adaptivity offers a promise of true breakthroughs that will
allow scientists and engineers to solve the most demanding problems with available
resources.

Our research on scalable program synthesis is in its early stages and many issues
remain unexplored. Future work on program synthesis should include more work
on run-time code optimization. Large applications will measure the efficiency of the
generated solutions.


Automated Interpretation of Topographic Maps


T.  Cronin 

CECOM  Signals  Warfare  Directorate 
Warrenton  VA  22186-5100 


Abstract: Some new results which impact favorably upon the issue of automated topographic map interpretation
are presented. Actually, the "new" results consist of heretofore undiscovered applications of two concepts known to
mathematicians and computer scientists for many years: binary search, and the normal vector. Binary search is
extended by the technique not only from one to two dimensions, but arguably to three dimensions, since
topographical maps are two-dimensional representations of three-dimensional surfaces. Such maps are essentially
sorted hierarchies of nested contours, which form a multiply-connected subdivision of the plane. The perimeter of a
subdivision element is defined by a set of contours of extremal elevation and the edges of the map; a naming
convention attaches a label to each element of the planar subdivision. Whenever one is afforded the luxury of dealing
with a sorted data structure, one may invoke the power of binary search to achieve O[log n] time complexity during
processing of a topographical query, where n is the number of contours which comprise a specific element of the
planar subdivision. A topographical query is a request by a user to interpret the position of an arbitrary map
coordinate, called the query point, in the context of a topographic map background. An "interpretation" as currently
defined consists of a five-tuple of information: the label of the map subdivision element within which the query
point resides; the topographical contour of the subdivision element within which the point minimally resides; the
local slope at the point; the local elevation at the point; and the flank of the partition element (hillside) upon which
the point is situated. As an example, the following list is an interpretation of a query point: (Mt. Hood subdivision,
contour #10600-d, 65 degree gradient, 10655 feet elevation, 350 deg NW). As is the case with one-dimensional
binary search, the two-dimensional version must concern itself with items having identical keys. Because
topographical maps may contain multiple contours lying at the same elevation, the topographical query process
must have a mechanism for choosing among them before proceeding. Thus, when "halving the interval", one must
check to ensure that the interval is in fact uniquely defined. Inclusion testing within contours is achieved with a
deterministic point-in-polygon algorithm developed by the Army in previous research. The traditional normal vector
is utilized extensively by the point-in-polygon algorithm, and also by the processing components which interpolate
slope and elevation, and determine hillside emplacement. A new theorem derived from the law of cosines provides a
decision rule based on integer arithmetic to decide which segments of a polygonal boundary justify a computation of
the normal vector. As a byproduct of the research, an algorithm based on the Cevian formula has been developed to
find the nearest segment to a query point, without using any floating point operations whatsoever. Future research
issues include the topic of visual line-of-sight (binary map coloring) from a query point, and an analysis of the space
and time complexity of the new technique, vs. the more burdensome alternative of storing elevation and slope data in
large raster archives.


I.  BACKGROUND  AND  TERMINOLOGY 


Topographic  Line  Maps. 

A topographic line map (TLM), also known as a contour map, is a vector representation of the boundaries
of cross sections of the earth's surface. The projection of the boundary of a cross section of the earth's surface onto
the xy-plane is called a topographic contour. The equispacing between cross sections along the z axis is called the
contour interval of the map. A contour map is bounded by the rectangular edges of a map sheet. The edges of a map
form what is called a clipping region, which prevents an observer from knowing the behavior of portions of contours
which pass outside the rectangular perimeter of the map.

Heuristics to guide map understanding soon become apparent to a novice: e.g., when contours are closely
packed together in the xy-plane, the underlying terrain is steep; when far apart, the terrain is flat, or of gentle
gradient. As a general rule, contours do not cross, although there are rare exceptions such as natural bridges,
overhangs, or bizarre sandstone formations like those found in Utah. For those well-versed in their interpretation,
topographic maps can provide a realistic portrayal of actual terrain. However, many human beings find topographic
maps difficult to interpret and have problems visualizing terrain from contour data. It is for this reason that the
Army Topographic Engineering Center has opted to utilize perspective displays as an alternative to topographic
maps [T1]. Perspective displays render an artistic version of a terrain as it appears from some vantage point near or
upon the ground. Figure 1 is a graphic borrowed from reference [K4], and portrays a perspective display of a terrain,
together with its corresponding topographic contour representation.


Figure 1. A perspective display and its corresponding topographic line map.


Interpreting  a  Topographic  Line  Map. 

The objective of automated topographic map interpretation is to write a computer algorithm which locally
describes a random query point in the context of a contour map. An interpretation as currently defined consists of
five pieces of information: the label (if it exists) of the subdivision within which the query point resides; the name
of the topographic contour which brackets the query point from outside; the elevation above or below sea level at the
query point; the slope of the terrain at the query point; and the directional gradient at the query point. As an
example, the following list constitutes an interpretation of a query point: (Mt. Hood subdivision, contour #10600-d,
65 degree gradient, 10655 feet elevation, 350 deg NW).


The five pieces of information currently sought by the algorithm hardly constitute a complete interpretation
of a query point. Other descriptors are desirable in the long term: e.g., the visual line-of-sight from the query point,
the profile of a traversable path passing through the query point, the profile of a path which optimally avoids the
query point, the feasibility of using the query point as a site for sensor placement, etc. However, the five primitive
data currently being returned by the algorithm go far toward providing inputs to some of the higher level queries,
which may be synthetically constructed from the primitive queries.


Contour  Notation. 

In this section, notation is adopted to facilitate reasoning with topographic contours. Associated with every
contour C is a specific value, denoted El(C), which represents the contour's elevation above or below sea level. The
elevation is a multiple of k, where k is the fixed contour interval of the topographic map. On a particular map, there
may be several distinct contours with the same elevation value; for spatial reasoning applications it is important to
differentiate among them. Each contour with an elevation value above sea level is contained within another contour,
and may itself contain contours.

If contour C1 is contained within contour C2, then C1 is said to be nested within C2. A contour cannot be
contained within two or more contours which are not nested, but it can contain multiple contours which are not
nested. If C is a contour of interest, then we denote the contour which minimally contains C to be C*. By
minimally contained, it is meant that any other contour D other than C* which contains C also contains C*, which
implies that both C* and C are nested within D. A contour minimally contained by C is called C+, where the set of
all such contours is denoted {C+}. If a query point lies between two elements of {C+}, then it is said to be a saddle
point. Note that {C+} may be the null set. A graphic illustrating these concepts is shown below.


A query point and its bracketing contours.


A query point is defined to be any random point of interest. Except for special cases, a query point is
bracketed by adjacent contours of a map: one contour which encloses it, called the outer bracket, and another contour
which does not enclose it, called the inner bracket. Since contours are well-ordered at equispaced elevations, the
difference in elevation between bracketing contours is equal to plus or minus the fixed contour interval of the map
(except for a zero difference at saddle or culvert points). The figure below illustrates the brackets of a query point.
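The nesting notation and the outer bracket can be sketched in a few lines of Python.
This is an illustration only: contours are assumed to be simple polygons that do not
cross, the even-odd ray-casting test is a stand-in for the point-in-polygon algorithm
referred to in the abstract, and all function names and sample contours are
hypothetical. The search here is a linear scan over all contours.

```python
def point_in_polygon(point, polygon):
    """Even-odd ray casting; points exactly on an edge are not treated specially."""
    x, y = point
    inside = False
    for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1]):
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def polygon_area(polygon):
    """Absolute area by the shoelace formula."""
    twice = sum(
        x1 * y2 - x2 * y1
        for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1])
    )
    return abs(twice) / 2


def smallest_enclosing(point, contours, exclude=None):
    """Name of the smallest-area contour containing point, or None."""
    enclosing = [
        name
        for name, poly in contours.items()
        if name != exclude and point_in_polygon(point, poly)
    ]
    return min(enclosing, key=lambda n: polygon_area(contours[n]), default=None)


def minimal_containers(contours):
    """Map each contour C to C*, the contour that minimally contains it.
    Since contours do not cross, one vertex of C decides containment."""
    return {
        name: smallest_enclosing(poly[0], contours, exclude=name)
        for name, poly in contours.items()
    }


def minimally_contained(parent):
    """Invert the C* map to obtain the set {C+} for every contour."""
    children = {name: [] for name in parent}
    for name, container in parent.items():
        if container is not None:
            children[container].append(name)
    return children


def square(cx, cy, half):
    return [(cx - half, cy - half), (cx + half, cy - half),
            (cx + half, cy + half), (cx - half, cy + half)]


contours = {
    "20": square(0, 0, 10),
    "40a": square(-5, 0, 3),
    "40b": square(5, 0, 3),
    "60": square(-5, 0, 1),
}
parent = minimal_containers(contours)
children = minimally_contained(parent)
outer = smallest_enclosing((5, 0.5), contours)
```

For the sample data, the 20-unit contour minimally contains both 40-unit contours,
and the outer bracket of the query point (5, 0.5) is the contour named "40b".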


II.  TWO-DIMENSIONAL  BINARY  SEARCH  APPLIED  TO  TOPOGRAPHIC  MAPS. 


Extending  binary  search  to  two  dimensions  to  interpret  topographic  maps. 

Binary search has traditionally been applied to a one-dimensional data structure, sorted by some user-defined
ordering property. The data structure might be an array of numbers sorted by the natural ordering of the reals, or a
list of employee records sorted alphabetically by name. One commonly utilized data structure is the 2D tree, in which
the data consists of a set of ordered pairs of integers. In a 2D tree, the data is sorted on two keys (the abscissa and
the ordinate), with one key primary. A 2D tree is not a true instance of a two-dimensional binary search data
structure, because one key is predominant over another during the sorting process. A better candidate is outlined in
[K2], in which an interior point method for linear programming "halves" an ellipse during point-in-polygon testing.
However, to be truly elegant, two-dimensional binary search should avail itself of the natural containment property
inherent to two dimensions. In the digital domain of the computer, two-dimensional objects are in general
polygons. Just as the one-dimensional version must check to see if a point lies between two other points, the
two-dimensional version is required to decide if a polygon is contained "between" two other polygons [C1].
Betweenness is equivalent to bracketing a query point with nested polygons.

Topographical contours exhibit a natural ordering due to the way in which the forces of nature have
combined to stabilize the crust of the earth. For example, gravity has assured that the top portion of a mountain has
a smaller cross section than its base. Thus, when projected onto a plane, contours from the same mountain appear
to be nested. Ordering by elevation, and nesting by containment are properties which may be exploited to sort
contours. The data structure which results by appealing to a two-dimensional sort on elevation and nesting is called
the contour containment graph. The motivation is that to exploit the O[log n] query power of binary search, one
requires that the underlying data structure be sorted. We will see below that there are two preprocessing steps
required to set up an efficient two-dimensional search of topographic maps: the first is the construction of the
contour containment graph, and the second is the partitioning of the containment graph into regions suitably indexed
for binary search.
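The reason sorting pays off can be seen on a single chain of nested contours. In the
Python sketch below, which is an illustration under stated assumptions rather than
the paper's implementation, the chain is ordered from the outermost contour to the
innermost one, so containment of the query point is monotone along the chain and the
outer bracket can be found with O[log n] inclusion tests. The ray-casting test again
stands in for the paper's point-in-polygon algorithm, and the function names and the
square contours are hypothetical.

```python
def point_in_polygon(point, polygon):
    """Even-odd ray casting; points exactly on an edge are not treated specially."""
    x, y = point
    inside = False
    for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1]):
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def outer_bracket_index(point, nested_chain):
    """Binary search over contours ordered from outermost to innermost.

    Each contour lies inside its predecessor, so once a contour excludes
    the point, every later contour excludes it as well. Returns the index
    of the innermost contour containing the point, or -1 if none does.
    """
    lo, hi = 0, len(nested_chain)
    while lo < hi:
        mid = (lo + hi) // 2
        if point_in_polygon(point, nested_chain[mid]):
            lo = mid + 1   # the bracket is at mid or further inward
        else:
            hi = mid       # the bracket is strictly outside mid
    return lo - 1


def square(half):
    return [(-half, -half), (half, -half), (half, half), (-half, half)]


# Five nested contours; index 0 is the lowest (outermost) contour.
chain = [square(50), square(40), square(30), square(20), square(10)]
on_slope = outer_bracket_index((25, 0), chain)
off_map = outer_bracket_index((60, 0), chain)
summit = outer_bracket_index((0, 0), chain)
```

If the returned index is i, the outer bracket is chain[i] and the inner bracket, when
it exists, is chain[i + 1]. A hill with several summits does not form a single chain,
which is why the containment graph must first be partitioned as described above.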


The  Contour  Containment  Graph,  and  Labeling  of  Topographic  Features. 

As a first step in constructing the contour containment graph, we can uniquely label each contour, and then
sort all contours on elevation, in ascending order. We then "nest" contours. To illustrate, suppose a specific
100-meter contour is labeled, and the contour interval of the map is 10 meters: we now seek to find all 110-meter
contours contained within the labeled contour. If we find one, we create a pointer from the 100-meter contour's label
to the label of the 110-meter contour discovered to be contained in the contour. We continue this process until no
more contours are found to be within the 100-meter contour. We repeat this operation for all other 100-meter
contours. When this step is completed, we switch our baseline cell complex from all those bounded by 100-meter
contours to all those bounded by 110-meter contours, and continue the process until there are no contours remaining
to be processed. An example of a terrain and its contour containment graph is depicted in the figure below. The
terrain features three hills. All three are contained within baseline contours of twenty and forty meters elevation.
Note that a label may be associated with the forty-meter contour to delimit the extent of the "hill country". Also, a
label may be installed on each of the sixty-meter contours to name the individual hills. One of the three hills
contains two small knobs at the top, at an elevation of one hundred twenty meters. Each time that the set {C+}
contains multiple elements for a given contour C, another level of sorting must be initiated to assure that the
contour containment graph is properly structured and nested for binary search.

Figure 4. A sorted terrain, nested in preparation for binary search.

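The nesting procedure described above can be sketched in a few lines of Python. This is a minimal illustration and not the author's implementation: the tuple layout, the function name `build_containment_graph`, and the circle-based `circle_contains` predicate are all assumptions made here so that the example is self-contained. A real map would supply a polygon-in-polygon predicate instead.

```python
from collections import defaultdict
from math import hypot

def build_containment_graph(contours, interval, contains):
    """Link each contour to the contours one interval higher that it contains.

    contours: list of (label, elevation, shape) tuples (hypothetical layout).
    contains: caller-supplied predicate contains(outer_shape, inner_shape).
    Returns a dict mapping each label to the list of labels nested directly inside it.
    """
    by_elevation = defaultdict(list)
    for label, elevation, shape in contours:
        by_elevation[elevation].append((label, shape))

    children = {label: [] for label, _, _ in contours}
    for elevation in sorted(by_elevation):              # ascending sort on elevation
        for label, shape in by_elevation[elevation]:
            for inner_label, inner_shape in by_elevation.get(elevation + interval, []):
                if contains(shape, inner_shape):        # nesting by containment
                    children[label].append(inner_label)
    return children

def circle_contains(outer, inner):
    """Stand-in predicate: shapes are circles (cx, cy, r), not real polygons."""
    (ox, oy, outer_r), (ix, iy, inner_r) = outer, inner
    return hypot(ox - ix, oy - iy) + inner_r <= outer_r

# Three hills inside baseline contours of twenty and forty meters.
terrain = [
    ("base20", 20, (0, 0, 100)),
    ("hill-country40", 40, (0, 0, 90)),
    ("hillA60", 60, (-50, 0, 20)),
    ("hillB60", 60, (0, 40, 20)),
    ("hillC60", 60, (50, 0, 20)),
]
graph = build_containment_graph(terrain, 20, circle_contains)
```

Here `graph["hill-country40"]` lists the three sixty-meter contours, which is the situation in which the set {C+} has multiple elements and a further level of sorting is needed.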
Within each level of a contour containment graph, binary search may be invoked to achieve O[ log n ] time
complexity, where n is the number of contours contained at that level. To illustrate, in the figure below, a hill is
represented by eight contours. On the first iteration of binary search, a contour halfway up the hillside at eighty
meters is considered, and the query point is determined to be inside. On the second iteration, it is determined that the
query point is not inside the one hundred twenty meter contour, which is three quarters of the way up. The third
iteration decides that the point is not inside the one hundred meter contour, which is five eighths of the way to the
top. At the next iteration binary search concludes, having bracketed the query point between the eighty and one
hundred meter contours, while having interrogated only three of the eight contours.
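The eight-contour example can be traced with a short Python sketch. The function name `bracket_query`, the `(elevation, radius)` representation, and the use of concentric circles in place of real contours are assumptions made for illustration; the `inside` callback stands for whatever point-in-polygon utility is in use.

```python
def bracket_query(contours, inside):
    """Binary search over contours ordered outermost (lowest) to innermost (highest).

    inside(contour) -> bool is the caller's inclusion test for a fixed query point.
    Returns (outer_index, inner_index, probed), where probed lists the indices tested.
    outer_index == -1 means the point lies outside every contour;
    inner_index == len(contours) means there is no inner bracket.
    """
    lo, hi = -1, len(contours)
    probed = []
    while hi - lo > 1:
        mid = (lo + hi) // 2
        probed.append(mid)
        if inside(contours[mid]):
            lo = mid            # the point is inside: search higher contours
        else:
            hi = mid            # the point is outside: search lower contours
    return lo, hi, probed

# Eight nested contours at 20, 40, ..., 160 meters, modeled as concentric circles.
hill = [(elevation, radius) for elevation, radius in
        zip(range(20, 180, 20), (80, 70, 60, 50, 40, 30, 20, 10))]
query_distance = 45             # distance of the query point from the common center
outer, inner, probed = bracket_query(hill, lambda c: query_distance < c[1])
probed_elevations = [hill[i][0] for i in probed]
```

With this data the probes visit the 80, 120, and 100 meter contours in that order, and the query point is bracketed between the 80 and 100 meter contours, as in the text.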


Although two-dimensional binary search may achieve O[ log n ] time complexity over a database of n
contours,  the  issue  remains  open  regarding  the  time  complexity  of  the  search  as  a  function  of  the  number  of  vertices 
contained  within  a  given  contour.  For  example,  one  topographic  contour  may  contain  a  single  vertex,  whereas 
another  may  contain  thousands  of  vertices.  Processing  a  set  of  contours  comprised  of  a  small  number  of  vertices  is 
clearly  more  desirable  for  performance  considerations  than  processing  a  set  of  contours  comprised  of  a  large  number 
of  vertices.  An  objective  metric  of  time  complexity  should  take  into  account  both  the  number  of  contours  and  the 
number  of  vertices  per  contour. 


Partitioning  a  Topographic  Map  for  Binary  Search. 

Any topographical map contains contours of locally minimum elevation. These are readily identified from
the contour containment graph developed in the preceding section. The strategy is to partition the map between all
such contours, by constructing synthesized boundaries to act as cuts for binary search. Optimal placement of the cut
boundaries is a load balancing problem, which needs to address not only the number of contours within each cut, but
also the total number of vertices which comprise contours in the cut. In the diagram below, four hills have been
partitioned by synthesized boundaries into regions suitable for binary search. Note that the bold lines are not
contours but synthetic boundaries. The first cut runs roughly down the middle of the map, and segregates the
rightmost hill from the other three. Observe that the first cut contains nine contours on the left, but only seven on
the right. This is not arbitrary, but is designed to compensate for the longer perimeters of the contours on the right
of the cut (it is implied that a longer perimeter equates to a larger number of vertices in the contour boundary data).
The second cut is dependent upon the decision made during the first cut. If a query point is to the left of the first
cut, then the second cut lies between the two most northerly hills and the hill in the southwest corner. Conversely, if
the query point is to the right of the first cut, then the second cut lies halfway up the rightmost hill. Continuing in
this fashion, the number three cuts are synthesized. No further cuts are shown, but the logic to create them is
similar.
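The load balancing idea can be illustrated with a deliberately simplified sketch. It assumes the hills have already been ordered along one axis and that each hill is summarized by its total vertex count; the function name `balanced_cut` and the sample counts are hypothetical, and the paper itself does not prescribe how the cut is chosen.

```python
from itertools import accumulate

def balanced_cut(vertex_counts):
    """Pick the cut position that best balances total vertices on either side.

    vertex_counts: total contour vertices per hill, ordered along the cut axis
    (at least two hills are required).
    Returns k such that hills[:k] fall on one side of the cut and hills[k:] on the other.
    """
    total = sum(vertex_counts)
    prefix = list(accumulate(vertex_counts))
    # Consider every cut that leaves at least one hill on each side.
    return min(range(1, len(vertex_counts)),
               key=lambda k: abs(total - 2 * prefix[k - 1]))

# Three small hills followed by one hill whose contours have long perimeters.
first_cut = balanced_cut([120, 90, 100, 300])
```

For these counts the cut separates the last hill from the other three, since 310 vertices on one side against 300 on the other is the closest available balance.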


Dealing with contours which exit the clipping region of a map.

Figure 6 is oversimplified. In general, contours are not so well-behaved. There is one common problem to
consider: a contour may exit the rectangular region bounding the map, and therefore pass outside the clipping
region. The problem may be solved by conjoining the troublesome contour with the rectangular edge of the map.
This contrivance forces two polygons to be synthesized from the errant contour, to create a data structure compliant
with two-dimensional binary search. Synthetic boundaries for binary search may also be constructed accordingly.

The figure below depicts a clipped contour of forty-meter elevation which exits the map at both sides.
Two-dimensional binary search requires that data structures be in the form of polygons. We synthetically create two new
polygons by conjoining the clipped contour with the edges of the map. Because the point-in-polygon algorithm of
choice (described in the next section) requires a sense of handedness, we assure that the vertices of the new polygons
are in counterclockwise order. At execution time, we may now ask if a query point is contained within either the
upper or the lower polygon manufactured by utilizing the clipped contour, and proceed accordingly.
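A minimal sketch of this construction follows, assuming the clipped contour runs from the left edge of the map to the right edge and that the map is an axis-aligned rectangle. The names `signed_area`, `ensure_ccw`, and `split_by_clipped_contour` are hypothetical, and the shoelace formula is used here as one common way to check orientation; the paper does not say how orientation is established.

```python
def signed_area(polygon):
    """Shoelace formula: positive when the vertices run counterclockwise."""
    pairs = zip(polygon, polygon[1:] + polygon[:1])
    return sum(x1 * y2 - x2 * y1 for (x1, y1), (x2, y2) in pairs) / 2

def ensure_ccw(polygon):
    """Return the polygon with its vertices in counterclockwise order."""
    return polygon if signed_area(polygon) > 0 else polygon[::-1]

def split_by_clipped_contour(contour, xmin, ymin, xmax, ymax):
    """Close a left-to-right clipped contour against the map edges.

    contour: list of (x, y) vertices starting on x == xmin and ending on x == xmax.
    Returns (upper_polygon, lower_polygon), both counterclockwise.
    """
    upper = contour + [(xmax, ymax), (xmin, ymax)]   # close along the top edge
    lower = contour + [(xmax, ymin), (xmin, ymin)]   # close along the bottom edge
    return ensure_ccw(upper), ensure_ccw(lower)

upper, lower = split_by_clipped_contour([(0, 5), (4, 6), (10, 5)], 0, 0, 10, 10)
```

The two synthesized polygons together tile the 10 by 10 map, with areas 45 and 55, and both are returned in counterclockwise order so that the interior lies to the left during traversal.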



Figure  7.  Creating  two  polygons  from  a  clipped  contour. 


III.  AN  INCLUSION  (POINT-IN-POLYGON)  ALGORITHM,  AND  PROXIMITY. 


Perceived  shortcomings  of  currently  available  point-in-polygon  technology. 

The  two-dimensional  binary  search  algorithm  requires  a  utility  function  to  establish  whether  or  not  a  query 
point  is  inside  a  topographical  contour.  The  utility  function  is  a  true  workhorse,  so  it  must  be  efficient.  There  is 
no  margin  for  error,  which  means  that  point-in-polygon  algorithms  which  rely  on  the  precision  of  machine 
arithmetic  are  inappropriate  candidates.  For  this  reason,  approaches  based  on  the  winding  number  or  the  parity 
algorithm  are  currently  infeasible.  The  Apple  Macintosh  family  of  computers  has  implemented  a  predicate  called 
"point-in-region-p",  available  as  part  of  the  Quickdraw  graphics  repertoire,  but  the  predicate  consumes  quadratic 
amounts  of  region  space  in  memory,  which  becomes  prohibitive  for  even  a  moderate  number  of  polygonal 
boundaries.  A  high-performance  algorithm  from  the  computational  geometry  literature,  based  on  triangulation  [K3], 
is a viable candidate, although it remains an untested quantity, since it has never been tasked against a
multi-megabyte database of topographical contours.

Because of perceived shortcomings of off-the-shelf point-in-polygon algorithms, and the lack of benchmark
data to test the performance of the triangulation algorithm, the author has opted to implement his own algorithm
[C2], which has been extensively tested against actual contour data. The algorithm assures that a contour is oriented
in a counterclockwise direction, so that the interior of the contour is to the left during traversal. Inclusion testing is
then conducted as a function of a query point's proximity to a contour (see figure below). One benefit of the
algorithm is that it returns distance and direction (normal vector) to the nearest point on a boundary, in addition to
the inclusion decision. As will be seen below, the normal vector is crucial to topographic map interpretation. As
originally conceptualized, the algorithm anticipated that every pixel in a digital boundary would be explicitly
available as part of the data structure. However, the Defense Mapping Agency does not represent feature boundaries
so explicitly. Instead, a contour is provided in chain-coded format, where the boundary of the contour consists of a
set of ordered vertices. It is up to the user of the data to create the edges which join the vertices.

Proximity and Inclusion of a Query Point p to Contour C

0. C is a set of chain-coded vertices with implied edges.

1. Order C counterclockwise.

2. Selectively drop normal n to C.

3. If magnitude of n < magnitude of all other normals, then p is closest to edge e of contour C.

4. If p is to the right of C, it is outside; otherwise it is inside.


Figure  8.  The  normal  vector  may  be  used  to  decide  inclusion. 


The  Voronoi  diagram  for  data  produced  by  the  Defense  Mapping  Agency. 

Vector data distributed by the Defense Mapping Agency (DMA) contains three kinds of objects: points,
line segments, and polygons. It has been known for some time that the skeleton, or medial axis, of a polygon
consists of portions of parabolas and line segments [B2]. The parabolas are the locus of equal distance between
points and segments. The line segments are the locus (angle bisectors) between extended segments. It is also true
that for any set of points, segments, and polygons the equidistance locus consists of parabolas and line segments.
Thus, the Voronoi diagram for DMA data, which is defined to be the locus of equal distance, is in general parabolic.

There  is  currently  no  commercial  product  available  to  generate  the  parabolic  Voronoi  diagram  for  an 
arbitrary  set  of  polygons,  segments,  and  points.  However,  there  are  three  research  and  development  tools  (of  varying 
degrees  of  robustness)  circulating  among  researchers  in  academia  [M3].  The  developmental  products  implemented  to 
date  have  encountered  problems  of  numerical  precision,  primarily  when  deciding  upon  which  side  of  a  parabola  a 
query point lies [F2]. However, as indicated at reference [E1], the theory behind the sweepline algorithm [F1] to
generate  the  linear  Voronoi  diagram  should  be  directly  extensible  to  the  parabolic  diagram.  It  is  clear  that  for  the 
asymptotic  solution  to  the  static  proximity  problem,  the  Voronoi  diagram  is  the  paradigm  of  choice.  As  a  stopgap 
measure,  until  a  tool  to  generate  the  parabolic  diagram  is  available,  the  author  has  developed  his  own  proximity 
algorithm,  described  below,  based  on  restricted  use  of  the  normal  vector.  The  author’s  algorithm,  unlike  the  Voronoi 
diagram,  facilitates  dynamic  objects.  If  an  object's  position  changes,  the  Voronoi  diagram  must  reinvoke  a  relatively 
expensive  preprocessing  step,  whereas  the  author's  algorithm  simply  replaces  the  object's  old  boundary  position  with 
the  new. 


Finding the nearest point of a contour to a query point.

A  contour,  which  when  represented  with  digital  data  is  in  the  form  of  a  polygon,  consists  of  a  set  of 
vertices  and  the  implied  edges  which  connect  the  vertices.  Thus,  when  one  speaks  of  proximity  to  a  contour  from  a 
query  point,  one  is  actually  referring  to  minimal  Euclidean  distance  to  the  set  of  vertices,  vs.  distance  to  the  set  of 
edges. 


Minimal  distance  to  an  edge  is  non-trivial  to  compute.  This  process  entails  dropping  the  normal  vector 
from  a  query  point  to  the  edge.  Since  floating  point  operations  may  be  required  at  every  edge  to  which  the  normal  is 
dropped  (although  the  author  introduces  below  a  new  technique  which  avoids  floating  point  arithmetic),  we  would 
like  to  limit  the  number  of  edges  incurring  such  an  expensive  operation.  If  the  normal  vector  strikes  an  edge 
directly,  the  edge  is  said  to  be  admissible  to  the  normal  vector.  Refer  to  the  figure  below.  Clearly,  it  does  not 
behoove us to drop the normal from query point p to edge e2, since the tip of the normal does not even intersect e2,
but  rather  its  extension.  Such  cases  are  precisely  those  which  we  strive  to  avoid,  by  appealing  to  a  normal  vector 
admissibility  filtering  technique.  It  will  be  shown  below  that  as  a  side  effect,  the  filtering  technique  returns  minimal 
distance  to  a  vertex. 


Normal Vector Admissibility

(edges labeled e1, e2, e3, ...)


Figure  9.  Certain  contour  edges  do  not  admit  the  normal  vector. 


Derivation  of  edge  admissibility  conditions  from  the  law  of  cosines. 

Construct orthogonal rays from the endpoints of contour edge e, as in Figure 10 below. Now suppose that
query point p lies between the rays. Note that the angle between edges x and e is acute, as is the angle between
edges y and e. Let the angle between y and e be $\theta_1$ and the angle between x and e be $\theta_2$. Then, by the law of
cosines,

$x^2 = y^2 + e^2 - 2ye\cos\theta_1$  [1]

$y^2 = x^2 + e^2 - 2xe\cos\theta_2$  [2]

The cosine function is positive for acute angles. We therefore obtain

$x^2 + \alpha_1 = y^2 + e^2$  [3]

$y^2 + \alpha_2 = x^2 + e^2; \quad \alpha_1, \alpha_2 > 0$  [4]

These equations are alternatively expressed by the inequalities:

$x^2 \le y^2 + e^2$  [5]

$y^2 \le x^2 + e^2$  [6]


This set of inequalities must be true for segment e to admit the normal vector. Point p of Figure 10 satisfies the
conditions.


Figure 10. An edge is admissible if base angles are both acute.


In practice, it is more likely for the test to be failed than to be passed, so it makes sense to test first for
failure rather than for success. The failure condition may be written as the predicate

$\neg\,(\,x^2 \le y^2 + e^2 \;\wedge\; y^2 \le x^2 + e^2\,)$  [7]

From De Morgan's rules, this may be rewritten

$\neg\,[\,x^2 \le y^2 + e^2\,] \;\vee\; \neg\,[\,y^2 \le x^2 + e^2\,]$  [8]

which is equivalent to

$x^2 > y^2 + e^2 \;\vee\; y^2 > x^2 + e^2$  [9]

If  either  side  of  disjunction  [9]  is  true,  then  edge  e  is  not  admissible  to  the  normal  vector,  and  a  potentially 
expensive  floating  point  operation  is  avoided  by  means  of  a  simple  integer-valued  decision  function.  An  example  of 
satisfaction  of  the  second  inequality  of  the  disjunction  is  illustrated  at  the  figure  below.  In  this  case,  edge  e  fails  the 
admissibility  condition,  so  that  the  normal  vector  computation  is  avoided. 


Inadmissible  Condition 

A  base  angle  is  obtuse. 


e  =  segment  of  boundary 


Figure  11.  An  obtuse  base  angle  precludes  admissibility. 


As a byproduct of the admissibility test, minimal distance to a vertex is returned. Consider the integer-valued
expression $(s_p - s_v)^2 + (t_p - t_v)^2$, where $(s_v, t_v)$ is the coordinate at the vertex and $(s_p, t_p)$ is the
coordinate at the query point. This expression is synonymous with either of the arguments $x^2$ or $y^2$ in
equations [1]-[9] above.

Hence, the filtering operation as a side effect monitors the squares of the distances to each of the vertices of a
contour. When the smallest such expression is found across all vertex possibilities, the square root is extracted. The
entire process involves n integer-valued operations for n vertices, and one floating point operation, for a time
complexity of O[ n ]. The integer-valued operation here involves two integer multiplies, three integer adds, and an
integer comparison. The floating point operation is a single-shot appeal to the square root of the minimal
integer-valued result.
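The filtering pass and its side effect can be sketched together in Python. This is an illustration under stated assumptions, not the author's code: the names `dist_sq` and `filter_edges` are hypothetical, the contour is taken to be a closed polygon with integer coordinates, and edge i is taken to join vertex i to vertex i+1 (wrapping around).

```python
from math import sqrt

def dist_sq(p, q):
    """Squared Euclidean distance; stays integer-valued for integer coordinates."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def filter_edges(p, vertices):
    """Apply failure test [9] to every edge and track the nearest vertex.

    Returns (admissible_edge_indices, nearest_vertex_index, nearest_dist_sq).
    """
    n = len(vertices)
    d2 = [dist_sq(p, v) for v in vertices]       # the x^2 and y^2 terms, shared by adjacent edges
    nearest = min(range(n), key=d2.__getitem__)  # byproduct: closest vertex
    admissible = []
    for i in range(n):
        j = (i + 1) % n
        e2 = dist_sq(vertices[i], vertices[j])
        x2, y2 = d2[i], d2[j]
        if x2 > y2 + e2 or y2 > x2 + e2:         # a base angle is obtuse: reject
            continue
        admissible.append(i)
    return admissible, nearest, d2[nearest]

square = [(0, 0), (10, 0), (10, 10), (0, 10)]    # counterclockwise
edges, nearest, nearest_d2 = filter_edges((4, -3), square)
nearest_distance = sqrt(nearest_d2)              # the single floating point operation
```

For the query point (4, -3) only the bottom and top edges admit the normal; the two vertical edges are rejected by integer comparisons alone, and the nearest vertex is (0, 0) at squared distance 25.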

A Common Lisp implementation of the edge admissibility test might appear as follows:

(defun dissqr (x1 y1 x2 y2)
  ;; Helper assumed by the listing: squared Euclidean distance between two points.
  (+ (* (- x1 x2) (- x1 x2)) (* (- y1 y2) (- y1 y2))))

(defun admissible-normal-segment-p (x y ax ay bx by)
  ;; (ax, ay) and (bx, by) are the endpoints of segment e in the figures; (x, y) is the query point.
  (declare (type integer x y ax ay bx by))
  (let ((dis1sqr (dissqr ax ay bx by))    ; e^2, squared length of the segment
        (dis2sqr (dissqr x y ax ay))      ; squared distance from query point to first endpoint
        (dis3sqr (dissqr x y bx by)))     ; squared distance from query point to second endpoint
    (cond ((> dis3sqr (+ dis1sqr dis2sqr)) nil)   ; first half of disjunction [9]
          ((> dis2sqr (+ dis1sqr dis3sqr)) nil)   ; second half of disjunction [9]
          (t t))))


Finding  the  normal  vector  with  minimum  magnitude,  across  all  segments. 

Although we now have a test to determine which segments of a boundary admit the normal vector from a
query point, we have not said anything about the actual computation of the minimal such vector across all segments.
In this section we develop a new test to find the smallest normal vector, without resorting to any floating point
computations. If the actual magnitude is desired, two floating point operations are required over the entire database.
We appeal to a very useful result from analytic geometry, called the Cevian formula (for a development see [K1]). A
cevian is defined to be a line segment drawn from a vertex of a triangle to the opposite side. Note that medians,
angle bisectors, and altitudes are all examples of cevians. The Cevian formula is shown in the figure below, where n
is an altitude in this case. It is convenient that the altitude is equivalent to the normal vector under discussion. In
the figure, observe that rz and sz are lengths which sum to side z, whereas r and s are ratios which sum to one.


The Cevian Formula

$|n|^2 = r y^2 + s x^2 - r s z^2$


z = segment of boundary


Figure 12. The Cevian formula relates a normal vector to the sides of a triangle.

Unfortunately, we do not know the values of r and s, because we do not know the point at which the
normal vector impacts side z. In the equations below, which until step [15] echo the discussion in [K1], we derive a
formula for the square of the magnitude of the normal in terms of a ratio involving the squares of the sides. Steps
[10]-[11] are a reiteration of the information conveyed by the figure. Steps [12]-[13] involve a substitution for s,
followed by a reformulation as a quadratic equation in terms of r. In step [14] we set the discriminant equal to zero,
because the roots of equation [13] must be non-negative and equal, since r and s form a convex set. Solving this
equation for $n^2$ results in the ratio shown at [15], which but for the divide operation is economical to compute, since
it involves four integer multiplies and three adds. If one were tuning the technique with assembly code, two of the
multiplies (those involving the 4) could be converted into two-bit left shifts, since shifts are cheaper than multiplies.


$r = \frac{rz}{z}, \quad s = \frac{sz}{z}; \quad r + s = 1; \quad rz + sz = z$  [10]

$n^2 = r y^2 + s x^2 - r s z^2$  [11]

$n^2 = r y^2 + (1 - r) x^2 - r (1 - r) z^2$  [12]

$z^2 r^2 - (x^2 + z^2 - y^2)\, r + (x^2 - n^2) = 0$  [13]

$(z^2 - y^2 + x^2)^2 - 4 z^2 (x^2 - n^2) = 0$  [14]

$n^2 = \frac{4 x^2 z^2 - (x^2 + z^2 - y^2)^2}{4 z^2}$  [15]
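As a quick numerical check of [15], offered here as an illustration rather than as part of the original derivation, take the right triangle with sides x = 3, y = 4 and boundary segment z = 5. The altitude onto the hypotenuse is known independently to be 3 · 4 / 5 = 2.4.

```latex
n^2 = \frac{4 x^2 z^2 - (x^2 + z^2 - y^2)^2}{4 z^2}
    = \frac{4 \cdot 9 \cdot 25 - (9 + 25 - 16)^2}{4 \cdot 25}
    = \frac{900 - 324}{100}
    = 5.76,
\qquad n = \sqrt{5.76} = 2.4 .
```

The numerator 576 and the denominator 100 are both integers, so the only floating point work is the final division and square root.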

What about the division by $4z^2$, which implies a floating point operation? The answer is that in order to
find the normal vector of smallest magnitude, we may refrain from performing the division until all admissible
segments have been associated with a numerator and denominator as at [15], and checked against the shortest normal
vector found thus far. The check is made as follows. Let $n_1$ be the normal dropped from a query point to segment
$z_1$, with $x_1$ and $y_1$ the distances to the respective endpoints of $z_1$. Let $n_2$, $z_2$, $x_2$, and $y_2$ be defined similarly. Then
the squares of the normals are shown respectively at [16] and [17]. Now $n_1 < n_2$ if and only if [19] and [20] are true,
but [20] is true if and only if the product of the means is less than the product of the extremes as shown at [21].
Cancelling the common factor produces test inequality [22]. Notice that if we are using integer-valued coordinates,
as we must if we are working with data displayed to a computer screen, there are no floating point expressions
involved in the test.


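$n_1^2 = \frac{4 x_1^2 z_1^2 - (x_1^2 + z_1^2 - y_1^2)^2}{4 z_1^2}$  [16]

$n_2^2 = \frac{4 x_2^2 z_2^2 - (x_2^2 + z_2^2 - y_2^2)^2}{4 z_2^2}$  [17]

$n_1 < n_2 \iff$  [18]

$n_1^2 < n_2^2 \iff$  [19]

$\frac{4 x_1^2 z_1^2 - (x_1^2 + z_1^2 - y_1^2)^2}{4 z_1^2} < \frac{4 x_2^2 z_2^2 - (x_2^2 + z_2^2 - y_2^2)^2}{4 z_2^2} \iff$  [20]

$[\,4 x_1^2 z_1^2 - (x_1^2 + z_1^2 - y_1^2)^2\,]\, 4 z_2^2 < [\,4 x_2^2 z_2^2 - (x_2^2 + z_2^2 - y_2^2)^2\,]\, 4 z_1^2 \iff$  [21]

$z_2^2\, [\,4 x_1^2 z_1^2 - (x_1^2 + z_1^2 - y_1^2)^2\,] < z_1^2\, [\,4 x_2^2 z_2^2 - (x_2^2 + z_2^2 - y_2^2)^2\,]$  [22]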

Using the test is simple. As input we receive a query point and candidate segment with integer coordinates.
The squares of the distances from the query point to the segment endpoints are computed with the usual Euclidean
formula, as is the square of the distance between the endpoints. These three quantities are used to compute the
integer-valued numerator and denominator of equation [15]. The same technique applied to some other candidate
segment produces another numerator and denominator, which we cross-multiply with the first at inequality [22]. If
the product of the means is less than that of the extremes, then the first segment is closer to the query point;
otherwise the second segment is closer. We continue this process until all segments are exhausted, remembering the
segment giving rise to the shortest normal vector as we do so.
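The procedure just described can be sketched as follows. The function names `normal_sq_ratio` and `nearest_segment` are hypothetical, and the segments passed in are assumed to be non-degenerate and to have already passed the admissibility test. The sketch keeps the factor of 4 in both denominators rather than cancelling it as in [22]; the comparison is unaffected.

```python
from math import sqrt

def dist_sq(p, q):
    """Squared Euclidean distance; integer-valued for integer coordinates."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def normal_sq_ratio(p, a, b):
    """Integer numerator and denominator of [15] for the normal from p to segment ab."""
    x2, y2, z2 = dist_sq(p, a), dist_sq(p, b), dist_sq(a, b)
    return 4 * x2 * z2 - (x2 + z2 - y2) ** 2, 4 * z2

def nearest_segment(p, segments):
    """Return (segment, numerator, denominator) for the shortest normal from p."""
    best = best_num = best_den = None
    for seg in segments:
        num, den = normal_sq_ratio(p, *seg)
        # num/den < best_num/best_den  <=>  num * best_den < best_num * den,
        # valid because both denominators are positive.
        if best is None or num * best_den < best_num * den:
            best, best_num, best_den = seg, num, den
    return best, best_num, best_den

bottom = ((0, 0), (10, 0))
top = ((0, 10), (10, 10))
segment, num, den = nearest_segment((4, 3), [top, bottom])
magnitude = sqrt(num / den)       # deferred division and square root
```

For the query point (4, 3) the bottom edge wins with the ratio 3600/400, so the normal has magnitude 3. Python integers do not overflow; in a language with fixed-width integers the fourth-power terms would need a sufficiently wide type.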

Observe that we have located the nearest segment (according to the true Euclidean metric) to a query point
without resorting to any floating point arithmetic. Granted, we do not yet know the magnitude of the shortest
normal vector, but we know that we have the shortest. To obtain the magnitude, we merely need to perform the
division indicated at equation [15], and extract the square root of the result. Note also that we never had to compute
any of the points of a line segment; we were able to make do with the vertices at its endpoints. This latter artifact
demonstrates the power and leverage of the Cevian formula, developed over three centuries ago. The formula may
potentially be used to assist in the generation of the parabolic Voronoi diagram for line segments and polygons.

We briefly summarize before moving on to the next section. When the two-dimensional binary search
paradigm requests the inclusion algorithm to decide whether or not a query point is contained within a specific
contour, the inclusion algorithm is handed the counterclockwise-oriented set of contour vertices and the query point
as arguments. The first action taken by the inclusion algorithm is to subject all of the implied edges of the contour
to the normal vector admissibility test, maintaining the squared distances to the vertices on the side. Generally, the
test returns just a handful of edges admissible to the normal vector. To each of these, the cross-multiplication test
shown at [22] is applied to locate the minimal normal vector. This quantity is compared against the minimal result
obtained for the vertices. If the square of the distance to an edge is smaller than the squared distance to a vertex, a
test is invoked to decide if the query point is to the left or the right of the edge; if to the left, the point is inside the
contour, and if to the right, the point is outside. At this time the numerator and denominator of equation [15] may
be divided and the square root extracted to obtain the actual magnitude of the normal vector. If the squared distance to
a vertex is smaller than that to an edge, a synthetic edge is constructed from the vertex's predecessor and successor
vertices in the contour boundary, and a test is invoked to decide if the vertex is to the left or the right of the
synthetic edge; if to the left, the query point is inside the contour, and if to the right, the point is outside. The
square root may be extracted to obtain the magnitude of the normal vector. The shortest normal vector points to
either the inner or the outer bracketing contour of the query point.
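The summary does not say how the left or right decision is made. One standard integer-only way, shown here as an assumption rather than as the author's method, is the sign of the two-dimensional cross product of the edge direction with the vector from the edge's start to the query point. The names `side_of_edge` and `inside_by_nearest_edge` are hypothetical.

```python
def side_of_edge(a, b, p):
    """Sign test for p relative to the directed edge a -> b.

    Returns a positive integer if p is to the left, negative if to the right,
    and zero if p is collinear with the edge.
    """
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])

def inside_by_nearest_edge(a, b, p):
    """Inclusion decision for a counterclockwise contour whose nearest edge to p is a -> b."""
    return side_of_edge(a, b, p) > 0

# Bottom edge of a counterclockwise square, traversed from (0, 0) to (10, 0).
above = inside_by_nearest_edge((0, 0), (10, 0), (4, 3))
below = inside_by_nearest_edge((0, 0), (10, 0), (4, -3))
```

Because the contour is counterclockwise, its interior lies to the left of each edge, so the point (4, 3) is reported inside and (4, -3) outside. The test uses two integer multiplies and no floating point arithmetic.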


IV.  INTERPRETATION  OF  A  QUERY  POINT  IN  THE  CONTEXT  OF  A  MAP. 

Binary search of a contour containment graph concludes by returning the two bracketing contours of a query
point. The algorithm is now armed with all the information it requires to produce an "interpretation" of a query
point, as defined in the first section of the paper. If either bracket has inherited the name of a mountain, hillside,
crater, etc., for which the bracket is a structural element, then the name is available for simple display, or for further
spatial reasoning operations such as line-of-sight or traversability reasoning. Because inclusion testing as described
above returns as a byproduct the normal vector from a query point to a contour, both the distance to the outer bracket
and the distance to the inner bracket are known when binary search completes. These two distances may be used in
conjunction with the contour interval of the map to obtain estimates for the point's elevation and slope. The
direction from a hilltop to the query point, together with the elevation values and orientation of the bracketing
contours, may be used to determine a directional gradient, which establishes upon which flank of a hillside a query
point resides. The details involved in extracting the elevation, the slope, and the directional gradient are described
below.


Deriving the elevation of a query point from its bracketing contours.

Once the bracketing contours for a query point have been established, it is a simple matter to compute an
interpolated elevation at the query point. Without loss of generality, let us assume p is on an uphill slope from
outer bracket C to inner bracket C+, as depicted in the figure below. The elevation of query point p, denoted El(p),
may be obtained by using similar triangles to compute an expression which accounts for p's relative location
between the contours, and multiplying it by the fixed contour interval of the map. To this expression is added the
baseline elevation at p's outer bracket (if p were on a downhill slope, the expression would be subtracted instead).
Special cases require additional processing. If a query point has an outer bracket but no inner bracket, as it will when
it resides within a contour which represents a hilltop, and there are no control points available to indicate the actual
elevation at the hilltop, then the query point inherits the elevation of its outer bracket, since interpolation is
impossible. If a control point is available (generally obtained by surveyors with a spirit level, and represented on the
map with an "X" or a delta symbol), then interpolation is possible even in the absence of an inner bracketing
contour. One simply coerces the inner bracketing contour to be the control point, and temporarily sets the map
contour interval to be the difference between the elevation of the control point and the elevation of the outer bracket.
Downhill slopes, craters, saddles, and culverts may be treated with similar logic.
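The text does not write out the interpolation expression. A minimal sketch, assuming that p's relative location between the contours is the distance to the outer bracket divided by the sum of the distances to both brackets, is given below. The function name `interpolate_elevation` and its parameters are hypothetical, and the two distances are the normal vector magnitudes returned by inclusion testing.

```python
def interpolate_elevation(outer_elevation, interval, d_outer, d_inner, uphill=True):
    """Linear interpolation of elevation between two bracketing contours.

    outer_elevation: elevation of the outer bracket C.
    interval: contour interval of the map (or the temporary interval to a control point).
    d_outer, d_inner: distances from the query point to the outer and inner brackets;
    their sum must be positive.
    """
    fraction = d_outer / (d_outer + d_inner)     # relative location between the brackets
    offset = fraction * interval
    return outer_elevation + offset if uphill else outer_elevation - offset

# Query point three quarters of the way from the 80 m contour to the 100 m contour.
el_p = interpolate_elevation(80, 20, 30.0, 10.0)

# Hilltop case: outer bracket at 140 m, control point at 152 m used as the inner bracket.
el_top = interpolate_elevation(140, 152 - 140, 5.0, 5.0)
```

The first call yields 95 meters. The second shows the control point case, where the interval is temporarily replaced by the 12 meter difference between the control point and the outer bracket.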


Figure 13. Elevation is obtained through simple linear interpolation.


Obtaining  the  slope  at  a  query  point  from  its  bracketing  contours. 

The local slope at a query point is so simple to estimate that even interpolation is not required. It is
simply the angle with a tangent equal to "the rise over the run". The "rise" is fixed, as it is given by the contour
interval of the map. The "run" is defined to be the sum of the magnitudes of the normal vectors drawn to the outer
and inner bracketing contours. At the top of a hill or at the base of a depression, in a saddle or a culvert, the slope is
assumed to be zero, for flat ground. However, if a control point is available to provide additional elevation data, then
logic similar to that outlined for elevation in the paragraph above may be utilized to obtain a refined estimate of
slope. Outside the limits of the lowest lying contours, the algorithm is designed to return the string "drainage area",
which again is assumed to be flat ground. There may or may not be a perennial stream flowing through a drainage
area, but during flash floods it is assumed that water would flow there.

Note  that  a  peculiar  thing  happens  if  we  slide  the  query  point  along  either  of  the  normal  vectors  pointing  to 
the bracketing contours. The slope remains fixed as we do so. This is the price we pay for approximating a terrain
by  a  set  of  cross-sectional  contours.  The  computed  slope  cannot  be  made  more  accurate  than  the  resolution  imposed 
by  the  contour  interval  of  the  map.  Thus,  between  any  two  nested  contours,  there  is  a  vector  field  of  slope  vectors 
which  connect  every  digital  point  of  the  inner  bracket  with  some  digital  point  of  the  outer  bracket,  and  vice  versa. 
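
As a rough illustration of the slope and elevation computations described above, the following Python sketch works from the lengths of the two normal vectors. The function and argument names are invented for this example, and the interpolation formula assumes the uphill case (the outer bracket is the lower contour); none of it is taken from the Lisp implementation.

```python
import math


def local_slope_degrees(contour_interval, dist_outer, dist_inner):
    """Angle whose tangent is rise over run.

    rise = contour interval of the map (fixed)
    run  = |normal to outer bracket| + |normal to inner bracket|
    """
    run = dist_outer + dist_inner
    if run <= 0:
        raise ValueError("the run must be positive")
    return math.degrees(math.atan(contour_interval / run))


def interpolated_elevation(outer_elevation, contour_interval, dist_outer, dist_inner):
    """Linear interpolation between the brackets on an uphill slope (assumed form)."""
    run = dist_outer + dist_inner
    if run <= 0:
        raise ValueError("the run must be positive")
    return outer_elevation + contour_interval * dist_outer / run
```

Because only the sum of the two normal lengths enters the slope, sliding the query point along the normals changes the interpolated elevation but leaves the slope unchanged, which is the behavior noted in the text.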


[Figure 14 diagram: Computing the Slope Between Bracketing Contours. Slope = arctan(dy/dx), where dy is given by the contour interval and dx = |n_C| + |n_C+|, the sum of the magnitudes of the normals drawn to the bracketing contours C and C+.]

Figure 14. The "rise" is fixed, and the "run" is the sum of the normal vector magnitudes.
Obtaining the directional gradient at a query point from its bracketing contours.

Again, assume the familiar example of an uphill slope, so that E1(C) = E1(C+) - k, where k is the fixed
contour interval of the map. Construct the vector from the hilltop to the query point. Define the hilltop to be the
control  point  at  the  top  of  the  hill  if  it  exists;  otherwise  make  it  some  reasonable  estimate,  such  as  the  centroid  of 
the  topmost  contour.  If  there  are  multiple  topmost  contours,  then  make  the  hilltop  the  centroid  of  them  all. 

Suppose that the hilltop to query point vector points to the left, as in the figure below. Then it is pointing
downhill, because C's elevation is less than that of C+, and it is pointing to the west, since due north is as shown by
the map. The query point is therefore on the western flank of a hillside. Variations on this theme are computable
for other configurations of terrain. If the elevation of C is greater than that of C+, and we observe a leftward-pointing
vector, then we would be on the western flank of a crater or valley. If the elevation of C were equal to
that of C+ and the vector were to point to the south, then the query point would be on a saddle or in a culvert,
oriented in an east-west fashion.

The  vector  pointing  from  a  hilltop  to  a  query  point  is  a  suitable  gauge  of  directional  gradient  from  a  global 
perspective.  However,  a  query  point  may  be  situated  locally  on  a  geologic  feature  of  a  hillside,  with  an  orientation 
seemingly  at  odds  with  the  global  result.  For  example,  on  the  south  side  of  a  mountain,  there  may  be  a  ridge  which 
proceeds  from  the  summit  down  to  the  south.  The  ridge  will  have  both  eastern  and  western  flanks.  Suppose  for  the 
sake  of  argument  that  a  query  point  is  on  the  western  flank  of  the  ridge.  We  conclude  it  is  possible  for  a  query  point 
to  be  locally  on  a  western  flank,  but  globally  on  the  south  side  of  the  mountain.  The  local  flank  estimate  is  easily 
computed  by  drawing  the  normal  vector  from  the  query  point  to  its  outer  bracketing  contour.  The  vector  points  in 
the  compass  direction  of  the  local  gradient.  This  procedure  is  particularly  useful  for  rugged  terrain  such  as  that 
encountered  on  Mount  Rainier  in  the  state  of  Washington,  where  contour  data  tends  to  resemble  a  set  of  nested 
"octopuses". 


[Figure 15 diagram: Computing the Directional Gradient on Hillsides. Magnetic north is given by the map.]

Figure 15. Determining a query point's emplacement on a hillside.
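
The flank determination can be sketched in a few lines of Python. This is only an illustration under stated assumptions: map coordinates are taken with x increasing to the east and y increasing to the north, the eight-way compass split is a choice made here, and the names `compass_direction` and `flank` are hypothetical.

```python
import math

# Eight compass sectors, counter-clockwise starting from east.
COMPASS = ["east", "northeast", "north", "northwest",
           "west", "southwest", "south", "southeast"]


def compass_direction(dx, dy):
    """Compass sector of the vector (dx, dy); x grows eastward, y grows northward."""
    if dx == 0 and dy == 0:
        raise ValueError("the zero vector has no direction")
    angle = math.degrees(math.atan2(dy, dx)) % 360.0
    return COMPASS[int((angle + 22.5) // 45) % 8]


def flank(hilltop, query_point):
    """Global flank: direction of the vector from the hilltop to the query point."""
    return compass_direction(query_point[0] - hilltop[0],
                             query_point[1] - hilltop[1])
```

The same `compass_direction` helper could be applied to the normal vector drawn from the query point to its outer bracketing contour to obtain the local flank estimate described above.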
V. AN IMPLEMENTATION, AND CONCLUSIONS.


The theory of automated topographic map interpretation, as developed to date, has been partially
implemented on a Macintosh IIfx workstation, using Macintosh Common Lisp, version 2.0. There are plans to
convert the code into C, using the Symantec Think C environment. The conversion is intended not so much for
performance purposes, since the Lisp compiler's performance is favorable when compared to that of the Symantec
package, but to be able to control the process of garbage collection, which in the Lisp package is beyond the reach of
the user.


There are two databanks of contour data: real and simulated. The first source of the real variety is a set of
digital elevation matrix (DEM) data, which is a gridded representation of elevation values sampled at equispaced x and
y increments. DEM data is produced by the United States Geological Survey (USGS), as a result of data
collection performed primarily by civilian engineers. To obtain topographical contours from DEM data, one may
utilize a geographic information system (GIS) to extract contiguous points of equal elevation from the grid. The
author used an off-the-shelf GIS package called MacGridzo [M1] to create contours for the Killeen, Texas area. The
second source of real data, which is the military counterpart to DEM data, is digital terrain elevation data (DTED),
which currently is available in two resolutions: Level I, at 100 meter spacing, and Level II, at 30 meter spacing.

The simulated data is handcrafted by appealing to Macintosh Quickdraw graphics. A representative terrain
containing four hillsides is depicted in the figure on the next page, where a query point is represented by the tip of
the cursor (the arrowhead at the right). In this case, the partitioning algorithm during a preprocessing step created
level one and level two cuts to segregate the four hillsides. The level three cuts and beyond partitioned each of the
individual hills, using the nesting principle described earlier. Now comes execution time, and two-dimensional
binary search. In this example, hills two and four were selected in the first binary cut, and hill two in the second
cut. In the third cut, it was determined that the query point was not inside hill two's forty meter contour; in the
fourth cut it was determined that the point was inside the twenty meter contour. Binary search concludes at this time
because the contour containment graph has been exhausted. Therefore, the outer bracket is hill two's twenty meter
contour, and the inner bracket is the forty meter contour. Associated with each of these two contours is the label
"HILL2". The gradient computation deduces an easterly downhill slope; the slope is computed to be forty eight
degrees; and the elevation interpolates to thirty three meters. Currently, the interpretation process says nothing
about the relationship among HILL2 and the other hills; future work will address this issue.
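
The last stage of the search, locating the bracketing contours within a single hill, can be sketched as an ordinary binary search over nested contours. The sketch below is an assumption-laden illustration, not the paper's two-dimensional binary search: it ignores the partitioning cuts, represents each contour as a simple polygon, uses floating-point ray casting rather than the integer proximity and inclusion tests, and all names are hypothetical.

```python
def point_in_polygon(point, polygon):
    """Ray-casting inclusion test for a simple polygon given as (x, y) vertices."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):                      # edge straddles the horizontal ray
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def find_brackets(point, nested_contours):
    """Indices (outer, inner) of the bracketing contours of one hill.

    nested_contours is a list of (elevation, polygon) pairs ordered from the
    outermost contour to the innermost one. Because the contours are nested,
    containment is monotone along the list, so binary search applies.
    Either index may be None (no outer bracket: drainage area; no inner
    bracket: hilltop).
    """
    lo, hi = 0, len(nested_contours)   # contours [0, lo) contain the point
    while lo < hi:
        mid = (lo + hi) // 2
        if point_in_polygon(point, nested_contours[mid][1]):
            lo = mid + 1
        else:
            hi = mid
    outer = lo - 1 if lo > 0 else None
    inner = lo if lo < len(nested_contours) else None
    return outer, inner
```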


Future  Work. 

The  research  to  date  has  focused  on  a  local  interpretation  of  a  query  point.  By  definition,  a  local 
interpretation  is  limited  to  a  description  of  a  query  point  in  terms  of  the  label  of  the  hill  upon  which  it  resides,  the 
two contours which bracket it, an interpolated elevation, a slope value, and a directional gradient. This information
is useful for localized reasoning about the immediate environs of a query point. A natural outgrowth of this work is
to extend the reasoning to a more global capability. For example, one could utilize knowledge about the location of
a hillside with respect to other hillsides in a specific region, to achieve context-cued line of sight reasoning or
traversability planning.

To illustrate line-of-sight reasoning, consider the following example, based on the author's personal
experience. In Grand Teton National Park in Wyoming, if one is on the western shore of Jenny Lake, the tallest
visible peak is Teewinot Mountain, which looms spectacularly nearly a mile above the observer's head. One peak
away is the Grand Teton, which although a thousand feet higher, may not even be seen from this vantage point
because it is blocked from sight by Teewinot. The interpretation process described in Section IV would determine
that the query point is on the eastern flank of Teewinot Mountain, on a moderate slope, at about 6600 feet elevation.
Utilization of the directional gradient calculation would indicate that the directions to the tops of Teewinot and the
Grand Teton are roughly the same, but the slope of the segment joining the query point to the top of Teewinot is
greater than that drawn to the top of the Grand Teton. Hence, one concludes that line-of-sight westward to the Grand
Teton is restricted by the intervening mass of Teewinot. Future work will involve refining and formalizing concepts such
as these.
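
The slope comparison in this example can be written down directly. The Python sketch below is a simplified illustration: it assumes both peaks lie on roughly the same bearing, ignores earth curvature and any terrain between the peaks, and the horizontal distances used in the usage note are hypothetical values chosen only to make the example concrete.

```python
import math


def elevation_angle(observer_elevation, target_elevation, horizontal_distance):
    """Angle of the sight line above the horizontal, in radians."""
    return math.atan2(target_elevation - observer_elevation, horizontal_distance)


def is_hidden(observer_elevation, near_peak, far_peak):
    """True when the nearer peak subtends a steeper sight line than the farther one.

    Each peak is an (elevation, horizontal_distance) pair measured from the observer.
    """
    near_angle = elevation_angle(observer_elevation, *near_peak)
    far_angle = elevation_angle(observer_elevation, *far_peak)
    return near_peak[1] < far_peak[1] and near_angle > far_angle
```

For instance, with the observer at 6600 feet, a near peak at 12300 feet and a far peak a thousand feet higher, and assumed horizontal distances of 8000 and 16000 feet, `is_hidden(6600, (12300, 8000), (13300, 16000))` reports that the farther peak is blocked.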


Already,  the  normal  vector  admissibility  filtering  technique  has  been  extended  to  objects  other  than 
topographic  contours.  The  Defense  Mapping  Agency  produces  a  set  of  vector  overlays  corresponding  to  a 
transportation  network,  a  hydrology  network,  obstacles,  surface  orientation,  surface  composition,  and  vegetation 
type. In addition, the DMA produces a gridded product called digital terrain elevation data (DTED), at both thirty and
one hundred meter horizontal resolution. The vector products together with thirty-meter DTED in large part
comprise what is known as tactical terrain data (TTD), a database being developed by DMA with the cooperation of
the US Army Topographic Engineering Center [M2]. The integer-based decision rule derived from the law of cosines
has proven to be of high utility in gauging proximity and inclusion with respect to the multi-megabyte vector
databases  contained  in  TTD. 
Conclusions


Two-dimensional  binary  search  has  been  utilized  in  conjunction  with  two  new  algorithms  which  avoid  the 
expensive  floating  point  operations  associated  with  computing  the  normal  vector,  to  produce  an  algorithm  adept  at 
locally  interpreting  topographic  line  maps.  An  interpretation  consists  of  a  human-like  description  of  a  query  point 
in  terms  of  its  global  location,  interpolated  elevation,  local  slope,  and  directional  gradient.  The  search  algorithm 
relies  heavily  upon  proximity  and  inclusion  algorithms  developed  with  computational  geometry  research  funded  by 
the  US  Army.  For  credibility,  the  technique  is  being  leveraged  against  multi-megabyte  databases  of  contour 
information corresponding to actual terrain. An integer-based decision function which arbitrates when to drop the
normal  vector  to  an  edge  (during  proximity  testing)  has  proven  to  be  extensible  to  objects  other  than  elevation 
contours,  such  as  segments  of  roads  and  streams,  and  polygons  delimiting  types  of  vegetation  cover  and  surface 
material  composition.  As  a  byproduct  of  the  research,  an  algorithm  based  on  the  ancient  Cevian  formula  has  been 
developed  to  find  the  nearest  segment  to  a  query  point,  without  using  any  floating  point  operations  whatsoever. 
New  work  will  focus  on  extending  the  definition  of  map  interpretation  to  be  more  globally  descriptive  of  a  terrain. 
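
This section does not reproduce the integer-based decision function itself, so the following Python sketch shows only one way a rule of that kind could look. It assumes integer (digitized) coordinates and uses the law of cosines on squared lengths: the perpendicular from a point P meets segment AB between its endpoints exactly when the angles at A and at B in triangle PAB are both non-obtuse. The function names are hypothetical.

```python
def squared_distance(p, q):
    """Squared Euclidean distance; stays an integer for integer coordinates."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def normal_lands_on_segment(p, a, b):
    """True when the perpendicular dropped from p meets segment ab.

    By the law of cosines, the angle at a is non-obtuse iff
    |pb|^2 <= |pa|^2 + |ab|^2, and likewise for the angle at b.
    No square roots or floating-point operations are needed.
    """
    pa = squared_distance(p, a)
    pb = squared_distance(p, b)
    ab = squared_distance(a, b)
    return pb <= pa + ab and pa <= pb + ab
```

When the test fails, the nearest point of the segment is one of its endpoints, so a proximity routine would compare endpoint distances instead of dropping a normal.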
ABSTRACT

In  many  statistical  problems,  it  is  often  difficult  to  evaluate  the  exact  distribution  of  a 
statistic.  In  such  cases  we  resort  to  asymptotic  methods.  The  paper  surveys  some  recent  results 
on  asymptotic  expansions  of  the  distribution  of  a  statistic,  with  successive  terms  providing 
improvement in accuracy but decreasing in magnitudes with orders of successive powers of $1/n$
or $1/\sqrt{n}$, where n is the sample size. It also describes the methods of jackknife and bootstrap
for  making  approximate  computations  of  some  distributional  aspects  of  estimators  and  test 
criteria.  Recent  work  on  robust  inference  is  briefly  mentioned. 
1. INTRODUCTION


1.1  Exploratory  and  Inferential  Data  Analysis 

Broadly speaking, eliciting information (or drawing inference) from observed data is
accomplished  in  two  stages.  The  first  is  EDA  (exploratory  data  analysis),  the  purpose  of  which 
is  to  understand  the  nature  of  the  data  and  the  underlying  stochastic  mechanism.  This  is  done 
through  a  descriptive  analysis  of  data  and  graphical  displays,  which  can  be  of  help  in  scrutinizing 
data  for  inhomogeneities,  editing,  faking  and  other  inconsistencies  and  in  suggesting  a  probability 
model  underlying  the  data.  The  second  is  IDA  (inferential  data  analysis)  whose  purpose  is  to  test 
the  appropriateness  of  the  stochastic  model  selected  for  the  analysis  of  observed  data,  to  estimate 
the  unknown  parameters  in  the  model,  and,  what  is  more  important,  to  evaluate  in  quantitative 
terms  the  uncertainty  in  the  conclusions  drawn  (inference)  from  the  data.  The  two  stages  of  data 
analysis,  EDA  and  IDA,  are  represented  in  Table  1,  with  some  additional  explanations  on  the 
generation  of  the  data  and  the  end  results  of  analysis. 

1.2 Parametric and Nonparametric Models

In  IDA,  the  analysis  of  data  is  dictated  by  the  chosen  (or  specified)  stochastic  model  and, 
therefore,  the  validity  of  our  conclusions  depends  on  how  accurate  the  specification  is.  In  the 
early  stages  of  the  development  of  statistical  methodology,  much  emphasis  was  laid  on  parametric 
models, and especially the normal distribution was taken as basic to all quantitative data. Other
more  elaborate  parametric  models  such  as  the  Pearsonian  system  of  frequency  curves  were 
introduced  but  mathematical  difficulties  prevented  their  full  exploitation  in  applications.  At  this 
stage,  attempts  were  made  to  develop  nonparametric  inference  without  assuming  any  particular 
family of probability distributions, i.e., under which the conclusions drawn remain valid whatever
may  be  the  underlying  distribution.  However,  the  inference  in  such  a  case  is  not  as  precise  as 
it  would  be  under  a  valid  parametric  model. 

These  two  methodologies  are  at  the  extreme  ends  of  our  knowledge  about  the  specification 
of  the  stochastic  model.  In  the  sixties  some  new  concepts  were  introduced  by  combining  the 
parametric  and  nonparametric  inference  procedures.  It  was  assumed  that  a  basic  parametric 
model holds, but the observed data may be contaminated with extraneous observations. To
accommodate  such  situations,  the  specification  is  enlarged  by  considering  mixtures  of  the  basic 
model  with  possible  contaminating  distributions.  Methods  are  then  developed  to  ensure 
robustness  against  the  contaminating  distributions.  The  different  approaches  to  specification  and 
the  nature  of  inference  associated  with  them  are  given  in  Table  2. 
Table 1

FORMULATION OF SPECIFIC QUESTIONS
        |
        v
DATA COLLECTION TECHNIQUES
        Design of Experiments
        Historical (published material)
        Sample Surveys
        |
        v
DATA
        RECORDED MEASUREMENTS (HOW ASCERTAINED?)
        CONCOMITANT VARIABLES
        EXPERT OPINIONS, PREVIOUS FINDINGS
        |
        v
CROSS EXAMINATION OF DATA (CED, EDA)
        EXPLORATORY DETECTIVE ANALYSIS
        (detection of outliers, error, bias, faking, internal
        consistency, external validation, special features)
        |
        v
MODELLING
        SPECIFICATION
        (cross validation, how to use expert opinions
        and previous findings, Bayesian analysis?)
        |
        v
INFERENTIAL DATA ANALYSIS (IDA)
        HYPOTHESIS TESTING
        ESTIMATION (point, interval)
        DECISION MAKING
        META-ANALYSIS
        SUMMARY STATISTICS
        GRAPHICAL DISPLAYS
        |
        v
GUIDANCE FOR FUTURE INVESTIGATIONS
Table 2: Inferential Data Analysis

Specific parametric model    | Parametric model with a      | Nonparametric model
                             | neighborhood                 |
-----------------------------+------------------------------+-----------------------------
F(θ)                         | (1-α)F(θ) + αG               | Wide class of distributions
Fisher (1922)                | Huber (1964)                 | Pitman (1937)
-----------------------------+------------------------------+-----------------------------
• What is the appropriate    | • For what class of models   | • How best to use ranked
  method of data analysis    |   is a particular method     |   data?
  for the chosen model?      |   appropriate or meaningful? |
• Inference specific to the  | • Inference on F, robust     | • Inference valid for any
  chosen model               |   against G, and other       |   underlying distribution
                             |   possible errors in data    |
-----------------------------+------------------------------+-----------------------------
Too much reliance on the     | Provides robustness or       | Generally does not result
chosen model. Inference      | insurance against possible   | in efficient use of data
could be misleading          | deviations from the assumed  |
                             | model                        |

1.3  Asymptotic  Theory 

However,  in  IDA,  after  choosing  a  stochastic  model,  we  are  faced  with  the  problem  of 
computing  the  probability  distributions  of  statistics  used  as  estimators  or  test  criteria,  or  in 
general  of  evaluating  complicated  integrals  over  specified  regions  of  the  sample  space.  This  is 
not  easy  to  do  exactly  and  approximations  have  to  be  made.  This  gave  rise  to  asymptotic 
theories  of  inference  based  on  limit  theorems  which  ensured  the  accuracy  of  certain 
approximations  as  the  sample  size  increased.  However,  different  approximations  could  not  be 
compared  on  the  basis  of  limit  theorems  alone,  which  necessitated  investigations  on  the  rates  of 
convergence. New concepts such as second order efficiency (Rao (1961)), small sample (or
higher order) asymptotics (Barndorff-Nielsen and Cox (1989), Field and Ronchetti (1990)) and
resampling techniques such as jackknife (Quenouille (1956), Tukey (1958)) and bootstrap (Efron
(1982)) were introduced. Much of the current research is centered around these concepts, which
are  described  in  this  paper.  The  basic  results  and  techniques  of  asymptotic  theory  are  presented 
in  Table  3. 
Table 3: Asymptotic Theory

Parametric, Normal      | Parametric, Non-normal          | Non-parametric
------------------------+---------------------------------+-------------------------------
Exact sample theory     | • Central limit theorem         | • Central limit theorem
(generally available)   | • Berry-Esseen bound            | • Empirical Edgeworth
                        | • Edgeworth expansion           |   expansion
                        | • Cornish-Fisher expansion      |
                        | • Saddle point approximation    | Resampling techniques
                        | • Conjugate density and         | • Jackknife
                        |   related techniques            | • Bootstrap
                        | • Second (third) order          |
                        |   efficiency                    |

Note  1.  In  this  paper,  only  the  key  references  to  original  papers  are  given.  For  the  numerous 
other  relevant  references,  the  papers  and/or  the  text  books  in  which  they  can  be  found  are 
mentioned. 

Note 2. The following abbreviations and notations are used.

df = distribution function, pd = probability distribution,
EE = Edgeworth expansion, Φ = df of N(0,1),
φ = pd of N(0,1), N(0,1) = standard normal variable,
iid = independent and identically distributed,
Field and Ronchetti (1990) = FR, Barndorff-Nielsen and Cox (1989) = BC,
Bhattacharya and Denker (1990) = BD.


2.  EDGEWORTH  AND  RELATED  EXPANSIONS 
2.1  Berry-Esseen  Theorem 

One of the first results in asymptotic theory is the central limit theorem which says that
the df (distribution function) of the standardized average of a sample of n iid observations
tends to normal as $n \to \infty$. The following theorem gives the conditions under which this result
holds and an expression for an upper bound on the error.

Theorem 2.1 (Berry-Esseen). Let $X_1, \ldots, X_n$ be n iid random variables (r.v.'s) with a
common df F such that $E(X_i) = 0$, $E(X_i^2) = \sigma^2 > 0$, $E|X_i|^3 = \rho < \infty$. Denote by $F_n$ the
distribution of the standardized statistic $n^{-1/2}(X_1 + \cdots + X_n)/\sigma$. Then, for all n,

$$\sup_t |F_n(t) - \Phi(t)| \le \frac{3\rho}{\sigma^3 \sqrt{n}} \qquad (2.1.1)$$

where $\Phi(t)$ is the df of the standard normal variable.

The best known constant which replaces 3 in (2.1.1) is 0.7975. The result of Theorem
2.1 has been generalized to non-iid r.v.'s, U-statistics, linear combinations of order statistics,
symmetric functions of iid r.v.'s and so on. For references to papers on the extensions and
generalizations of the Berry-Esseen theorem, the reader is referred to Field and Ronchetti (1990),
which we indicate by FR in the sequel.
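
A small numerical check may help fix ideas. The Python sketch below, which is an illustration added here and not part of the cited results, computes the exact Kolmogorov distance between the standardized Binomial(n, p) distribution and the standard normal, and compares it with the Berry-Esseen bound using the constant 0.7975 quoted above. The choice of Bernoulli observations and the function names are assumptions of the example.

```python
import math


def normal_cdf(t):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))


def kolmogorov_distance_bernoulli(n, p):
    """sup_t |F_n(t) - Phi(t)| for the standardized sum of n Bernoulli(p) variables.

    F_n is a step function, so the supremum is attained at a jump point,
    either just before the jump (left limit) or at it.
    """
    sigma = math.sqrt(p * (1.0 - p))
    cdf = 0.0
    worst = 0.0
    for k in range(n + 1):
        t = (k - n * p) / (sigma * math.sqrt(n))
        phi_t = normal_cdf(t)
        worst = max(worst, abs(cdf - phi_t))          # left limit at the jump
        cdf += math.comb(n, k) * p ** k * (1.0 - p) ** (n - k)
        worst = max(worst, abs(cdf - phi_t))          # value at the jump
    return worst


def berry_esseen_bound(n, p, constant=0.7975):
    """constant * rho / (sigma^3 sqrt(n)) for centered Bernoulli(p) observations."""
    sigma = math.sqrt(p * (1.0 - p))
    rho = p * (1.0 - p) * ((1.0 - p) ** 2 + p ** 2)   # E|X - p|^3
    return constant * rho / (sigma ** 3 * math.sqrt(n))
```

For p = 0.5 the ratio rho/sigma^3 equals 1, so the bound is simply 0.7975 divided by the square root of n.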

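The Berry-Esseen theorem raises the following question. Under what conditions can we
obtain a complete asymptotic expansion

$$F_n(t) = \Phi(t) + \frac{A_1(t)}{\sqrt{n}} + \frac{A_2(t)}{n} + \cdots$$

such that, with $A_0(t) = \Phi(t)$,

$$\left| F_n(t) - \sum_{j=0}^{r} \frac{A_j(t)}{n^{j/2}} \right| \le \frac{C_r}{n^{(r+1)/2}} \, ? \qquad (2.1.2)$$

We describe some attempts that have been made to solve the above problem, and the results that
are now known.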
2.2 Charlier Differential Series

Let F(x) and G(x) be two distribution functions such that all the derivatives of G
vanish at the extremes of the range of x. Denote the cumulants of F and G by $\beta_r$ and
$\gamma_r$, $r = 1, 2, \ldots$, respectively. Charlier established the following formal expansion

$$F(x) = \exp\left\{ \sum_{r=1}^{\infty} (\beta_r - \gamma_r) \frac{(-D)^r}{r!} \right\} G(x) \qquad (2.2.1)$$

where D denotes the differential operator and $e^{D} = \sum_{j=0}^{\infty} (j!)^{-1} D^{j}$.


The series (2.2.1), known as the Charlier differential series, enables us to find the expansion
for the df of an r.v. in terms of its cumulants and derivatives of any given df. An important
application of this idea, which led to a number of important developments, is due to Edgeworth.

2.3  Edgeworth  Expansion 

Edgeworth (1905) obtained a Charlier type expansion for the distribution of the average of
a sample of n observations in terms of the normal distribution and its derivatives. Modern
versions of EE (Edgeworth expansion) and its extension to other statistics are as follows.

Theorem 2.2. Let $X_1, \ldots, X_n$ be iid r.v.'s with a common df F. Let
$E(X_i) = 0$, $E(X_i^2) = \sigma^2 < \infty$ and $F_n(x) = P[n^{1/2}\bar{X}/\sigma \le x]$.

If F is not a lattice df, and if the third moment $\mu_3$ of F exists, then uniformly in x

$$F_n(x) = \Phi(x) + \frac{1}{\sqrt{n}} \frac{\mu_3}{6\sigma^3} (1 - x^2)\phi(x) + o(n^{-1/2}).$$

Theorem 2.3. Let $X_1, \ldots, X_n$ be iid r.v.'s with a common df F and characteristic function $\psi$.
Further let

$$E(X_i) = 0, \qquad E(X_i^2) = \sigma^2 < \infty$$

and $f_n(x)$ be the pd of $n^{1/2}\bar{X}/\sigma$. Suppose that the standardized moments $\lambda_i = \mu_i/\sigma^i$ exist
for $i = 2, \ldots, k$ and $|\psi|^{\nu}$ is integrable for some $\nu \ge 1$. Then $f_n$ exists for $n \ge \nu$ and as
$n \to \infty$

$$f_n(x) = \phi(x) + \phi(x) \sum_{r=3}^{k} P_r(x)/n^{(r/2)-1} + o\left( \frac{1}{n^{(k/2)-1}} \right)$$

uniformly in x. Here $P_r$ is a real polynomial of degree 3(r-2) depending only on $\lambda_1, \ldots, \lambda_r$
but not on n and k (or otherwise on F).

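For instance, the first few successive approximations to the density function are

$$f_n(x) \approx \phi(x),$$

$$f_n(x) \approx \phi(x) \left[ 1 + \frac{\lambda_3}{6\sqrt{n}} (x^3 - 3x) \right],$$

$$f_n(x) \approx \phi(x) \left[ 1 + \frac{\lambda_3}{6\sqrt{n}} (x^3 - 3x) + \frac{\lambda_4 - 3}{24n} (x^4 - 6x^2 + 3) + \frac{\lambda_3^2}{72n} (x^6 - 15x^4 + 45x^2 - 15) \right].$$

Note that when $\lambda_3 = 0$ (as happens when F is symmetric), the normal approximation is accurate to
order (1/n).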


EE has been extended to U-statistics of degree 2 (Bickel, Götze and van Zwet), regression
models (Qumsiyeh), maximum likelihood estimators (Skovgaard), M-estimators, likelihood ratios
and other statistics (Bhattacharya and Denker (1990)), which we indicate by BD in the sequel,
and functions of mean values of functions of vector random variables (Bhattacharya and Ghosh
(1978)). The results of Bhattacharya and Ghosh have been recently extended by Bai and Rao
(1991) to cases where some of the components of the vector variables are not continuous.
References to the above authors are given in FR and BD.
2.4 Cornish-Fisher Expansion

Consider an estimator $\hat{\theta}_n$ of a real parameter $\theta$ lying in an open set $\Theta$, and let $s/\sqrt{n}$
be an estimate of its standard error. Suppose that $G_n(x;\theta)$, the df of $\sqrt{n}(\hat{\theta}_n - \theta)/s$, has the
asymptotic expansion

$$G_n(x;\theta) = \Phi(x) + \sum_{r=1}^{k-2} n^{-r/2} q_r(x;\theta)\phi(x) + o(n^{-(k-2)/2}) = \Psi_{n,k}(x;\theta) + o(n^{-(k-2)/2}) \qquad (2.4.1)$$

uniformly in $x \in \mathbb{R}$ and $\theta$ in any compact $K \subset \Theta$. In (2.4.1), $q_r(x;\theta)$ are polynomials
in x whose coefficients are smooth functions of $\theta$ (say, (k-2) times continuously
differentiable in $\theta$). For purposes of obtaining a confidence interval for $\theta$ or testing a
hypothesis concerning $\theta$, we need upper and lower tail percentage points of $G_n(x;\theta)$. We can
use the expansion (2.4.1) to obtain a good approximation to the upper p-fractile by finding $x_{p,n}$
such that

$$G_n(x_{p,n};\theta) = p + o(n^{-(k-2)/2})$$

or equivalently

$$\Psi_{n,k}(x_{p,n};\theta) = p + o(n^{-(k-2)/2}). \qquad (2.4.2)$$

The equation (2.4.2) can be solved recursively by starting with the initial approximation $x_p$, the
upper p-fractile of the standard normal distribution, in the form

$$x_{p,n} = x_p + \sum_{r=1}^{k-2} n^{-r/2} c_r(x_p;\theta) + o(n^{-(k-2)/2}), \qquad (2.4.3)$$

which is known as the Cornish-Fisher expansion.
For instance

$$c_1(x_p;\theta) = -q_1(x_p;\theta), \qquad c_2(x_p;\theta) = q_1(x_p;\theta)\, q_1'(x_p;\theta) - 2^{-1} x_p\, q_1^2(x_p;\theta) - q_2(x_p;\theta),$$

where $q_1'$ is the derivative of $q_1$ with respect to x.

For  proofs  and  examples,  the  reader  is  referred  to  BD. 
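
As a hedged numerical illustration of the first Cornish-Fisher correction, the Python sketch below treats the standardized mean of n Exponential(1) observations, using the known standard deviation 1 in place of an estimated standard error. For this case the one-term Edgeworth expansion gives q_1(x) = (skewness/6)(1 - x^2) with skewness 2, so c_1 = -q_1. The example distribution, the bisection routine and all function names are assumptions of this sketch.

```python
import math
from statistics import NormalDist


def erlang_cdf(s, n):
    """P[S <= s] for the sum S of n iid Exponential(1) variables."""
    if s <= 0:
        return 0.0
    term, total = 1.0, 1.0            # term holds s^k / k!
    for k in range(1, n):
        term *= s / k
        total += term
    return 1.0 - math.exp(-s) * total


def exact_quantile(p, n):
    """Quantile of sqrt(n) * (mean - 1), found by bisection on the exact df."""
    lo, hi = -math.sqrt(n), 50.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if erlang_cdf(n + mid * math.sqrt(n), n) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def cornish_fisher_quantile(p, n, skewness=2.0):
    """x_p + n^{-1/2} c_1(x_p), with c_1 = -q_1 and q_1(x) = (skewness/6)(1 - x^2)."""
    x_p = NormalDist().inv_cdf(p)
    q1 = skewness / 6.0 * (1.0 - x_p ** 2)
    return x_p - q1 / math.sqrt(n)
```

With n = 10 and p = 0.95, the normal fractile is about 1.645, while the corrected value lies much closer to the exact fractile of roughly 1.80.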
2.5  Laplace  Approximation 

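Consider an integral of the form

$$g_n = \int_a^b e^{-n r(y)} h(y)\, dy. \qquad (2.5.1)$$

Assume that the minimum of $r(y)$ is attained at $\hat{y} \in (a,b)$, that $r'(\hat{y}) = 0$ and $r''(\hat{y}) > 0$
and that $h(\hat{y}) \ne 0$. Then, we can expand $e^{-n r(y)}$ at $\hat{y}$ and approximate (2.5.1) by omitting
terms involving higher derivatives of $r(y)$:

$$g_n \approx e^{-n r(\hat{y})} \sqrt{\frac{2\pi}{n r''(\hat{y})}} \int_a^b \left[ h(\hat{y}) + (y - \hat{y}) h'(\hat{y}) + \cdots \right] \phi\left( y - \hat{y};\, [n r''(\hat{y})]^{-1} \right) dy$$

$$\phantom{g_n} = e^{-n r(\hat{y})} h(\hat{y}) \sqrt{\frac{2\pi}{n r''(\hat{y})}} \left\{ 1 + O(n^{-1}) \right\}, \qquad (2.5.2)$$

where $\phi(\cdot\,; v)$ denotes the normal density with mean zero and variance v.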

Suppose that $r(y)$ is minimized at a and $r'(y) \ne 0$ there. Then

$$g_n = \frac{e^{-n r(a)} h(a)}{n r'(a)} \left\{ 1 + O(n^{-1}) \right\} \qquad (2.5.3)$$

with a similar result if $r(y)$ is minimized at b. The multiparameter version of (2.5.2) is

$$g_n = \int_D e^{-n r(y)} h(y)\, dy = e^{-n r(\hat{y})} h(\hat{y}) (2\pi)^{m/2} n^{-m/2} |r''(\hat{y})|^{-1/2} \left\{ 1 + O(n^{-1}) \right\}, \qquad (2.5.4)$$

where it is assumed that $r(y)$ takes its minimum at $\hat{y}$ in the interior of $D \subset \mathbb{R}^m$, the gradient
is zero and the Hessian $r''(\hat{y})$ is positive definite.

The  Laplace  approximations  (2.5.2-2.5.4)  have  proved  very  useful  in  Bayesian  inference. 
A  series  of  papers  by  Tierney,  Kass  and  Kadane  give  a  number  of  applications  involving  the 
evaluation  of  posterior  expectation  of  a  parametric  function.  For  further  details  regarding  Laplace 
approximation and references to the above authors, the reader is referred to Barndorff-Nielsen
and  Cox  (1989,  Ch.  3),  which  will  be  denoted  by  BC  in  the  sequel,  and  Reid  (1991). 
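
A classical instance of the one-dimensional interior approximation is Stirling's formula. Writing n! as n^(n+1) times the integral of exp(-n(y - log y)) over y > 0 puts it in the form of a Laplace integral with r(y) = y - log y, h(y) = 1, minimum at y = 1, r(1) = 1 and r''(1) = 1. The Python sketch below, added here as an illustration with hypothetical function names, evaluates the leading term and compares it with the exact factorial.

```python
import math


def laplace_interior(r_min, r_second, h_at_min, n):
    """Leading term exp(-n r) h sqrt(2 pi / (n r'')) evaluated at an interior minimum."""
    return math.exp(-n * r_min) * h_at_min * math.sqrt(2.0 * math.pi / (n * r_second))


def stirling_via_laplace(n):
    """Approximate n! = n^(n+1) * integral_0^inf exp(-n (y - log y)) dy.

    Here r(y) = y - log y has its minimum at y = 1, with r(1) = 1 and r''(1) = 1,
    and h(y) = 1.
    """
    return n ** (n + 1) * laplace_interior(1.0, 1.0, 1.0, n)
```

The ratio of this approximation to the exact factorial differs from 1 by roughly 1/(12n), in line with the O(1/n) relative error of the leading Laplace term.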

2.6  Stochastic  Expansions 

Sometimes, it is useful to obtain a stochastic expansion of a random variable in terms of
other variables, which are easy to handle. To define a stochastic expansion, consider a base set
of sequences $\{b_{hn}\}$, $h = 0, 1, \ldots$ such that as $n \to \infty$, $b_{hn} = o(b_{h-1,n})$ with $b_{0n} = 1$. Typical
examples are

$$b_{0n} = 1, \quad b_{1n} = 1/\sqrt{n}, \quad b_{2n} = 1/n, \quad \ldots \qquad (2.6.1)$$

$$b_{0n} = 1, \quad b_{1n} = 1/n, \quad b_{2n} = 1/n^2, \quad \ldots$$

Suppose that $\{Y_n\}$ is a sequence of continuous r.v.'s such that

$$Y_n = X_0 + b_{1n} X_1 + \cdots + b_{hn} X_h + O_p(b_{h+1,n}) \qquad (2.6.2)$$

where $\{X_0, X_1, \ldots\}$ have a distribution not depending on n. We call an expansion of the form
(2.6.2) a stochastic expansion of $Y_n$. It is seen that as $n \to \infty$, the distribution of $Y_n$
converges to that of $X_0$.

For example, it is shown in Cox and Reid (1987) that if $Y_n$ follows a chi-square
distribution with n degrees of freedom, then

$$(Y_n - n)(2n)^{-1/2} = X_0 + \frac{\sqrt{2}\,(X_0^2 - 1)}{3 n^{1/2}} + O_p(n^{-1}) \qquad (2.6.3)$$

where $X_0$ is a standard normal variable. Under some conditions, stochastic expansions and
asymptotic expansions for distribution functions are equivalent. If

$$Y_n = X_0 + n^{-1/2} X_1 + O_p(n^{-1}) \qquad (2.6.4)$$

then, as demonstrated in Cox and Reid (1987), $F_n(y)$, the df of $Y_n$, has the form

$$F_n(y) = F_0(y) \left\{ 1 + n^{-1/2} a_1(y) + O(n^{-1}) \right\} \qquad (2.6.5)$$

where

$$a_1(y) = -E(X_1 \mid X_0 = y)\, f_0(y)\, \{F_0(y)\}^{-1},$$

$F_0$ and $f_0$ are the df and pd of $X_0$, and $E(X_1 \mid X_0)$ is the conditional mean. If $E(X_1 \mid X_0) = 0$, then (2.6.5) reduces to
$F_0(y)\{1 + O(n^{-1})\}$.
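
A heuristic sketch of why a stochastic expansion of the form (2.6.4) leads to (2.6.5) is given below, followed by its specialization to the chi-square case. The derivation is informal (it conditions on $X_0$ and keeps first-order terms only), and the chi-square line assumes $X_1 = (\sqrt{2}/3)(X_0^2 - 1)$ with $X_0$ standard normal, as in the expansion (2.6.3).

```latex
% First-order effect of the perturbation n^{-1/2} X_1 on the df of X_0:
\begin{align*}
F_n(y) &= P\{X_0 + n^{-1/2} X_1 \le y\} + O(n^{-1}) \\
       &= F_0(y) - n^{-1/2}\, E(X_1 \mid X_0 = y)\, f_0(y) + O(n^{-1}) \\
       &= F_0(y)\,\bigl\{ 1 + n^{-1/2} a_1(y) + O(n^{-1}) \bigr\},
\qquad a_1(y) = -E(X_1 \mid X_0 = y)\, f_0(y)\, \{F_0(y)\}^{-1}.
\end{align*}
% Chi-square case: F_0 = \Phi, f_0 = \phi, X_1 = (\sqrt{2}/3)(X_0^2 - 1):
\[
F_n(y) = \Phi(y) - \frac{\sqrt{2}}{3\sqrt{n}}\,(y^2 - 1)\,\phi(y) + O(n^{-1}).
\]
```

The chi-square result agrees with the one-term Edgeworth expansion, since the skewness of a chi-square variable with n degrees of freedom is $\sqrt{8/n}$ and $\sqrt{8/n}/6 = \sqrt{2}/(3\sqrt{n})$.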

Stochastic expansions of statistics are useful in deriving asymptotic expansions of their
dfs. For instance, in M-estimation (including maximum likelihood) through an estimating
equation,  it  is  often  possible  to  obtain  a  stochastic  expansion  of  the  estimator.  This  enables  the 
derivation  of  distributional  results  concerning  the  estimator.  For  details,  the  reader  is  referred 
to  BC,  BD  and  Cox  and  Reid  (1987). 

3.  SECOND  ORDER  EFFICIENCY 

Following the work of Fisher (1925), Rao (1961) introduced the concept of second order
efficiency (SOE) of an estimator. [In modern terminology, my SOE is called TOE (third order
efficiency).] If $I_n(\theta)$ is Fisher information on $\theta$ in a sample of size n and $I_{t_n}(\theta)$ is the
corresponding information in a Fisher consistent estimator $t_n$, Fisher suggested the expression

$$e = \lim_{n \to \infty} \left( I_n(\theta) - I_{t_n}(\theta) \right) \qquad (3.1)$$

as the limiting loss of information. This criterion is difficult to work with. Rao (1961) defined
$t_n$ to be first order efficient (FOE) if
$$\sqrt{n}\, \left| n^{-1} l_n(x,\theta) - \beta [t_n(x) - \theta] \right| \to 0 \text{ in } P_\theta \text{ probability as } n \to \infty \qquad (3.2)$$

where $l_n(x,\theta)$ is the score (derivative of the log likelihood based on the sample). The second order
efficiency is defined to be the minimum asymptotic variance of

$$l_n(x,\theta) - n\beta [t_n(x) - \theta] - \lambda n [t_n(x) - \theta]^2 \qquad (3.3)$$

for  suitable  choices  of  3  and  X.  Under  some  conditions,  it  was  shown  (Rao,  1961)  that  the 
minimum  asymptotic  variance  of  (3.3)  is 


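$$i(\theta)\left\{ \frac{\mu_{02} - 2\mu_{21} + \mu_{40}}{i^2(\theta)} - 1 - \frac{\mu_{11}^2 + \mu_{30}^2 - 2\mu_{11}\mu_{30}}{[i(\theta)]^3} \right\} \qquad (3.4)$$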


where $i(\theta)$ is the Fisher information in a single observation and

$$\mu_{rs} = E\left[\left(\frac{1}{f}\frac{df}{d\theta}\right)^{r} \left(\frac{1}{f}\frac{d^2 f}{d\theta^2}\right)^{s}\right] \qquad (3.5)$$

where $f$ is the pd of a single observation. It turned out that Fisher's evaluation of (3.1) was
the same as (3.4). The conditions for equivalence of the two definitions remain to be worked
out in a rigorous manner. The result (3.4) is termed the Fisher-Rao theorem by Efron (1975).

In  a  later  paper,  Rao  (1962)  obtained  the  approximate  expression 

$$E(t_n - \theta)^2 = \frac{A_1}{n} + \frac{A_2}{n^2} \qquad (3.6)$$

when $t_n$ is suitably truncated and corrected for bias up to order $(1/n)$. The minimum value of
$A_1$ is $1/i$ and that of $A_2$ is (3.4), which provides a decision theoretic interpretation of SOE.

Rao's work on SOE was extended by Pfanzagl, Ghosh and Subramanyam, Efron, and
Akahira  and  Takeuchi.  Some  of  these  authors  used  the  criteria  of  median  unbiasedness  and 
concentration  around  the  true  value.  References  to  these  authors  and  proofs  of  certain  statements 
can  be  found  in  BD  and  in  a  forthcoming  book  by  J.  K.  Ghosh  (1993). 


4.  SADDLE  POINT  APPROXIMATION 

A  finite  order  EE  provides  a  good  representation  of  the  df  in  the  middle  region,  but  not 
in  the  tails.  Evaluation  of  tail  probabilities  is  often  needed  in  statistical  inference.  Use  of  EE 
for  this  purpose  gives  poor  and  sometimes  negative  results.  A  promising  approach  in  such  cases 
is the saddle point approximation introduced by Daniels (1954), originally for approximating the
density function of $\bar{x}$, the average of $n$ iid r.v.'s $x_1, \ldots, x_n$. We explain the technique for a
general statistic $V_n = V_n(x_1, \ldots, x_n)$ with density $f_n(v)$.

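Let $M_n(t)$ and $K_n(t) = \log M_n(t)$ be the moment and cumulant generating functions
of $V_n$, and let $R_n(t) = n^{-1} K_n(nt)$. Further let, for given $v$, $\hat{t}$ be the unique value such that
$R_n^{(1)}(\hat{t}) = v$, where $R_n^{(s)}$ denotes the $s$-th derivative of $R_n$. Then the density of $V_n$ at $v$
is

$$f_n(v) = \frac{n}{2\pi i} \int_{c - i\infty}^{c + i\infty} e^{n[R_n(t) - tv]}\, dt \qquad (4.1)$$

where $c$ is any real number in the interval where the moment generating function exists.
Expanding at $t = \hat{t}$,

$$n[R_n(t) - tv] = n[R_n(\hat{t}) - \hat{t}v] + \frac{n (t - \hat{t})^2}{2} R_n^{(2)}(\hat{t}) + \cdots \qquad (4.2)$$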


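substituting in (4.1) and integrating term by term, we obtain

$$f_n(v) = \hat{f}_n(v)\left\{1 + \frac{1}{n}\left[\frac{\rho_{4n}(\hat{t})}{8} - \frac{5\rho_{3n}^2(\hat{t})}{24}\right] + \cdots\right\} \qquad (4.3)$$

where

$$\hat{f}_n(v) = \left[\frac{n}{2\pi R_n^{(2)}(\hat{t})}\right]^{1/2} e^{n[R_n(\hat{t}) - \hat{t}v]}, \qquad \rho_{rn}(t) = R_n^{(r)}(t)\big/[R_n^{(2)}(t)]^{r/2}.$$

The expression (4.3) will be an asymptotic expansion if $\rho_{rn}$, $r \geq 3$, are of order at most $O(1)$,
so that the first term dropped is $o$(the last term included).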


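If $V_n = \bar{x}$, we obtain the expression given by Daniels (1954)

$$f_n(\bar{x}) = \left[\frac{n}{2\pi K^{(2)}(\hat{t})}\right]^{1/2} e^{n[K(\hat{t}) - \hat{t}\bar{x}]} \left\{1 + \frac{3\rho_4(\hat{t}) - 5\rho_3^2(\hat{t})}{24n} + O(n^{-2})\right\} \qquad (4.4)$$

where $K(t)$ is the cumulant generating function of $x_i$, an individual observation, $K^{(1)}(\hat{t}) = \bar{x}$,
and $\rho_r(t) = K^{(r)}(t)/[K^{(2)}(t)]^{r/2}$. It is customary to use as the approximation to $f_n(\bar{x})$ a normalized
version of (4.4)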


$$f_n(\bar{x}) = c \left[\frac{n}{K^{(2)}(\hat{t})}\right]^{1/2} e^{n[K(\hat{t}) - \hat{t}\bar{x}]} \{1 + O(n^{-1})\} \qquad (4.5)$$

where $c$ is determined so that the right hand side of (4.5) integrates to unity up to $o(n^{-1})$.
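
As an illustration of (4.4) and (4.5), the following sketch evaluates the leading saddle point term for the mean of $n$ unit exponential observations, a case chosen here only because the exact density (a gamma density) is available for comparison. For this case $K(t) = -\log(1 - t)$, $\rho_3 = 2$ and $\rho_4 = 6$, so the correction term in (4.4) is $-1/(12n)$ and does not depend on $\bar{x}$. The normalized version (4.5) is then exact. The function names are illustrative.

```python
import math

def saddlepoint_density_exp_mean(xbar, n):
    """Leading term of (4.4) for the mean of n iid Exp(1) observations."""
    t_hat = 1.0 - 1.0 / xbar                 # solves K'(t) = 1/(1 - t) = xbar
    k = -math.log(1.0 - t_hat)               # K(t_hat)
    k2 = 1.0 / (1.0 - t_hat) ** 2            # K''(t_hat)
    return math.sqrt(n / (2.0 * math.pi * k2)) * math.exp(n * (k - t_hat * xbar))

def exact_density_exp_mean(xbar, n):
    """Exact density: the mean of n iid Exp(1) variables is gamma with shape n, scale 1/n."""
    log_f = n * math.log(n) + (n - 1) * math.log(xbar) - n * xbar - math.lgamma(n)
    return math.exp(log_f)

n = 5
points = (0.3, 0.7, 1.0, 1.8, 3.0)
ratios = [exact_density_exp_mean(x, n) / saddlepoint_density_exp_mean(x, n) for x in points]
print(ratios)   # the ratio is the same at every point, close to 1 - 1/(12 n)
```

The constant ratio is what the normalizing constant $c$ in (4.5) absorbs.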

Expressions such as (4.3)-(4.5) are obtained by using the method of steepest descent for
evaluating the complex integrals involved, or the method of exponential tilting, or the
conjugate density approach. The second approach involves the use of the EE of the pd of $\bar{x}$ from
the conjugate density $f(y)\exp[\hat{t}y - K(\hat{t})]$ with mean $E(y) = \bar{x}$.

The  saddle  point  approximations  have  now  been  worked  out  for  a  number  of  statistics. 
For  further  details,  the  reader  is  referred  to  FR  and  a  thesis  by  Yau  (1989). 

5.  HAMPEL’S  TECHNIQUE 

Consider the problem of approximating $f_n$, the p.d. of $\bar{x}$, the mean of $n$ iid r.v.'s with
a common p.d. $f$. The conjugate density of $f$ centered at $t$ is

$$h_t(x) = c(t) f(x) e^{\alpha(t)(x - t)} \qquad (5.1)$$

where $\alpha(t)$ is the solution of

$$\int (x - t) h_t(x)\, dx = 0. \qquad (5.2)$$

The existence of $\alpha(t)$ and its derivatives up to order 4 is ensured if we assume

$$\int x^r e^{\alpha x} f(x)\, dx < \infty \quad \text{for } r \text{ up to } 5.$$

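Then

$$f_n(t) = n c^{-n}(t) \int \cdots \int h_t\Bigl(nt - \sum_{i=1}^{n-1} x_i\Bigr) \prod_{i=1}^{n-1} h_t(x_i)\, dx_1 \cdots dx_{n-1} = c^{-n}(t)\, h_{t,n}(t) \qquad (5.3)$$

where $h_{t,n}$ is the density of $\bar{x}$ with underlying density $h_t(x)$. Recall that

$$E_{h_t}(\bar{x}) = t, \qquad V_{h_t}(\bar{x}) = \sigma^2(t)/n. \qquad (5.4)$$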

Hence $h_{t,n}(t)$ can be approximated by $n^{1/2}/[\sqrt{2\pi}\,\sigma(t)]$, and $h'_{t,n}(t)/h_{t,n}(t)$ by $-\sigma'(t)/\sigma(t)$, each
with errors of order $1/n$. The term $n^{-1/2}$ disappears since we are evaluating the density at the
mean. From this

$$\frac{f_n'(t)}{f_n(t)} = -n\frac{c'(t)}{c(t)} - \frac{\sigma'(t)}{\sigma(t)} + O(n^{-1}) = -n\alpha(t) - \frac{\sigma'(t)}{\sigma(t)} + O(n^{-1}). \qquad (5.5)$$


Using (5.5), $f_n(t)$ is obtained by integration, which may have to be done numerically. In FR,
the approximation (5.5) for $f_n'/f_n$ is shown to be extremely good compared to others.


6.  EVALUATION  OF  TAIL  PROBABILITIES 

Tail  probability  approximation  is  another  area  of  great  development.  One  method  is  to 
use  a  good  approximation  to  the  pd  of  a  statistic  and  obtain  the  tail  area  by  numerical  integration. 
Another  is  to  use  the  saddle  point  approximation  directly  to  the  integral  representing  the  tail 
probability.  A  third  method  is  to  evaluate  the  integral  for  the  tail  probability  directly  by 
numerical  integration. 

Consider the example given in FR from the paper by Helstrom and Ritcey (1984). The
problem is to evaluate the tail probability of $T_n > x_0$ where

$$T_n = \frac{1}{2}\sum_{j=1}^{n} (x_j^2 + y_j^2) \qquad (6.1)$$

and $x_j \sim N(s_j, 1)$, $y_j \sim N(t_j, 1)$ are independent r.v.'s. It is seen that under the non-null hypothesis, the
moment generating function of $T_n$ is


$$M_n(t) = (1 - t)^{-n} e^{st/(1 - t)} \qquad (6.2)$$

where $s = (\sum_j s_j^2 + \sum_j t_j^2)/2$. If $s_j$ and $t_j$ are themselves r.v.'s with $M_n^{(S)}$ as the moment
generating function of $S$, then

$$M_n(t) = (1 - t)^{-n} M_n^{(S)}\!\left(\frac{t}{1 - t}\right). \qquad (6.3)$$


The pd of $T_n$ is

$$f_n(x) = \frac{1}{2\pi i} \int_{c - i\infty}^{c + i\infty} M_n(t) e^{-tx}\, dt \qquad (6.4)$$

and the tail probability is

$$P(T_n > x_0) = \frac{1}{2\pi i} \int_{c - i\infty}^{c + i\infty} e^{w_n(t)}\, dt \qquad (6.5)$$

where $w_n(t) = -\log t - t x_0 + K_n(t)$, $K_n = \log M_n$. We have two alternatives: either to use the saddle
point approximation by expanding the integrand in (6.5) around the saddle point and using the
method of steepest descent, or to apply numerical integration directly to (6.5), evaluating the
integral along the contour defined in the $(x, y)$ plane by $\mathrm{Imag}[w_n(x + iy)] = 0$. It is found that the
latter approach gave slightly better results. Perhaps, with the development of accurate numerical
integration methods, the need for asymptotic expansions will become less important.

For  further  details  on  the  subject  and  attempts  at  finding  highly  accurate  explicit 
approximation formulas for tail probabilities stemming from the work of Lugannani and Rice
(1980),  the  reader  is  referred  to  FR  and  Yau  (1989). 
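
The explicit tail formula of Lugannani and Rice can be sketched for the moment generating function (6.2). The sketch below is an illustration rather than the computation reported in FR. It uses the standard form $1 - \Phi(w) + \phi(w)(1/u - 1/w)$ with $w = \mathrm{sign}(\hat{t})\{2[\hat{t}x_0 - K_n(\hat{t})]\}^{1/2}$ and $u = \hat{t}[K_n''(\hat{t})]^{1/2}$, and finds the saddle point by bisection. For $s = 0$ the statistic is a gamma variable with shape $n$, which gives an exact value for comparison. All function names are hypothetical.

```python
import math

def K(t, n, s):            # cumulant generating function, log of (6.2)
    return -n * math.log(1.0 - t) + s * t / (1.0 - t)

def K1(t, n, s):           # first derivative of K
    return n / (1.0 - t) + s / (1.0 - t) ** 2

def K2(t, n, s):           # second derivative of K
    return n / (1.0 - t) ** 2 + 2.0 * s / (1.0 - t) ** 3

def saddlepoint(x0, n, s):
    """Solve K1(t) = x0 on (-inf, 1) by bisection; K1 is increasing there."""
    lo, hi = -1.0, 1.0 - 1e-12
    while K1(lo, n, s) > x0:
        lo *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if K1(mid, n, s) < x0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def lugannani_rice_tail(x0, n, s):
    """Approximate P(T_n > x0); not usable at x0 equal to the mean, where t_hat = 0."""
    t_hat = saddlepoint(x0, n, s)
    w = math.copysign(math.sqrt(2.0 * (t_hat * x0 - K(t_hat, n, s))), t_hat)
    u = t_hat * math.sqrt(K2(t_hat, n, s))
    upper_normal = 0.5 * math.erfc(w / math.sqrt(2.0))
    phi = math.exp(-0.5 * w * w) / math.sqrt(2.0 * math.pi)
    return upper_normal + phi * (1.0 / u - 1.0 / w)

def exact_gamma_tail(x0, n):
    """Exact P(T_n > x0) when s = 0, i.e. for a gamma variable with integer shape n."""
    term, total = 1.0, 1.0
    for k in range(1, n):
        term *= x0 / k
        total += term
    return math.exp(-x0) * total

approx = lugannani_rice_tail(10.0, 5, 0.0)
exact = exact_gamma_tail(10.0, 5)
print(approx, exact)
```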


7.  JACKKNIFE 

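Let $x_1, \ldots, x_n$ be iid r.v.'s having a common df $F$, and $\hat\theta$ be an estimate of a parameter
$\theta$. As a first step, one would like to know the properties of $\hat\theta$ in terms of bias and variance.

The Jackknife methodology assumes that

$$E(\hat\theta - \theta) = \frac{a_1}{n} + \frac{a_2}{n^2} + \cdots, \qquad E(\hat\theta_{-i} - \theta) = \frac{a_1}{n - 1} + \frac{a_2}{(n - 1)^2} + \cdots \qquad (7.1)$$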
where $\hat\theta_{-i}$ is the estimator based on all the observations except $x_i$. The expressions

$$\hat\theta_i = n\hat\theta - (n - 1)\hat\theta_{-i}, \quad i = 1, \ldots, n \qquad (7.2)$$

are called pseudovalues, and their average value

$$\hat\theta_J = n\hat\theta - (n - 1)\hat\theta_{\cdot} \qquad (7.3)$$

where $\hat\theta_{\cdot} = (\hat\theta_{-1} + \cdots + \hat\theta_{-n})/n$, is called the Jackknife estimator of $\theta$. It is seen that the bias in
$\hat\theta_J$ is of the order of $1/n^2$.

Tukey (1958) suggested an estimate of the variance of $\hat\theta_J$ in the form

$$V(\hat\theta_J) = [n(n - 1)]^{-1} \sum_{i=1}^{n} (\hat\theta_i - \hat\theta_J)^2 = (n - 1) n^{-1} \sum_{i=1}^{n} (\hat\theta_{-i} - \hat\theta_{\cdot})^2 \qquad (7.4)$$

and conjectured that the statistic

$$\frac{\hat\theta_J - \theta}{[V(\hat\theta_J)]^{1/2}} \qquad (7.5)$$

has approximately a $t$ distribution with $(n - 1)$ degrees of freedom. The results (7.4) and (7.5)
may not be valid in all cases, but are found to provide good approximations in cases of regular
estimators which are asymptotically locally linear.
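
A minimal sketch of the delete-one Jackknife, following (7.3) and (7.4), is given below. The function names and the sample values are illustrative assumptions. Two well known special cases make a useful check: for the sample mean the Jackknife leaves the estimate unchanged and returns the usual variance estimate $s^2/n$, and for the plug-in variance (divisor $n$) it returns the unbiased sample variance (divisor $n - 1$).

```python
def jackknife(data, estimator):
    """Return the Jackknife estimate (7.3) and Tukey's variance estimate (7.4)."""
    n = len(data)
    theta_hat = estimator(data)
    # estimates with the i-th observation left out
    leave_one_out = [estimator(data[:i] + data[i + 1:]) for i in range(n)]
    theta_dot = sum(leave_one_out) / n
    theta_j = n * theta_hat - (n - 1) * theta_dot
    var_j = (n - 1) / n * sum((t - theta_dot) ** 2 for t in leave_one_out)
    return theta_j, var_j

def mean(xs):
    return sum(xs) / len(xs)

def plug_in_variance(xs):
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

sample = [2.1, 3.4, 1.9, 5.6, 4.2, 3.3, 2.8]          # illustrative data
mean_j, mean_var_j = jackknife(sample, mean)
var_j, _ = jackknife(sample, plug_in_variance)
print(mean_j, mean_var_j, var_j)
```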

More generally, we can delete $d$ observations at a time and compute the estimate based
on the remaining set of observations. Let $\hat\theta_s$ be such an estimate for a particular deletion $(s)$
and $\hat\theta$ be the estimate obtained from all the observations. Then an estimate of $V(\hat\theta)$ is

$$\frac{d!\,(n - d)!\; r}{n!\; d} \sum_{s} (\hat\theta_s - \hat\theta)^2 \qquad (7.6)$$

where the summation is over all possible deletions of $d$ observations and $r = n - d$. When $\hat\theta$
is not a sufficiently smooth function, then the expression (7.6) with a large $d$ is a better
estimator of $V(\hat\theta)$ than (7.4) based on $d = 1$.
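
The delete-$d$ variance estimate can be sketched by direct enumeration of all deletions, which is feasible only for small $n$ and $d$. The multiplier $d!\,(n-d)!\,r/(n!\,d)$ is written as $r/[d\binom{n}{d}]$. The function name and data are illustrative. For the sample mean the estimate equals $s^2/n$ for every $d$, which gives a simple check.

```python
from itertools import combinations
from math import comb

def delete_d_jackknife_variance(data, estimator, d):
    """Delete-d Jackknife variance estimate, summing over all C(n, d) deletions."""
    n = len(data)
    r = n - d
    theta_hat = estimator(data)
    total = 0.0
    for deleted in combinations(range(n), d):
        removed = set(deleted)
        kept = [x for i, x in enumerate(data) if i not in removed]
        total += (estimator(kept) - theta_hat) ** 2
    return r / (d * comb(n, d)) * total

def sample_mean(xs):
    return sum(xs) / len(xs)

obs = [2.1, 3.4, 1.9, 5.6, 4.2, 3.3, 2.8, 4.9]       # illustrative data
v1 = delete_d_jackknife_variance(obs, sample_mean, 1)
v3 = delete_d_jackknife_variance(obs, sample_mean, 3)
print(v1, v3)
```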

Further  details  and  references  to  the  key  contributors  to  Jackknife  beginning  with  the 
pioneering  contribution  by  Quenouille  (1955)  can  be  found  in  a  survey  paper  by  Peddada  (1993). 

8.  BOOTSTRAP 

Bootstrap  is  another  tool  like  jackknife  for  studying  the  properties  of  estimators.  It  is 
more  general  than  jackknife  and  can  handle  problems  where  jackknife  fails. 

Let $\theta$ be a parameter of a df $F$ and $t_n = t(x_1, \ldots, x_n)$ be an estimator of $\theta$ based on
iid r.v.'s $x_1, \ldots, x_n$ from $F$. We wish to know the characteristics of $t_n$ such as the following,
under repeated sampling from $F$.

Bias: $E_F(t_n) - \theta = b$.  (8.1)

Variance: $E_F(t_n^2) - [E_F(t_n)]^2 = v$.  (8.2)

Tail area: $P_F(t_n \geq y) = p_y$.  (8.3)

The actual evaluation of these quantities is extremely complicated even under simple forms for
$F$. Bootstrap is a general resampling technique to obtain their approximate values.

The first step in bootstrap methodology is the estimation of $F$ based on the sample. In
the nonparametric situation $F$ is simply estimated by $\hat{F}_n = F_n$, the empirical df. In the
parametric situation, if $F(x) = F(x;\theta)$ where $F$ has a known form but $\theta$ is an unknown
parameter, then an estimate of $F$ is $\hat{F}_n = F(x;\hat\theta)$ where $\hat\theta$ is an efficient estimate of $\theta$.

We now consider $\hat{F}_n$ as our population and denote by $(x_1^*, \ldots, x_n^*)$, $n$ iid observations
from $\hat{F}_n$. Define $t_n^* = t(x_1^*, \ldots, x_n^*)$ and

$$E_{\hat{F}_n}(t_n^*) - t_n = b^*, \qquad (8.4)$$

$$E_{\hat{F}_n}(t_n^{*2}) - [E_{\hat{F}_n}(t_n^*)]^2 = v^*, \qquad (8.5)$$

$$P_{\hat{F}_n}(t_n^* \geq y) = p_y^*. \qquad (8.6)$$

Since $\hat{F}_n$ is completely known, the quantities (8.4)-(8.6) can in principle be computed, although
it may not be easy.

Efron (1975) suggested approximating them by simulation. That is, we repeatedly draw
samples $(x_1^*, \ldots, x_n^*)$ from $\hat{F}_n$ and compute $t_n^*$ for each sample. Suppose that we have drawn
$B$ samples, in which case we have $B$ values of $t_n^*$

$$t_{n1}^*, \ldots, t_{nB}^*. \qquad (8.7)$$

Then $b^*$ and $v^*$ are estimated from the average and variance of the values in (8.7). We
represent them by $\hat{b}^*$ and $\hat{v}^*$ respectively. Similarly we can find the proportion of values in
(8.7) which are equal to or greater than $y$, which is represented by $\hat{p}_y^*$. It is claimed that when
$B$ is sufficiently large,

$$\hat{b}^* \approx b, \qquad \hat{v}^* \approx v, \qquad \hat{p}_y^* \approx p_y. \qquad (8.8)$$

The claim is substantiated by empirical studies and theoretical investigations on the rates of
convergence of the bootstrap estimates $b^*$, $v^*$ and $p_y^*$ to the true values $b$, $v$ and $p_y$
respectively.
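
The simulation step can be sketched as follows for the nonparametric case, where sampling from the empirical df means drawing with replacement from the observed values. The function name, the data, the seed and the choice $B = 2000$ are assumptions made for illustration. For the sample mean the bootstrap variance $v^*$ is known in closed form (the plug-in variance divided by $n$), so the Monte Carlo value can be compared with it.

```python
import random
import statistics

def bootstrap(data, estimator, B=2000, y=None, seed=0):
    """Monte Carlo approximations to b*, v* and p_y* of (8.4)-(8.6)."""
    rng = random.Random(seed)
    n = len(data)
    t_n = estimator(data)
    # B bootstrap replicates t*_1, ..., t*_B, each from n draws with replacement
    t_star = [estimator([rng.choice(data) for _ in range(n)]) for _ in range(B)]
    b_hat = statistics.fmean(t_star) - t_n
    v_hat = statistics.pvariance(t_star)
    p_hat = None if y is None else sum(t >= y for t in t_star) / B
    return b_hat, v_hat, p_hat

values = [3.1, 4.7, 2.2, 5.9, 4.4, 3.8, 6.1, 2.9, 5.0, 4.1,
          3.5, 4.9, 5.6, 2.7, 4.3, 3.9, 5.2, 4.6, 3.3, 4.0]   # illustrative data
b_hat, v_hat, p_hat = bootstrap(values, statistics.fmean, y=statistics.fmean(values))
v_star_exact = statistics.pvariance(values) / len(values)
print(b_hat, v_hat, v_star_exact, p_hat)
```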


The bootstrap methodology can also be used to find the null distribution of certain test
criteria. For instance, let the test criterion for testing that the mean of $F$ has a specified value
$\mu_0$ be $t = \sqrt{n}\,(\bar{x} - \mu_0)/s$, where $s$ is the standard deviation computed from the sample. We
need the distribution of $t$, or certain percentile points of its distribution.

Let $\hat{F}_n$ be an estimate of $F$ and $(x_1^*, \ldots, x_n^*)$ be a sample from $\hat{F}_n$. Further, let $\bar{x}^*$
and $s^*$ be the average and standard deviation of $(x_1^*, \ldots, x_n^*)$. Construct the statistic
$t^* = \sqrt{n}\,(\bar{x}^* - \bar{x})/s^*$ where $\bar{x}$ is the average of the original sample. We obtain the distribution
of $t^*$ by drawing repeated samples from $\hat{F}_n$. Then

$$P_{\hat{F}_n}(t^* > y) \approx P_F(t > y). \qquad (8.9)$$

Further  details  and  references  to  numerous  key  papers  on  bootstrap  can  be  found  in  a  survey 
paper  by  Babu  and  Rao  (1993). 
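
A percentile point of the bootstrap distribution of $t^*$ can be approximated along the following lines. This is a sketch under stated assumptions: the resampling is nonparametric, the studentization uses the standard deviation $s^*$ of each resample, and degenerate resamples with $s^* = 0$ are skipped. The function name, data and seed are illustrative.

```python
import math
import random
import statistics

def bootstrap_t_quantile(data, q, B=2000, seed=0):
    """Approximate the q-th quantile of t* = sqrt(n) (xbar* - xbar) / s*."""
    rng = random.Random(seed)
    n = len(data)
    xbar = statistics.fmean(data)
    t_star = []
    for _ in range(B):
        resample = [rng.choice(data) for _ in range(n)]
        s_star = statistics.stdev(resample)
        if s_star == 0.0:
            continue                      # degenerate resample, all values equal
        t_star.append(math.sqrt(n) * (statistics.fmean(resample) - xbar) / s_star)
    t_star.sort()
    return t_star[min(len(t_star) - 1, int(q * len(t_star)))]

readings = [4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.9, 4.4, 5.5, 4.9,
            5.2, 3.9, 6.3, 4.6, 5.0]                     # illustrative data
upper_5_percent_point = bootstrap_t_quantile(readings, 0.95)
median_point = bootstrap_t_quantile(readings, 0.50)
print(upper_5_percent_point, median_point)
```

The upper point plays the role of the tabulated $t$ percentile when the test of the mean is carried out.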


9.  ROBUST  INFERENCE 

It  has  been  known  for  some  time  that  certain  statistical  procedures  are  sensitive  to  slight 
departures from the assumptions on which they are based. But it was only in the sixties and
seventies that systematic attempts were made to develop robust techniques. One of the main
concerns was the effect of outliers, gross errors and contaminating observations on estimators.
To  study  this,  new  concepts  such  as  influence  function,  breakdown  point  and  qualitative 
robustness  were  introduced. 


As  an  alternative  to  the  least  squares  method  for  the  estimation  of  parameters  in  a  linear 
model,  which  is  unduly  influenced  by  outliers,  new  methods  of  estimation  known  as  M,  L  and 
R  were  introduced.  The  most  popular  among  them  is  M-estimation  based  on  a  suitably  chosen 
estimating  equation.  For  example,  to  estimate  a  location  parameter  0,  an  equation  of  the  type 


$$\sum_{i=1}^{n} \psi\!\left(\frac{x_i - \theta}{s}\right) = 0 \qquad (9.1)$$

is used, where $x_1, \ldots, x_n$ are iid observations, $\theta$ is an unknown location parameter and $s$ is
some estimate of the scale parameter. To reduce the effect of outliers, $\psi$ functions of the
forms shown in the accompanying diagram are recommended instead of the endless
line passing through the origin at 45°, which is the $\psi$ function for least squares estimation.
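
As a concrete instance of (9.1), the sketch below solves the estimating equation with Huber's $\psi$ function, which is linear near the origin and constant beyond a cutoff. The choice of Huber's $\psi$, the cutoff 1.345, the scale estimate (1.4826 times the median absolute deviation) and the iteratively reweighted averaging scheme are assumptions made for illustration. The text itself only refers to a diagram of bounded $\psi$ functions.

```python
import statistics

def huber_psi(r, k=1.345):
    """Huber's psi: the identity on [-k, k], clipped to -k or k outside."""
    return max(-k, min(k, r))

def huber_location(data, k=1.345, tol=1e-10, max_iter=500):
    """Solve sum psi((x_i - theta)/s) = 0 by iteratively reweighted averaging."""
    theta = statistics.median(data)
    # scale held fixed at the normalized median absolute deviation (assumed nonzero)
    s = 1.4826 * statistics.median([abs(x - theta) for x in data])
    for _ in range(max_iter):
        weights = []
        for x in data:
            r = (x - theta) / s
            weights.append(1.0 if r == 0.0 else huber_psi(r, k) / r)
        new_theta = sum(w * x for w, x in zip(weights, data)) / sum(weights)
        if abs(new_theta - theta) < tol:
            return new_theta
        theta = new_theta
    return theta

contaminated = [1.0, 2.0, 3.0, 4.0, 5.0, 100.0]      # one gross outlier
robust_estimate = huber_location(contaminated)
least_squares_estimate = sum(contaminated) / len(contaminated)
print(robust_estimate, least_squares_estimate)
```

The outlier pulls the least squares estimate (the mean) far from the bulk of the data, while its influence on the M-estimate is bounded by the cutoff.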

For details concerning robust inference, the reader is referred to the thesis of Ronchetti
(1982) and the book by Hampel, Ronchetti, Rousseeuw and Stahel (1986). We can also consider
M-estimation by introducing a loss function $\rho$ and determining $\theta$ such that

$$\sum_{i=1}^{n} \rho(x_i - \theta) \qquad (9.2)$$

is a minimum. In recent papers, Bai, Wu and Rao (1992) and Bai, Liu and Rao (1992)
considered $\rho$ functions of two types. One is the difference of two convex functions and
another is non-increasing in the range $(-\infty, 0)$ and non-decreasing in the range $(0, \infty)$. With
such loss functions, which cover a wide variety of shapes, an asymptotic theory is developed with
a minimum number of assumptions.

Note  1.  Asymptotitis:  In  the  initial  stages  of  development  of  statistics,  much  emphasis  was  laid 
on  small  sample  theory,  an  exact  treatment  of  which  was  possible  under  the  normal  distribution. 
Most  of  the  current  research  papers  deal  with  nonparametric  or  semiparametric  situations  and 
present  asymptotic  results.  No  attempt  is  made  to  examine  how  useful  and  relevant  these  results 
are  for  application  in  actual  practice  where  we  meet  with  only  finite  (small)  samples.  This 
phenomenon  is  aptly  described  by  Tukey  (1993)  as  Asymptotitis.  Perhaps,  we  should  be 
addressing  ourselves  to  questions  like:  What  sample  sizes  suffice  to  make  asymptotic  theory 
useful  (or  even  relevant)? 

Note  2.  Most  of  the  expansions  considered  in  the  literature  use  the  limit  distribution  of  the 
statistic  under  consideration  as  the  leading  term.  There  are  other  possibilities  like  the  one 
considered  by  Rao  (1951),  which  are  likely  to  give  improved  results.  Further  research  in  this 
direction  would  be  useful. 

The variational theory of three dimensional motion of curved twisted and extensible elastic
rods  is  obtained  based  entirely  on  the  kinematical  variables  of  position  and  rotations.  The 
constitutive  relations  that  define  the  resistive  couples  and  the  axial  force  as  gradients  of  the 
strain  energy  function  are  established.  A  candidate  for  the  strain  energy  function,  derived 
on  the  basis  of  classical  assumptions,  is  presented. 


INTRODUCTION 


In the Historical Introduction of "A Treatise on the Mathematical Theory of
Elasticity," Love (1892) narrates that in 1742 Daniel Bernoulli wrote to Euler suggesting
that the differential equation of the elastica could be found by making the integral of the
work done or the square of the curvature a minimum. Acting on this suggestion Euler was
able to obtain the differential equation of the elastica and the various forms of it. Thus the
concept of the strain energy was born and the foundations of the variational theory of elastic
rods were laid. The equilibrium equations that were much later developed by Love are
applicable to an initially bent and twisted rod.

Our  aim  in  this  paper  is  to  establish  a  variational  formulation  for  the  title  problem  and  in 
the  process  infer  the  existence  of  the  strain  energy  function  and  determine  the  constitutive 
relations  that  relate  this  function  to  the  bending  and  twisting  couples  and  the  axial  force 
within  the  rod.  This  development  together  with  equations  of  motion  and  the  geometry  of 
deformation  define  a  direct  approach  and  an  exact  nonlinear  theory  for  the  three 
dimensional motion of a one dimensional elastic medium capable of resisting bending,
twisting and extension. Going a step further, in order to actually construct an explicit
form for the strain energy function, we enter the realm of hypothesis and use Kirchhoff's
description of deformations in a thin rod. This view enables us to determine a strain
energy function that can be used in engineering applications.

The  recent  history  of  investigations  of  the  rod  theories  consists  of  developments  along  two 
separate  streams,  the  direct  approach  and  approximations  from  three-dimensional 
continuum.  In  the  direct  approach  a  one-dimensional  continuum  view  is  pursued  and  the 
medium  is  supposed  endowed,  in  addition  to  its  position,  with  vector  fields,  the  directors, 
that are to be interpreted appropriately to define bending, twist and extension properties of
a rod. This approach has its origin in the work of E. and F. Cosserat (1909), and numerous
investigations have contributed to it, among them Naghdi (1982), Naghdi and Rubin
(1984), Whitman and DeSilva (1970), Green and Laws (1966), and Ericksen (1970).
Extensive investigation into the qualitative aspects of the nonlinear theory, such as
questions of existence of solutions and global behavior, has been carried out by Antman
(1976). His basic work entitled "The Theory of Rods" (1972) describes these theories both
as approximations to the three-dimensional continuum theory and as a one-dimensional
continuum with directors. The work presented here, although it pertains to a
one-dimensional continuum, does not use directors, but is formulated entirely on the basis
of  kinematical  quantities  consisting  of  the  position  vector  of  points  along  the  curve  of 
centroids  and  the  orientation  angles  of  the  cross  sections  of  the  rod  relative  to  a  fixed 
coordinate  system.  It  is  a  generalization  of  the  work  of  Tadjbakhsh  (1966)  in  which  the 
theory  of  planar  motion  of  the  extensible  elastica  was  described. 

The  history  of  construction  of  approximate  theories  in  the  context  of  three-dimensional 
nonlinear  continuum  theory  is  also  varied  and  to  it  many  investigators  including  some  of 
the  above  authors  have  contributed,  see  for  example,  Naghdi  and  Wenner  (1974). 


KINEMATICS 


An  elastica  is  a  nearly  uniform  slender  rod  of  finite  length.  In  the  unstressed  state  the 
centroids  of  the  cross  section  form  a  space  curve  C  that  is  called  the  reference  curve  with 
an arc length s. The orientation of the principal axes of the cross section varies continuously
along the rod. This means that in the unstrained state the rod has arbitrary twist and
curvatures. With respect to an inertial Cartesian frame x the position of a point s in the
unstrained state is denoted by $X = X_i(s) n_i$, $i = 1,2,3$, with $n_i$ being the dextral unit
vectors of the frame x.

The cross sectional area can be a slowly varying function of $s$ and will be denoted by $A(s)$.
As the rod deforms the curve $C$ acquires a new configuration $c$ that changes with time. The
arc length along $c$ is denoted by $\xi$, which depends upon $s$ and $t$, i.e. $\xi = \xi(s,t)$. The position
of a point $s$ on $c$ at an arbitrary time is $x(s,t)$ so that

$$x(s,t_0) = X(s), \qquad \xi(s,t_0) = s \qquad (1a,b)$$

where $t_0$ is a reference time at which the rod is in the unstrained state. Also

$$\xi' = (x_i' x_i')^{1/2} \qquad (2)$$

where prime denotes differentiation with respect to $s$ and summation over a repeated
index is implied. The strain $e$ is defined by

$$e = \xi' - 1 \qquad (3)$$

where $e > 0$ denotes extension and $e < 0$ contraction. The strict positivity of $\xi'$ implies
that $-1 < e < \infty$.
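
The definitions (2) and (3) can be illustrated numerically. The sketch below assumes the deformed centre line is available as a function of the reference arc length $s$ and approximates the derivative by a central difference. The two example curves are hypothetical: a straight rod stretched uniformly by 20 percent, and a rod bent into a circular arc without change of length.

```python
import math

def strain(x, s, step=1e-6):
    """Axial strain e = |dx/ds| - 1 for a centre line x(s) returning (x1, x2, x3)."""
    ahead, behind = x(s + step), x(s - step)
    derivative = [(a - b) / (2.0 * step) for a, b in zip(ahead, behind)]
    return math.sqrt(sum(c * c for c in derivative)) - 1.0

def stretched(s):
    # straight rod along the third axis, uniformly stretched by 20 percent
    return (0.0, 0.0, 1.2 * s)

def bent(s, radius=2.0):
    # circular arc parametrized by the reference arc length: bending without extension
    return (radius * math.cos(s / radius), radius * math.sin(s / radius), 0.0)

print(strain(stretched, 0.3), strain(bent, 0.3))
```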

Attached to any point $s$ of $c$ a Cartesian coordinate frame $y$ will be assumed and
will be referred to as the body reference frame. The coordinate axes of the body frame are
$y_1$, $y_2$, $y_3$ with the $y_3$ axis pointing in the direction of increasing $s$ and $y_1$ and $y_2$ being the
principal axes of the cross section. The dextral unit vectors of the $y$ frame will be denoted
by $e_i$, $i = 1,2,3$, with their orientations at the reference time $t_0$ being $E_i$.

Denoting by $l_{ij}(s,t)$ and $L_{ij}(s)$ the elements of the matrices of direction cosines of
the dextral sets $e_i$ and $E_i$ one has

$$n_i = l_{ij} e_j = L_{ij} E_j \qquad (4)$$

and

$$e_i = l_{ji} n_j, \qquad E_i = L_{ji} n_j \qquad (5a,b)$$

The angles $\varphi_i$ represent rotations between the corresponding pairs of $E_i$ and $e_i$ when
these directions are assumed to issue from a common origin, Fig. 1. These angles are
determined through

$$\cos\varphi_i = e_i \cdot E_i = l_{ji} L_{ji}, \quad i = 1,2,3 \ \text{(sum only on } j) \qquad (6)$$

The direction cosines $l_{ij}$ are characterized non-uniquely by three orientation angles
$\theta_1(s,t)$, $\theta_2(s,t)$ and $\theta_3(s,t)$. These angles can be selected in a variety of ways and
represent three finite rotations about unit vectors $e_i$ or $n_i$. If these rotations are properly
selected the fixed orientation $n_i$ may be brought to any arbitrary body orientation $e_i$.
Kane et al. (1983) list at least 24 possibilities for the order of rotations of the angles $\theta_1$, $\theta_2$,
$\theta_3$ about the body set of unit vectors $e_i$ or the fixed set of unit vectors $n_i$.

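Regardless of the particular choice of orientation angles the angular velocity
$\omega = \omega_i e_i$ of a cross sectional element $A\,ds$, Fig. 1, of the rod is determined uniquely from

$$\omega_i = \eta_{igh}\, \dot{l}_{jg}\, l_{jh}, \qquad \bigl(\dot{(\ )} = \partial(\ )/\partial t\bigr) \qquad (7)$$

where $\eta_{ijk} = \epsilon_{ijk}(\epsilon_{ijk} + 1)/2$ (no sum on $i, j, k$) and $\epsilon_{ijk}$ is the alternator tensor with the
non-zero components $\epsilon_{123} = \epsilon_{231} = \epsilon_{312} = +1$ and $\epsilon_{132} = \epsilon_{321} = \epsilon_{213} = -1$.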

The curvature vector $K = K_i e_i$ of the rod can be defined in a similar way with $K_3$
representing twist and $K_1$ and $K_2$ representing bending curvatures about the principal
directions of the cross section. Using the dynamical analogy of E. J. Routh, Love (1944) has
noted that if the frame $y$ were to move with unit speed along the curve $c$ such that at any
point $\xi$ of $c$ it has the orientation of the $y$ frame at that point then the angular velocities
$\omega_1$ and $\omega_2$ will be the principal curvatures $K_1$ and $K_2$ of the rod. Also the angular velocity
$\omega_3$ will be the twist curvature $K_3$ of the rod. Thus the formulas that define the angular
velocities from direction cosines can be used to determine curvatures, provided time
differentiation is replaced by differentiation with respect to $\xi$. Therefore one has

$$K_i = \eta_{igh} \frac{\partial l_{jg}}{\partial \xi}\, l_{jh} = \frac{1}{1+e}\, \eta_{igh}\, l'_{jg}\, l_{jh} = \frac{k_i}{1+e}, \qquad (8)$$

where differentiation with respect to $\xi$ has been replaced with differentiation with respect
to $s$ and the curvature parameters $k_i = (1+e)K_i$ are also introduced. For future use one may
note the formulas for derivatives of the direction cosines $l_{ij}$ and the unit vectors $e_i$:

$$l'_{ij} = (1+e)\,\epsilon_{ghj}\, l_{ig}\, K_h, \qquad \dot{l}_{ij} = \epsilon_{ghj}\, l_{ig}\, \omega_h \qquad (9a,b)$$

$$e'_i = \epsilon_{kji}\, k_j\, e_k, \qquad \dot{e}_i = \epsilon_{kji}\, \omega_j\, e_k \qquad (10a,b)$$

If $K_i$ are the curvatures of the rod in the unstrained state ($e = 0$), then from (8)

$$K_i = \eta_{igh}\, L'_{jg}\, L_{jh} \qquad (11)$$

Since $e_3$ and $dx/d\xi$ are both unit tangents to the central line one has

$$x'_i = (1+e)\, l_{i3} \qquad (12)$$


We assume that the centers of mass of the cross sections coincide with the centroids.
The linear and the central angular momentum per unit length are then given by

$$p = \rho A\, \dot{x}_i n_i \qquad (13)$$

and

$$H = \rho I \omega = \rho\,(I_{11}\omega_1 e_1 + I_{22}\omega_2 e_2 + I_{33}\omega_3 e_3) \qquad (14)$$

where $\rho$ is the mass density per unit unstrained length and $I$ is the diagonal moment of
inertia tensor with components

$$I_{11} = \int_A y_2^2\, dA, \qquad I_{22} = \int_A y_1^2\, dA, \qquad I_{33} = I_{11} + I_{22} \qquad (15)$$
Equations  of  Motion 

Referring to the body set of axes $e_i$ one can define the vector $F$ of the resultant
shear stresses $F_1$ and $F_2$ and the axial stress resultant $F_3$. Similarly, one may define the
couple stress vector $M$ consisting of the bending moments $M_1$ and $M_2$ and the torque $M_3$.
Explicitly we have

$$F = F_i e_i \quad \text{and} \quad M = M_i e_i \qquad (16)$$

The well known dynamic equilibrium of the rod can be expressed by the equations
of the balance of linear momentum

$$F' + f = \dot{p} \qquad (17)$$

and of the balance of angular momentum

$$M' + x' \times F + m = \dot{H} \qquad (18)$$

wherein $f$ and $m$ represent distributed force and moment acting on the rod. The scalar
components of these equations can be referred to the body set of axes. For this purpose one
needs to express all vector quantities in terms of unit vectors $e_i$ and use (9)-(10). Then
(17) becomes

$$F_1' + k_2 F_3 - k_3 F_2 + f_1^y = \rho A\, \ddot{x}_j l_{j1} \qquad (19a)$$

$$F_2' + k_3 F_1 - k_1 F_3 + f_2^y = \rho A\, \ddot{x}_j l_{j2} \qquad (19b)$$

$$F_3' + k_1 F_2 - k_2 F_1 + f_3^y = \rho A\, \ddot{x}_j l_{j3} \qquad (19c)$$

while (18) assumes the form

$$M_1' + k_2 M_3 - k_3 M_2 - (1+e) F_2 + m_1^y = \rho I_1 (\dot\omega_1 + \omega_2\omega_3) \qquad (20a)$$

$$M_2' + k_3 M_1 - k_1 M_3 - (1+e) F_1 + m_2^y = \rho I_2 (\dot\omega_2 + \omega_1\omega_3) \qquad (20b)$$

$$M_3' + k_1 M_2 - k_2 M_1 + m_3^y = \rho\,[J \dot\omega_3 + (I_2 - I_1)\omega_1\omega_2] \qquad (20c)$$

where $I_1 = I_{11}$, $I_2 = I_{22}$ and $J = I_{33} = I_1 + I_2$. The superscript $y$ on the components of $f$
and $m$ denotes the components of these vectors in the body reference frame.

To express the equations of motion in the inertial frame we introduce the
components of the stress resultants in that frame. Thus

$$F_i^x = l_{ij} F_j, \qquad M_i^x = l_{ij} M_j \qquad (21a,b)$$

Then (19) becomes

$$F_i^{x\prime} + f_i^x = \rho A\, \ddot{x}_i \qquad (22)$$

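and (20) assumes the form

$$M_1^{x\prime} - (1+e) F_2 + m_1^x = \rho\,(I_{rj}\, l_{1j}\, \dot\omega_r + \epsilon_{grj}\, I_{sj}\, l_{1g}\, \omega_s \omega_r) \qquad (23a)$$

$$M_2^{x\prime} - (1+e) F_1 + m_2^x = \rho\,(I_{rj}\, l_{2j}\, \dot\omega_r + \epsilon_{grj}\, I_{sj}\, l_{2g}\, \omega_s \omega_r) \qquad (23b)$$

$$M_3^{x\prime} + m_3^x = \rho\,(I_{rj}\, l_{3j}\, \dot\omega_r + \epsilon_{grj}\, I_{sj}\, l_{3g}\, \omega_s \omega_r) \qquad (23c)$$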

Either of the sets of equations (19)-(20) or (22)-(23) can be considered as the governing
differential equations of motion. These equations will have to be supplemented with
constitutive relations that define the resultant axial stress $F_3$ and the resultant bending and
twisting couples $M_1$, $M_2$, $M_3$ in terms of the axial strain $e$ and curvatures $k_1$, $k_2$, $k_3$. In the
next section we consider the derivation of these constitutive relations.


Constitutive  Relations 

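We assume that the motion of the elastic rod is equivalent to the stationarity of the
Hamiltonian $H$ which is defined by

$$H[l_{ij}(\varphi), x_i, e] = \int_{t_1}^{t_2}\!\!\int_{s_1}^{s_2} \mathcal{L}\, ds\, dt + \int_{t_1}^{t_2} \bigl[\bar{F}_i x_i + \bar{M}_i \varphi_i\bigr]_{s_1}^{s_2} dt - \int_{s_1}^{s_2} \bigl[\rho A\, \bar{v}_i x_i\bigr]_{t_1}^{t_2} ds - \int_{s_1}^{s_2} \rho\,\bigl[I_1 \bar\omega_1 \varphi_1 + I_2 \bar\omega_2 \varphi_2 + J \bar\omega_3 \varphi_3\bigr]_{t_1}^{t_2} ds \qquad (24)$$

where $\mathcal{L}$ is the action density function

$$\mathcal{L} = \tfrac{1}{2}\rho A\, \dot{x}_i \dot{x}_i + \tfrac{1}{2}\rho\,(I_1\omega_1^2 + I_2\omega_2^2 + J\omega_3^2) - W(e, k_i) + \lambda_i\,[l_{i3}(1+e) - x'_i] + f_i^x x_i + m_i^y \varphi_i \qquad (25)$$

and $\bar{F}_i^1$, $\bar{M}_i^1$ and $\bar{F}_i^2$, $\bar{M}_i^2$ are the applied forces and moments at ends $s_1$ and $s_2$ respectively.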


Also $\bar{v}_i^{1,2}$ and $\bar\omega_i^{1,2}$ are the initial and final linear and angular velocities. The strain energy
function $W$ depends upon the kinematical variables $e$ and $k_i$. The precise nature of this
dependence is the constitutive relations that we seek and is a consequence of the
stationarity of $H$. The functions $\lambda_i$ are the Lagrange multipliers that allow the constraints
(12) to be incorporated within the Hamiltonian. As a result $x'_i$ and $l_{ij}$ can be regarded as
independent variables. Additionally the constraint (12) implies the definition (2)-(3) for
the strain $e$ and hence in (24) $e$ can also be viewed as an independent variable. To see this
we need to note that if each side of (12) is multiplied by itself we obtain
$x'_i x'_i = (1+e)^2 l_{i3} l_{i3} = (1+e)^2$, which is a restatement of (2)-(3). The terms $f_i^x x_i$ and $m_i^y \varphi_i$ in (25)
represent the density of the potential of the applied forces and moments on the rod. As
stated in (6) the angles $\varphi_i$ are the rotations from $E_i$ to $e_i$.

With these preliminaries we note that the Euler equation corresponding to variations
$\delta x_i$ is simply $\lambda'_i + f_i^x = \rho A\, \ddot{x}_i$, which when compared with (22) reveals that $\lambda_i = F_i^x$. Next
considering the variations with respect to $e$ we obtain

$$F_3 = \frac{\partial W}{\partial e} \qquad (26)$$

which is the constitutive relationship determining the axial force $F_3$ as the derivative of
strain energy with respect to axial strain $e$.


We now turn to the Euler equation corresponding to the variation $\delta\varphi_1$. For this
purpose we note that $l_{ij}$, $\omega_i$ and $k_i$ depend on $\varphi_1$. For orientation angles of the cross
section we select the sequence of body rotations first $\theta_2 \mathbf{e}_2$, second $\theta_3 \mathbf{e}_3$ and the third $\theta_1 \mathbf{e}_1$
with $\theta_1 = \varphi_1$. In this sequence the last rotation is through $\varphi_1$ with respect to which
variation is sought. The matrix $l$ of the direction cosines is given by


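$$l = B(\theta_2)\,C(\theta_3)\,A(\theta_1) = \begin{bmatrix} c_2c_3 & -c_1c_2s_3 + s_1s_2 & s_1c_2s_3 + c_1s_2 \\ s_3 & c_1c_3 & -s_1c_3 \\ -s_2c_3 & c_1s_2s_3 + s_1c_2 & -s_1s_2s_3 + c_1c_2 \end{bmatrix} \tag{27}$$

where $c_i = \cos\theta_i$, $s_i = \sin\theta_i$,

$$A(\theta) = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & -\sin\theta \\ 0 & \sin\theta & \cos\theta \end{bmatrix}, \qquad B(\theta) = \begin{bmatrix} \cos\theta & 0 & \sin\theta \\ 0 & 1 & 0 \\ -\sin\theta & 0 & \cos\theta \end{bmatrix} \tag{28}$$

$$C(\theta) = \begin{bmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{bmatrix} \tag{29}$$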
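
The product in (27) can be checked numerically. The following is a minimal sketch, assuming the elementary rotation matrices are read as $A$ (rotation about axis 1), $B$ (about axis 2) and $C$ (about axis 3) as in (28)-(29); the function names are illustrative and not part of the original paper.

```python
import math


def A(t):
    """Elementary rotation about axis 1, as read from (28)."""
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]


def B(t):
    """Elementary rotation about axis 2, as read from (28)."""
    c, s = math.cos(t), math.sin(t)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]


def C(t):
    """Elementary rotation about axis 3, as read from (29)."""
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def matmul(P, Q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def direction_cosines(t1, t2, t3):
    """l = B(theta2) C(theta3) A(theta1), the rotation sequence used for (27)."""
    return matmul(matmul(B(t2), C(t3)), A(t1))


def closed_form(t1, t2, t3):
    """Entries of (27) written out in terms of c_i = cos(theta_i), s_i = sin(theta_i)."""
    c1, s1 = math.cos(t1), math.sin(t1)
    c2, s2 = math.cos(t2), math.sin(t2)
    c3, s3 = math.cos(t3), math.sin(t3)
    return [
        [c2 * c3, -c1 * c2 * s3 + s1 * s2, s1 * c2 * s3 + c1 * s2],
        [s3, c1 * c3, -s1 * c3],
        [-s2 * c3, c1 * s2 * s3 + s1 * c2, -s1 * s2 * s3 + c1 * c2],
    ]


def max_abs_difference(P, Q):
    """Largest entrywise difference between two 3x3 matrices."""
    return max(abs(P[i][j] - Q[i][j]) for i in range(3) for j in range(3))
```

Evaluating `max_abs_difference(direction_cosines(t1, t2, t3), closed_form(t1, t2, t3))` for arbitrary angles should return a value at round-off level if the entries of (27) are transcribed correctly.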


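Subsequently, we find from (7) and (8)

$$\omega_1 = \dot\theta_2 s_3 + \dot\theta_1, \quad \omega_2 = \dot\theta_2 c_1c_3 + \dot\theta_3 s_1, \quad \omega_3 = -\dot\theta_2 s_1c_3 + \dot\theta_3 c_1 \tag{30}$$

$$k_1 = \theta_2' s_3 + \theta_1', \quad k_2 = \theta_2' c_1c_3 + \theta_3' s_1, \quad k_3 = -\theta_2' s_1c_3 + \theta_3' c_1 \tag{31}$$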
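
The dependence of the angular velocities on the last rotation angle can be checked by finite differences. The sketch below assumes (30) is read as $\omega_1 = \dot\theta_2 s_3 + \dot\theta_1$, $\omega_2 = \dot\theta_2 c_1 c_3 + \dot\theta_3 s_1$, $\omega_3 = -\dot\theta_2 s_1 c_3 + \dot\theta_3 c_1$ (the time derivatives are inferred where the scan is unclear); the helper names are hypothetical.

```python
import math


def omega(theta, theta_dot):
    """Body angular velocities for the rotation sequence 2-3-1, as read from (30)."""
    t1, _t2, t3 = theta
    d1, d2, d3 = theta_dot
    c1, s1 = math.cos(t1), math.sin(t1)
    c3, s3 = math.cos(t3), math.sin(t3)
    return (d2 * s3 + d1,
            d2 * c1 * c3 + d3 * s1,
            -d2 * s1 * c3 + d3 * c1)


def d_omega_d_theta1(theta, theta_dot, h=1e-6):
    """Central-difference derivative of (omega_1, omega_2, omega_3) w.r.t. theta_1."""
    t1, t2, t3 = theta
    plus = omega((t1 + h, t2, t3), theta_dot)
    minus = omega((t1 - h, t2, t3), theta_dot)
    return tuple((p - m) / (2.0 * h) for p, m in zip(plus, minus))
```

With this reading, the numerical derivative reproduces $\partial\omega_1/\partial\theta_1 = 0$, $\partial\omega_2/\partial\theta_1 = \omega_3$ and $\partial\omega_3/\partial\theta_1 = -\omega_2$; the same identities hold for the curvatures $k_i$ in (31) because they have the same form with $\dot\theta_i$ replaced by $\theta_i'$.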




From  (24)  we  have 


f  3W  Ski  ,  9  /3W  3k  ^  {din  d  ,51  mil 

{"•^rr^  +  58(FEi^T)  +  (1+e)Fi 


y 

+ 


(32) 


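Noting that $\theta_1 = \varphi_1$ we have from (30) $\partial\omega_1/\partial\varphi_1 = 0$, $\partial\omega_2/\partial\varphi_1 = \omega_3$, $\partial\omega_3/\partial\varphi_1 = -\omega_2$.
Also from (31) $\partial k_1/\partial\varphi_1 = 0$, $\partial k_2/\partial\varphi_1 = k_3$, $\partial k_3/\partial\varphi_1 = -k_2$ and from (27) $\partial l_{i3}/\partial\varphi_1 = -l_{i2}$.
Using these results and the inverse of (21a), (32) becomes

$$\left(\frac{\partial W}{\partial k_1}\right)' + k_2\frac{\partial W}{\partial k_3} - k_3\frac{\partial W}{\partial k_2} - (1+e)F_2 + m_1 = \rho I_1\left(\dot\omega_1 + \omega_2\omega_3\right) \tag{33}$$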


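In exactly the same manner one can proceed to determine the Euler equation for a
variation $\delta\varphi_2$. Now the consecutive sequence of body rotations $\theta_3 \mathbf{e}_3$, $\theta_1 \mathbf{e}_1$ and $\theta_2 \mathbf{e}_2$ is
selected with $\theta_2 = \varphi_2$. Without going into details one obtains

$$\left(\frac{\partial W}{\partial k_2}\right)' + k_3\frac{\partial W}{\partial k_1} - k_1\frac{\partial W}{\partial k_3} + (1+e)F_1 + m_2 = \rho I_2\left(\dot\omega_2 - \omega_1\omega_3\right) \tag{34}$$

For variation of $\varphi_3$ we adopt the consecutive sequence of body rotations $\theta_1 \mathbf{e}_1$, $\theta_2 \mathbf{e}_2$, $\theta_3 \mathbf{e}_3$
with $\theta_3 = \varphi_3$. The matrix of direction cosines is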


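$$l = A(\theta_1)\,B(\theta_2)\,C(\theta_3) = \begin{bmatrix} c_2c_3 & -c_2s_3 & s_2 \\ s_1s_2c_3 + c_1s_3 & -s_1s_2s_3 + c_1c_3 & -s_1c_2 \\ -c_1s_2c_3 + s_1s_3 & c_1s_2s_3 + s_1c_3 & c_1c_2 \end{bmatrix} \tag{35}$$

with angular velocities of the cross section and the curvatures given by

$$\omega_1 = \dot\theta_1 c_2c_3 + \dot\theta_2 s_3, \quad \omega_2 = \dot\theta_2 c_3 - \dot\theta_1 c_2s_3, \quad \omega_3 = \dot\theta_3 + \dot\theta_1 s_2 \tag{36}$$

$$k_1 = \theta_1' c_2c_3 + \theta_2' s_3, \quad k_2 = \theta_2' c_3 - \theta_1' c_2s_3, \quad k_3 = \theta_3' + \theta_1' s_2 \tag{37}$$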

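For this case $l_{i3}$ does not depend on $\theta_3$ and hence the Euler variational equation assumes
the form

$$\left(\frac{\partial W}{\partial k_3}\right)' + k_1\frac{\partial W}{\partial k_2} - k_2\frac{\partial W}{\partial k_1} + m_3 = \rho\left[J\dot\omega_3 + (I_2 - I_1)\,\omega_1\omega_2\right] \tag{38}$$

The specified boundary conditions at $s = s_1, s_2$ must be consistent with

$$\left[(F_i - \bar F_i)\,\delta x_i + (M_i - \bar M_i)\,\delta\varphi_i\right]_{s_1}^{s_2} = 0 \tag{39}$$

for arbitrary and independent variations $\delta x_i$ and $\delta\varphi_i$. Similar restrictions are imposed on
initial and final data, i.e.

$$\left\{\rho A(\dot x_i - \bar v_i)\,\delta x_i + \rho\left[I_1(\omega_1 - \bar\omega_1)\,\delta\varphi_1 + I_2(\omega_2 - \bar\omega_2)\,\delta\varphi_2 + J(\omega_3 - \bar\omega_3)\,\delta\varphi_3\right]\right\}_{t_1}^{t_2} = 0 \tag{40}$$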


Comparison of equations (33), (34) and (38) with equations (20a,b,c) respectively,
establishes the constitutive relations

$$M_i = \frac{\partial W}{\partial k_i}, \qquad i = 1, 2, 3 \tag{41}$$


A  STRAIN  ENERGY  FUNCTION 

In order to gain an insight into the nature of the strain energy function we consider the
strain of the lines and angles in the cross section of the rod. For this purpose we invoke the
Kirchhoff hypothesis which assumes that the plane cross sections of the rod that are normal to
the axial direction in the unstrained state remain normal to the strained axial direction
during deformation. Therefore the position vector to a material point in the cross-section
before and after deformation can be given by

$$\mathbf{R} = \mathbf{X}(s) + y_1\mathbf{E}_1(s) + y_2\mathbf{E}_2(s) \tag{42}$$

and

$$\mathbf{r} = \mathbf{x}(s) + \alpha(s)\left[y_1\mathbf{e}_1(s) + y_2\mathbf{e}_2(s)\right] \tag{43}$$

respectively. The parameter $\alpha(s)$ is to be fixed by enforcing traction-free boundary
conditions on the lateral surface of the rod.

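Using the concept of extensional strains for stretching of line elements and distortion of
angles between perpendicular lines as shear strains (Wempner, 1991), we define
components of strain by

$$\epsilon_{ij} = \tfrac12\left(\mathbf{g}_i \cdot \mathbf{g}_j - \mathbf{G}_i \cdot \mathbf{G}_j\right) \tag{44}$$

where

$$\mathbf{g}_1 = \frac{\partial\mathbf{r}}{\partial y_1} = \alpha\mathbf{e}_1, \quad \mathbf{g}_2 = \frac{\partial\mathbf{r}}{\partial y_2} = \alpha\mathbf{e}_2, \quad \mathbf{g}_3 = \frac{\partial\mathbf{r}}{\partial s} = \alpha' y_1\mathbf{e}_1 + \alpha' y_2\mathbf{e}_2 + \alpha y_1\mathbf{e}_1' + \alpha y_2\mathbf{e}_2' + (1+e)\mathbf{e}_3 \tag{45}$$

$$\mathbf{G}_1 = \frac{\partial\mathbf{R}}{\partial y_1} = \mathbf{E}_1, \quad \mathbf{G}_2 = \frac{\partial\mathbf{R}}{\partial y_2} = \mathbf{E}_2, \quad \mathbf{G}_3 = \frac{\partial\mathbf{R}}{\partial s} = y_1\mathbf{E}_1' + y_2\mathbf{E}_2' + \mathbf{E}_3 \tag{46}$$

Using (8) we can establish

$$\mathbf{e}_i' \cdot \mathbf{e}_j = \varepsilon_{ijk}\,k_k \tag{47}$$

$$\mathbf{E}_i' \cdot \mathbf{E}_j = \varepsilon_{ijk}\,K_k \tag{48}$$

where $\varepsilon_{ijk}$ is the permutation symbol and $K_i$ are the curvatures and twist in the unstrained state. Therefore (9) yields as the
strain components

$$\epsilon_{11} = \epsilon_{22} = \tfrac12(\alpha^2 - 1), \qquad \epsilon_{12} = 0$$

$$\epsilon_{13} = \tfrac12\left[\alpha(\alpha' y_1 - \alpha y_2 k_3) + y_2 K_3\right]$$

$$\epsilon_{23} = \tfrac12\left[\alpha(\alpha' y_2 + \alpha y_1 k_3) - y_1 K_3\right]$$

$$\epsilon_{33} = e + \tfrac12 e^2 - y_1\left[(1+e)\alpha k_2 - K_2\right] + y_2\left[(1+e)\alpha k_1 - K_1\right] - y_1y_2\left(\alpha^2 k_1k_2 - K_1K_2\right) + \tfrac12 y_1^2\left[\alpha'^2 + \alpha^2(k_2^2 + k_3^2) - K_2^2 - K_3^2\right] + \tfrac12 y_2^2\left[\alpha'^2 + \alpha^2(k_1^2 + k_3^2) - K_1^2 - K_3^2\right] \tag{49}$$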

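For a linear isotropic elastic material the non-zero stress components per unit strained
area are

$$\sigma^{11} = (\lambda + G)(\alpha^2 - 1) + \lambda\epsilon_{33}, \qquad \sigma^{12} = 0$$

$$\sigma^{22} = (\lambda + G)(\alpha^2 - 1) + \lambda\epsilon_{33}, \qquad \sigma^{23} = 2G\epsilon_{23}$$

$$\sigma^{33} = \lambda(\alpha^2 - 1) + (\lambda + 2G)\epsilon_{33}, \qquad \sigma^{31} = 2G\epsilon_{31} \tag{50}$$

where $\lambda$ and $G$ are Lame's constant and the shear modulus, respectively. The traction per
unit undeformed area is then given by

$$\mathbf{t}^j = \sigma^{ji}\,\mathbf{g}_i \tag{51}$$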

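One can define the axial stress resultant $F_3$ by

$$F_3 = \int_A \mathbf{t}^3 \cdot \mathbf{e}_3\, dA \approx A(1+e)\left[(\lambda+2G)(e + \tfrac12 e^2) + \lambda(\alpha^2 - 1)\right] + (\lambda+2G)\Big\{ I_1\left[(1+e)\alpha k_1 - K_1\right]\alpha k_1 + I_2\left[(1+e)\alpha k_2 - K_2\right]\alpha k_2 + \tfrac12 I_1(1+e)\left(\alpha'^2 + \alpha^2k_1^2 - K_1^2 + \alpha^2k_3^2 - K_3^2\right) + \tfrac12 I_2(1+e)\left(\alpha'^2 + \alpha^2k_2^2 - K_2^2 + \alpha^2k_3^2 - K_3^2\right)\Big\} \tag{52}$$

Similarly we have

$$M_1 = \int_A \alpha y_2\, \mathbf{t}^3 \cdot \mathbf{e}_3\, dA = \alpha(\lambda+2G)I_1\left[(1+e)\alpha k_1 - K_1\right](1+e) + I_1\alpha^2 k_1\left[\lambda(\alpha^2 - 1) + (\lambda+2G)(e + \tfrac12 e^2)\right] \tag{53}$$

$$M_2 = -\int_A \alpha y_1\, \mathbf{t}^3 \cdot \mathbf{e}_3\, dA = \alpha(\lambda+2G)I_2\left[(1+e)\alpha k_2 - K_2\right](1+e) + I_2\alpha^2 k_2\left[\lambda(\alpha^2 - 1) + (\lambda+2G)(e + \tfrac12 e^2)\right] \tag{54}$$

$$M_3 = \int_A \alpha\left(y_1\, \mathbf{t}^3 \cdot \mathbf{e}_2 - y_2\, \mathbf{t}^3 \cdot \mathbf{e}_1\right) dA = JG\alpha^2\left(\alpha^2 k_3 - K_3\right) + J\alpha^2 k_3\left[\lambda(\alpha^2 - 1) + (\lambda+2G)(e + \tfrac12 e^2)\right] \tag{55}$$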


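One may note that the integrability conditions

$$\frac{\partial M_1}{\partial k_2} = \frac{\partial M_2}{\partial k_1}, \qquad \frac{\partial M_1}{\partial k_3} = \frac{\partial M_3}{\partial k_1}, \qquad \frac{\partial M_2}{\partial k_3} = \frac{\partial M_3}{\partial k_2} \tag{56}$$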


are  satisfied.  Hence  existence  of  a  strain  energy  function  is  assured  and  by  integration  we 
have 

W  =  e3(l+e+f^)  -  AA(l-a3)(e+je3) 


4-AIick3(q!3— 1)^1  +  (A+2G)Iioki|^— ok  i—(l+e)Ki 
2 

+AI2a3(a3— 1)^2  +  (A+2G)I2ak2  ak2—(l+e)K2 


A+2G 


T 


{l+e): 


2  __2 


2 _ 2. 


Ii(a'3  +  a2k  i— K  t)  +  I2(a'2  +  a2kr-K2) 


+AJo3(ai-l)|i  +  ^+2GJ(e+ie3)(a3k;-K3)  +  ^GJ^akj-Ks)3 


3  xA  .  1, 


(57) 


For an initially straight rod $K_i$ should be set equal to zero. The above form of $W$ reflects
material isotropy, i.e. $W(e, k_1, k_2, k_3) = W(e, k_2, k_1, k_3)$, provided that $I_1 = I_2$.

We note that positive curvatures imply positive bending moments and conversely negative
curvatures imply negative bending moments provided that $e > (-1 + 1/\sqrt{3})$ for $\alpha = 1$. This
shows that equations (53) have a limited range of validity if the sense correspondence
between moments and curvatures is to be retained. We also note the second order coupling
between the squares of the curvatures and the axial strain $e$ in (52), which implies that
axial force can be generated by bending or twist only.
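
The strain bound quoted above can be reproduced from the bending moment expression. As a sketch, assume (53) is read with $\alpha = 1$ and $K_1 = 0$, so that $M_1 = (\lambda+2G) I_1 k_1 \left[(1+e)^2 + e + \tfrac12 e^2\right]$; the moment then has the same sense as the curvature exactly when the bracketed factor is positive. The function names below are illustrative only.

```python
import math


def bending_factor(e):
    """Bracketed factor multiplying (lambda + 2G) * I_1 * k_1 in M_1 for alpha = 1, K_1 = 0."""
    return (1.0 + e) ** 2 + e + 0.5 * e * e


# Larger root of 1 + 3e + (3/2)e^2 = 0, i.e. the bound e > -1 + 1/sqrt(3).
E_CRITICAL = -1.0 + 1.0 / math.sqrt(3.0)


def moment_follows_curvature(e):
    """True when a positive curvature produces a positive bending moment."""
    return bending_factor(e) > 0.0
```

The factor expands to $1 + 3e + \tfrac32 e^2$, whose larger root is $e = -1 + 1/\sqrt{3} \approx -0.42$, so compressive axial strains beyond roughly 42 percent would reverse the sense correspondence under this reading.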

The parameter $\alpha(s)$ depends on the boundary conditions applied at the lateral surface of
the rod. If the lateral surface is fixed, then $\alpha(s) = 1$. For zero tractions on the lateral
surface, a condition appropriate for thin flexible rods is adopted according to which the
average of $\sigma^{11}$ and $\sigma^{22}$ over the cross-section should vanish, i.e.

$$\int_A \sigma^{11}\, dA = \int_A \sigma^{22}\, dA = 0 \tag{58}$$

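The above condition reduces to

$$\Pi(\alpha, k_i, e) = \alpha'^2 J + \alpha^2\left(I_1k_1^2 + I_2k_2^2 + Jk_3^2 + \frac{2A(\lambda+G)}{\lambda}\right) - \left(I_1K_1^2 + I_2K_2^2 + JK_3^2\right) + 2Ae\left(1 + \tfrac12 e\right) - \frac{2A(\lambda+G)}{\lambda} = 0 \tag{59}$$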

This equation should be interpreted as a differential equation for $\alpha(s)$ when $k_i$ and $e$ are
known. To achieve this (52) is solved in an iterative procedure in which at every step $k_i$
and $e$ are known. We begin by writing (52) as $\lim_{n\to\infty} \Pi(\alpha_n, k_i^n, e_n) = 0$ with $n = 0, 1, 2, 3,$

...  For  the  first  iteration  (n=0),  e<j  =  0,  k°  =  ,  ax  =  1,  and  the  six  equations  (15)  -  (16) 

after  using  (22),  (33)  and  (57)  contain  only  six  unknown  quantities  F,,  F2,  et  and  k-  when 

the  externally  applied  forces  and  moments  are  prescribed.  Solution  of  this  set  of  equations 

enables  one  to  use  k{  in  the  curvature-orientation  angle  relations  such  as  (31)  to  determine 

the  latter  i.e.  9\.  Now  ly  (0^)  are  known  and  one  proceeds  to  determine  yn  from  (21)  and 
Xj  from  (9)  using  the  appropriate  boundary  conditions.  The  solution  for  the  first  iteration 

is  complete.  One  enters  (52)  with  ei  and  k-  and  computes  a2  and  the  iteration  proceeds. 
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VISCOHYPERELASTICITY 
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ABSTRACT 
A recently developed [1,2,3,4] internal solid theory for rubber
viscoelasticity is reviewed in the context of finite hyperelasticity.
Viscohyperelasticity, in which time dependent reference geometries are used
with internal solids having hyperelastic energy functions, is developed. The
material constants for the internal solids are determined by tensile, shear
and biaxial tests. Computational aspects of the theory and its inclusion
into finite element analysis are discussed here. Previous finite element
models [1,2,3,4] are improved with a predictor-corrector scheme. Time dependent
pressure loads are applied to the interior of a thick walled rubber cylinder
to demonstrate the use of viscohyperelasticity.

INTRODUCTION 

The  development  of  useful  computational  models  for  rubber  vulcanizates 
requires  interactions  between  experimentalists  and  analysts.  These 
interactions  often  involve  modeling  the  performance  of  rubber  subjected  to 
dynamic  strains.  Many  studies  have  been  done  to  analyze  the  case  of  small 
dynamic  strains  about  a  large  strain  deformed  elastic  state.  Although  the 
information  obtained  from  such  studies  has  proven  valuable  for  the  design 
process,  it  does  not  preclude  the  need  for  a  capability  to  measure  and 
predict  large  strain  dynamic  data.  Viscoelastic  models  valid  for  large 

strains  have  been  under  development  for  about  forty  years.  Typical 

weaknesses  of  these  models  are  a  lack  of  standard  laboratory  tests  to 
determine  their  parameters  and  their  inability  to  predict  a  wide  range  of 
large  strain  dynamic  test  data. 


History integral formulations for modeling the viscoelastic response of
elastomers have been used by Bernstein, Kearsley and Zapas [5] and McGuirt and
Lianis [6,7]. Their efforts consisted of applying the theory of Green and
Rivlin [8] to the viscoelastic deformations of both low and high percentage
crosslinked elastomers. In both cases they demonstrated that only a few
parameters are needed to model most materials of interest. However, the
history integral method has not proved attractive for finite element
algorithms due to its large storage and computational requirements. The fact
that only a few parameters were needed to model viscoelastic effects in the
rubberlike materials they tested implies that alternative models may also be
useful. Finite element viscoelasticity algorithms using Kelvin elements
(valid for small strains) were developed by Zienkiewicz, Watson and King [9],
and Carpenter [10]. These algorithms avoided the use of the history integral
method, are computationally attractive and have given acceptable accuracy for
small strains.

In this paper we review viscohyperelasticity [1,2,3,4] and present a finite
element analysis of a thick viscoelastic rubber cylinder subjected to a
cyclic internal pressure loading. We also restrict this paper to the
essentials of the computational algorithm.

VISCOHYPERELASTICITY 

Background information on hyperelasticity, determining principal
stretches, strain invariants, etc. is available in many books (for example,
see Treloar [11], Ogden [12], or Green and Zerna [13]). The first new concept to
describe in viscohyperelasticity is that of a changing reference shape. This
can be seen by considering the step-strain relaxation of a two network system
consisting of chemically crosslinked molecules and entangled molecules as
shown in Figure 1. The sides of the rectangles represent the chemically
crosslinked molecules which are not damaged during deformation. The diagonal
represents a molecule which is rigidly bonded at its ends to the chemically
crosslinked network and the loop on the diagonal represents an entanglement
which can slip when stressed. During the step-strain relaxation the stress
due to the deformation of the chemically crosslinked system (square →
rectangle) remains constant in time. It is determined from a hyperelastic
energy function. The stress, σ in Figure 1, due to the deformation of the
entangled molecule (or a molecule which breaks and reforms continuously in
time) decreases in time since the reference shape continuously changes until
it looks like the deformed shape (see the bottom of Figure 1). The second
concept is related to Green and Tobolsky's work [14]; it states that the force
driving the changes in reference shape is a function of the difference (in
the sense of a mapping function) between its current shape and the shape of
the chemically crosslinked network to which it is attached. The overview of
viscohyperelasticity follows.


Consider a collection of N solids, with reference shapes given by
coordinates $(X_{s1}, X_{s2}, X_{s3})$, where $s = 1, 2, \ldots, N$, and each with the same
deformed shape given by coordinates $(x_1, x_2, x_3)$. Let one-to-one
differentiable mappings between the reference shapes and the deformed shape
be indicated by

$$D_s = D_s(X_s, x) \tag{1}$$

where $X_s = (X_{s1}, X_{s2}, X_{s3})$, $x = (x_1, x_2, x_3)$; see Figure 2. The
mappings of equations (1) are used to determine the principal stretches for
each solid, $\lambda_s = (\lambda_{s1}, \lambda_{s2}, \lambda_{s3})$. Let the solids be isotropic and
hyperelastic with energy density functions given by

$$W_s = W_s(\lambda_s) \tag{2}$$

Then, for the case of incompressible solids, the principal Cauchy stresses
are given by

$$\tau_{si} = \lambda_{si}\frac{\partial W_s}{\partial\lambda_{si}} - p_s, \qquad i = 1, 2, 3 \tag{3}$$

where $p_s$ = the hydrostatic pressure which is determined either analytically
by the stress boundary conditions or numerically by the requirement that the
mapping of equation (1) represent an incompressible deformation. In
continuum mechanics the study of rubber elasticity (for incompressible
deformations) is focused on the mathematics of (1) and (3), and on the
physics and mathematics of (2).

Assume the stresses in (3) are viscoelastic. Then, let s = 1 represent
the long term (relaxed) hyperelastic solid whose reference geometry does not
change in time. That is, $\tau_{1i}$ = the relaxed stresses at a specified
deformation. Let the remaining solids have time dependent reference
geometries. We then have $X_s = X_s(t) = (X_{s1}(t), X_{s2}(t), X_{s3}(t))$ for
$s = 2, 3, \ldots, N$. The Cauchy stress vector in the time dependent deformed
body $x(t) = (x_1(t), x_2(t), x_3(t))$ from all of these solids is

$$\{\tau(t)\} = \{\tau_1(X_1, x(t))\} + \sum_{s=2}^{N}\{\tau_s(X_s(t), x(t))\} \tag{4}$$

where

$$\{\tau_s\} = \sum_{i=1}^{3}\tau_{si}\{e_i\} \qquad \text{with } s = 1, 2, 3$$

and $\{e_i\}$ = the Cartesian unit vector for the i'th principal
Cauchy stress.

We will have a viscoelastic model if $X_s$ varies in time such that $\tau(t)$
relaxes. That is, such that $\tau_{si}(t) \to 0$ for $s = 2, \ldots, N$ when x(t) = a
constant shape.

The viscohyperelastic model simulates relaxation as follows. At any
instant of time let the deformed state x(t) be the relaxed state of $X_s(t)$ for
$s = 2, \ldots, N$. That is, if x(t) is held constant at $x_c$ then require $X_s(t) \to x_c$
as $t \to \infty$. This represents a kinematical relaxation process for which

$$\tau_{si}(X_s(t), x_c) \to \tau_{si}(x_c, x_c) = 0 \quad \text{as } t \to \infty \tag{5}$$

Then, from (4), $\{\tau(t)\} \to \{\tau_1(X_1, x_c)\}$ = the long term hyperelastic Cauchy
stress vector in the solid as $t \to \infty$.

At this point we note that there are different ways to enforce the
kinematical statement $X_s(t) \to x_c$. We have selected to drive the kinematics
with Cauchy stresses on the shapes $X_s(t)$. These stresses are generated as
follows. Let the inverse mapping of (1) be given by

$$d_s = d_s(x, X_s) = D_s^{-1}(X_s, x) \qquad \text{for } s = 2, \ldots, N \tag{6}$$

Use equations (6) to determine principal stretches for each inverse mapping,
$\hat\lambda_s = (\hat\lambda_{s1}, \hat\lambda_{s2}, \hat\lambda_{s3})$. Note, since we assumed the mapping (1) represents an
incompressible deformation then the inverse mapping of (6) also represents an
incompressible deformation. For each mapping (6) define hyperelastic energy
density functions

$$\hat W_s = \hat W_s(\hat\lambda_s) \tag{7}$$

Then, the principal Cauchy stresses on the geometry $X_s$ with respect to
geometry x as a reference are given by

$$\hat\tau_{si} = \hat\lambda_{si}\frac{\partial\hat W_s}{\partial\hat\lambda_{si}} - \hat p_s, \qquad i = 1, 2, 3 \tag{8}$$

where $\hat p_s$ = a hydrostatic pressure. The energy function $\hat W_s$ is ad hoc in
this model. The desire is to determine a $\hat W_s$ which will allow large strain
viscoelastic stress data to be modeled. Note, Maxwell models are also ad
hoc.


The stresses in (8) are of a hyperelastic form and the internal solid
model is completed by requiring that $X_s(t) \to x(t)$ by a relaxation of $\hat\tau_{si}$.
That is, determine $X_s(t)$ by integrating the following differential equation
which states that the rate of change of $\hat\tau_{si}$ per unit time is proportional
to $\hat\tau_{si}$

$$\eta_s\frac{\partial\hat\tau_{si}}{\partial t} = \eta_s\frac{\partial\hat\tau_{si}}{\partial X_s}\frac{dX_s}{dt} = -\hat\tau_{si} \tag{9}$$

Equation (9) is a nonlinear differential equation whose integral determines
the time dependent undeformed geometries. To implement the theory we must
measure stress relaxation data and use it to determine $W_1$, $W_s$, $\hat W_s$ and $\eta_s$ for
$s = 2, \ldots, N$. Below, we: (a) present a one dimensional example of the
theory in which the form of $\hat W_s$ is identical to the form of $W_s$, (b) review
the finite element implementation of viscohyperelastic theory and (c)
computationally demonstrate the method by applying a dynamic pressure loading
to the interior of a thick walled cylinder.


ONE  DIMENSIONAL  MODEL 

A one dimensional time dependent internal solid (network) is shown in
Figure 3. It is assumed that in all states the material is NeoHookean with a
shear modulus given by $\mu$. Its time dependent reference length is given by
L(t) and its deformed (actual) shape is given by x(t). Following Treloar [11],
at any time, t, the force required to hold the material at length x(t) is
given by


c 

f(t)  =  (I 


A 


1 

X 


do) 


where $\lambda = x(t)/L(t) = \lambda(t)$ = the internal solid's stretch ratio.

Next, we compute a force (Cauchy) on the current reference shape, L(t), by
computing a stretch $\hat\lambda(t) = L(t)/x(t)$. This stretch will go to unity as
$L(t) \to x(t)$. The force is given by


g(t) 


(ID 


The model is completed by relaxing g(t) with a time constant $\eta$ as follows

$$\eta\frac{\partial g}{\partial t} = \eta\frac{\partial g}{\partial\hat\lambda}\frac{d\hat\lambda}{dt} = -g \tag{12}$$

Given x(t) (enforced motion), equation (12) can be numerically integrated and
the time dependent viscous force for this internal solid (network) is then
given by equation (10). If f(t) is specified then a predictor-corrector
algorithm (described below) is useful for obtaining the simultaneous
solutions of both (10) and (12).
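
The one dimensional model can be illustrated with a step-strain relaxation test. The sketch below is not the authors' code: it assumes the neo-Hookean force law $\mu(\lambda - \lambda^{-2})$ for both f and g (the exact expressions (10) and (11) are not legible here), holds x fixed, and reads the relaxation law (12) at fixed x as $\eta\,(\partial g/\partial L)\,dL/dt = -g$, integrated with a simple explicit Euler step. All names and parameter values are hypothetical.

```python
import math


def neo_hookean_force(stretch, mu):
    """Assumed force law mu * (stretch - stretch**-2); an illustrative choice."""
    return mu * (stretch - stretch ** -2)


def neo_hookean_slope(stretch, mu):
    """Derivative of the assumed force law with respect to the stretch."""
    return mu * (1.0 + 2.0 * stretch ** -3)


def simulate_step_strain(x_hold=2.0, L0=1.0, mu=1.0, eta=1.0, dt=0.01, steps=500):
    """Hold the deformed length at x_hold and let the reference length L(t) relax."""
    L = L0
    times, lengths, f_hist, g_hist = [], [], [], []
    for n in range(steps + 1):
        lam = x_hold / L          # stretch of the internal solid
        lam_hat = L / x_hold      # inverse stretch driving the relaxation
        f = neo_hookean_force(lam, mu)
        g = neo_hookean_force(lam_hat, mu)
        times.append(n * dt)
        lengths.append(L)
        f_hist.append(f)
        g_hist.append(g)
        # Relaxation at fixed x: eta * (dg/dL) * dL/dt = -g
        dg_dL = neo_hookean_slope(lam_hat, mu) / x_hold
        L += -dt * g / (eta * dg_dL)
    return times, lengths, f_hist, g_hist
```

Under these assumptions the driving force g decays like $\exp(-t/\eta)$ while the reference length L(t) approaches the held length x, so the viscous force f relaxes toward zero, which is the qualitative behavior described for Figure 4.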

Figure  4  contains  a  typical  response  of  the  above  one  dimensional 
internal  solid  when  it  is  subjected  to  a  step-strain  relaxation  test.  A  more 
general  one  dimensional  model  is  easily  obtained  by  superimposing  a  number  of 
internal  solids  (different  energy  functions  and  relaxation  times)  with  a  long 
term  solid  (time  independent  reference  shape). 


FINITE  ELEMENT  ANALYSIS 

Consider the superposition of solids indicated in Figure 2 with finite
element discretizations as shown in Figure 5. Assuming inertial forces are
negligibly small, the time dependent potential energy of all the solids in
Figure 5 is given by

$$\Pi = \sum_e \Pi_e = \sum_{s=1}^{N}\left[W_s(X_s, x) + P_s(X_s, x)\right] - \varphi(x, t) \tag{13}$$

where $X_s$, $x$ = global vectors of nodal unknowns,
$W_s(X_s, x)$ = hyperelastic energy function for solid s,
$P_s(X_s, x)$ = penalty function to enforce incompressibility for solid s,
and $\varphi(x, t)$ = work done by applied forces.

The first variation of $\Pi$ in equation (13) yields

$$\delta\Pi = \left[\sum_{s=1}^{N} g_s - f\right]\delta x + \sum_{s=2}^{N} h_s\,\delta X_s \tag{14}$$

where

$$g_s = \frac{\partial}{\partial x}\left(W_s + P_s\right) = g_s(X_s, x), \qquad s = 1, 2, \ldots, N$$

$$h_s = \frac{\partial}{\partial X_s}\left(W_s + P_s\right) = h_s(X_s, x), \qquad s = 2, 3, \ldots, N$$

and

$$f = \frac{\partial\varphi}{\partial x} = f(x, t).$$

The variables $X_s$ can not be freely changed. They must change in accordance
with their relaxation equations; hence, $\delta X_s = 0$. Thus, the equations of
motion are given by setting the coefficient of $\delta x$ in equation (14) to zero.
These equations must be solved simultaneously with the relaxation equations.
Thus, the solutions are obtained by solving


$$\sum_{s=1}^{N} g_s(X_s, x) - f(x, t) = 0 \tag{15}$$

and

$$-\eta_s \hat K_s(x, X_s)\frac{\partial X_s}{\partial t} = \hat g_s, \qquad s = 2, 3, \ldots, N \tag{16}$$

where

$$\hat g_s = \frac{\partial}{\partial X_s}\left(\hat W_s + \hat P_s\right), \qquad \hat K_s = \frac{\partial^2}{\partial X_s^2}\left(\hat W_s + \hat P_s\right),$$

$\hat W_s(x, X_s)$ = a hyperelastic energy function for the relaxation of $X_s$,
and $\hat P_s(x, X_s)$ = a penalty function to enforce incompressibility on $\hat W_s$.

The integration of equations (15) and (16) can be accomplished with the
following predictor-corrector scheme. Let $x(n)$, $X_s(n)$ and $x(n+1)$, $X_s(n+1)$ be
the configurations shown in Figure 5 at times $t_n$ and $t_{n+1}$. Also let
$\Delta t = t_{n+1} - t_n$ and let $x^{(k)}(n+1)$, $X_s^{(k)}(n+1)$ be estimates of $x(n+1)$, $X_s(n+1)$ at
iteration k. We make a trapezoidal prediction for equation (16) and a
Newton-Raphson correction of equation (15) as follows.

A. Initialize $X_s(n+1)$ with an Euler step on the relaxation equation.

$$\Delta X_s(n) = -\frac{\Delta t}{\eta_s}\left[\hat K_s(x(n), X_s(n))\right]^{-1}\hat g_s(x(n), X_s(n)) \tag{17}$$

$$X_s^{(0)}(n+1) = X_s(n) + \Delta X_s(n) \tag{18}$$

B. Initialize $x(n+1)$

$$x^{(0)}(n+1) = x(n) \tag{19}$$

Note: As vectors x(n) are collected equation (19) can be improved with
conditional extrapolations using previous states x(n-1), x(n-2), etc.

C. Compute equilibrium at $t_{n+1}$.

$$G^{(k)}(n+1) = \sum_{s=1}^{N} g_s\left(X_s^{(k)}(n+1), x^{(k)}(n+1)\right) - f\left(x^{(k)}(n+1), t_{n+1}\right) \tag{20}$$

D. Check for convergence.

$$\left\| G^{(k)}(n+1) \right\| < \epsilon \tag{21}$$

E. If converged, go to the next time step, equation (17); if not
converged, update $x(n+1)$ using a Newton-Raphson step on the equilibrium
equation.

$$x^{(k+1)}(n+1) = x^{(k)}(n+1) - \left[K_G^{(k)}(n+1)\right]^{-1} G^{(k)}(n+1) \tag{22}$$

where

$$K_G^{(k)}(n+1) = \sum_{s=1}^{N} K_s\left(X_s^{(k)}(n+1), x^{(k)}(n+1)\right) - K_f\left(x^{(k)}(n+1), t_{n+1}\right),$$

$$K_s = \frac{\partial^2}{\partial x^2}\left(W_s + P_s\right) \quad \text{and} \quad K_f = \frac{\partial f}{\partial x}.$$

F. Finally, we make a trapezoidal update to $X_s(n+1)$ using $x^{(k+1)}(n+1)$
and the relaxation equation

$$\Delta X_s(n+1) = -\frac{\Delta t}{\eta_s}\left[\hat K_s\left(x^{(k+1)}(n+1), X_s^{(k)}(n+1)\right)\right]^{-1}\hat g_s\left(x^{(k+1)}(n+1), X_s^{(k)}(n+1)\right)$$

$$X_s^{(k+1)}(n+1) = X_s(n) + \tfrac12\left(\Delta X_s(n) + \Delta X_s(n+1)\right)$$

G. Go to equation (20).
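
Steps A through G can be sketched for a scalar problem. The example below is an illustration under stated assumptions, not the finite element code of the paper: it uses one long term solid with fixed reference length and one internal solid with relaxing reference length L, an assumed neo-Hookean force law $\mu(\lambda - \lambda^{-2})$ for both the equilibrium forces and the relaxation force, and a load that depends on time only (so that $K_f = 0$). The function names and parameter values are hypothetical.

```python
def force(stretch, mu):
    """Assumed neo-Hookean force law mu * (stretch - stretch**-2)."""
    return mu * (stretch - stretch ** -2)


def slope(stretch, mu):
    """Derivative of the assumed force law with respect to the stretch."""
    return mu * (1.0 + 2.0 * stretch ** -3)


def relaxation_increment(x, L, mu, eta, dt):
    """Scalar analogue of (17): -(dt / eta) * g_hat / K_hat, with g_hat = force(L / x)."""
    g_hat = force(L / x, mu)
    k_hat = slope(L / x, mu) / x   # derivative of g_hat with respect to L
    return -(dt / eta) * g_hat / k_hat


def creep_under_load(load, mu_long=1.0, mu_int=1.0, eta=2.0,
                     dt=0.05, steps=800, tol=1e-10, max_iter=50):
    """Predictor-corrector time stepping for a prescribed load history load(t)."""
    L_long = 1.0          # fixed reference length of the long term solid
    x, L = 1.0, 1.0       # deformed length and relaxing reference length
    history = [x]
    for n in range(steps):
        t_next = (n + 1) * dt
        dL_n = relaxation_increment(x, L, mu_int, eta, dt)    # step A, (17)
        L_k = L + dL_n                                        # (18)
        x_k = x                                               # step B, (19)
        for _ in range(max_iter):
            # Step C: equilibrium residual, scalar analogue of (20)
            G = (force(x_k / L_long, mu_long)
                 + force(x_k / L_k, mu_int) - load(t_next))
            if abs(G) < tol:                                  # step D, (21)
                break
            # Step E: Newton-Raphson update of x, scalar analogue of (22)
            K_G = (slope(x_k / L_long, mu_long) / L_long
                   + slope(x_k / L_k, mu_int) / L_k)
            x_k -= G / K_G
            # Step F: trapezoidal update of the reference length
            dL_next = relaxation_increment(x_k, L_k, mu_int, eta, dt)
            L_k = L + 0.5 * (dL_n + dL_next)
        x, L = x_k, L_k
        history.append(x)
    return history
```

Calling `creep_under_load(lambda t: 1.0)` applies a constant load: the first step gives the instantaneous response of both solids acting together, after which the deformed length creeps toward the static solution of the long term solid alone as the internal solid's reference length catches up.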


The above integration algorithm has the desirable feature of maintaining
the volume constraint on all solids as accurately as numerically possible at
each time step (given that we have a penalty formulation). We now determine
the large time dependent displacement of an inflated thick walled cylinder.
The dimensions of the cylinder are shown in Figure 6. It is assumed that the
cylinder does not deform in the axial direction. Although this problem can
be solved with a one dimensional model we use it to test the axisymmetric
finite element code (check for symmetry, etc.). We model the long term
hyperelastic response with solid #1 and the Mooney energy function by Oden
as modified for a penalty enforcement of the volume constraint by Fried and
Johnson. Two internal solids were used, one quick and one slow relaxing
solid. The energy function for each solid was (s = solid #)


$$W_s = A_s\left(I_{1s} - 3I_{3s}^{1/3}\right) + B_s\left(I_{2s} - 3I_{3s}^{2/3}\right) + \lambda_s \ln^2\left(I_{3s}\right)$$

where $I_{1s}$, $I_{2s}$, and $I_{3s}$ are the strain invariants for each solid's mappings
(equation (1)). The material constants are listed in Table 1.

Table 1. Material constants for solids.

Solid #    A_s     B_s     λ_s         η_s
   1       80.     20.     50,000.     -
   2       20.      5.     12,500.     2.
   3       20.      5.     12,500.     10.
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Details  on  the  element  mappings,  the  expressions  for  the  work  performed 
during  inflation,  etc.,  that  are  not  provided  in  Reference^  are  included  in 
Reference^ . 

The pressure vs time curve for the first example is given in Figure 7.
In this case, the pressure was ramped, held constant, ramped, and then held
constant again. Figure 7 also shows the dynamic displacement of the inner
surface. The large strain creeping under constant loading is clear. Also,
the deformed mesh is compared to the two time dependent reference meshes at
both t = 30 sec and t = 75 sec. Figure 8 shows the pressure vs inner radius
displacement. The static solutions of Oden are approached in Figure 8 at
the end of the creeping (constant load).

A cyclic sawtooth pressure loading was applied in the second example.
The pressure-time curve is given in Figure 9. The inner radius vs time graph
demonstrates softening effects (the displacements continue to grow in each
cycle). Also, at t = 10 sec the time dependent reference geometries and the
deformed mesh are compared. Solid #2's reference shape closely follows the
deformed shape (fast relaxing material). That is, it is nearly in phase with
the deformed shape. Solid #3 (slow relaxing material) is out of phase with
the deformed shape in this example. The pressure vs displacement hysteresis
loops are shown in Figure 10 with the long term (quasistatic) solution of
Oden.


SUMMARY 

An internal solid model for large strain viscoelasticity in which
hyperelastic  solids  with  time  dependent  reference  shapes 
(viscohyperelasticity)  are  superimposed  was  reviewed.  The  time  dependent 
kinematics  are  driven  by  the  relaxation  of  hyperelastic  forces.  Like  a 
Maxwell  model  in  small  strain  theory,  both  creep  and  relaxation  behavior  can 
be  simulated.  Viscohyperelasticity,  however,  allows  for  the  viscous  forces 
to  be  dependent  on  both  the  large  strain  history  of  the  deformation  and  on 
the  current  large  strain  state  (through  the  hyperelastic  energy  functions 
selected  to  drive  the  kinematics).  The  inflation  of  a  thick  walled  cylinder 
was  determined  using  the  finite  element  method  to  demonstrate  the  theory. 
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Chemically  crosslinked  time  independent  network. 
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Figure  1.  Conceptual  model  of  step  strain  relaxation. 
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Figure  3.  One  dimensional  model  of  time  dependent  network. 
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internal  pressure  loading  of  a  thick-walled  cylinder. 
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Figure  9.  Deformed  meshes  at  t  =  10  sec,  cyclic  loading. 
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DISPLACEMENT  OF  INTERIOR  NOOE  (in) 

Figure 10. Pressure vs radial displacement of interior surface, cyclic loading.


MULTIGRID  ALGORITHMS  FOR  THE  FIRST  BIHARMONIC 
PROBLEM:  ROBUSTNESS* 

M.  R.  HANISCH 

MATHEMATICAL  SCIENCES  INSTITUTE 
CORNELL  UNIVERSITY 
ITHACA,  NY  14853 

Abstract. We consider the numerical solution of the first biharmonic problem. Numerous
discretizations proposed for this problem lead to large, sparse, and extremely ill-conditioned
linear systems AU = F. We shall describe certain fast solvers for such systems. In particular,
for the mixed method of Ciarlet and Raviart, and for the nonconforming finite element method
of Morley, we describe multigrid iterative solvers for the systems AU = F.

Theoretical results and practical computations have shown that the multigrid V-cycle and W-cycle
iterations work extremely well for second-order elliptic problems. However, the V-cycle and W-cycle
iterations may diverge when applied to the first biharmonic problem. It is nevertheless possible to
construct provably robust multigrid iterations for these fourth-order problems. From a single
"Variable V-cycle" iteration we construct a preconditioner B with the property that BA is uniformly
well-conditioned. Furthermore, BA is symmetric with respect to an appropriate inner product. The
linear systems AU = F may then be rapidly solved with a preconditioned conjugate gradient
iteration. The effectiveness of this strategy will be
illustrated  computationally.  For  example,  on  a  255x255  mesh  where  the  condition  number  of  A  may  exceed 
10®,  ,ve  obtain  BA  with  condition  number  typically  less  than  20. 


1. Introduction. In this paper we consider the behavior of multigrid iterations applied
to finite element schemes for the first biharmonic problem in the plane. The first
biharmonic problem models the bending of clamped elastic plates and, in fluid dynamics,
two-dimensional Stokes flow. We specifically consider the Ciarlet-Raviart mixed method
[11] and the nonconforming method of Morley [23]. Additional mixed methods, the method
of Herrmann and Miyoshi [16, 17, 22], and of Herrmann and Johnson [16, 17, 18], and the
method of Raviart and Thomas [25] for second-order problems, may similarly be considered;
cf. [15]. Each of these methods leads one to solve a sparse but extremely ill-conditioned
linear system. In the case of the Ciarlet-Raviart mixed method we obtain an indefinite
block matrix equation of the form


\[
\begin{pmatrix} \mathcal{Q} & \mathcal{D}^t \\ \mathcal{D} & 0 \end{pmatrix}
\begin{pmatrix} S_h \\ U_h \end{pmatrix}
=
\begin{pmatrix} G \\ -F \end{pmatrix}. \tag{1.1}
\]

Here $U_h$ and $S_h$ are vectors of approximate nodal values for the solution and its negative
Laplacian. For simplicity we shall take $G = 0$, in which case (1.1) corresponds to the
Dirichlet problem with homogeneous boundary data. Eliminating $S_h$ from (1.1), we obtain
an equation involving an ill-conditioned but typically symmetric positive definite Schur
complement

\[
\mathcal{A}U_h = [\mathcal{D}\mathcal{Q}^{-1}\mathcal{D}^t]\,U_h = F. \tag{1.2}
\]

With $U_h$ obtained from (1.2), one may then compute $S_h = -\mathcal{Q}^{-1}\mathcal{D}^t U_h$. The costly
computation of $\mathcal{Q}^{-1}$ may be avoided using a technique described in Section 2.1.
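
The elimination that leads from the block system to the Schur complement can be checked numerically. The following is only a sketch under stated assumptions: the matrices are random stand-ins (not finite element matrices), the sizes `n_s` and `n_u` are hypothetical, and the right-hand side of the block system is taken as (0, -F), the sign convention under which the reduced system reads (D Q^{-1} D^t) U = F.

```python
import numpy as np

rng = np.random.default_rng(0)
n_s, n_u = 6, 4                      # hypothetical sizes of S_h and U_h

# Q: symmetric positive definite stand-in for the Gram matrix.
G0 = rng.standard_normal((n_s, n_s))
Q = G0 @ G0.T + n_s * np.eye(n_s)
# D: full-rank rectangular stand-in for the discrete Dirichlet form.
D = rng.standard_normal((n_u, n_s))
F = rng.standard_normal(n_u)

# Indefinite block system with G = 0: [[Q, D^t], [D, 0]] [S; U] = [0; -F].
K = np.block([[Q, D.T], [D, np.zeros((n_u, n_u))]])
rhs = np.concatenate([np.zeros(n_s), -F])
S_block, U_block = np.split(np.linalg.solve(K, rhs), [n_s])

# Schur complement system: (D Q^{-1} D^t) U = F, then S = -Q^{-1} D^t U.
A = D @ np.linalg.solve(Q, D.T)
U = np.linalg.solve(A, F)
S = -np.linalg.solve(Q, D.T @ U)

print(np.allclose(U, U_block), np.allclose(S, S_block))
print(np.linalg.eigvalsh(A).min() > 0)   # the Schur complement is positive definite
```

Both routes give the same pair (S, U); the reduced matrix is symmetric positive definite even though the block matrix is indefinite.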


* This work was partly supported by the U.S. Army Research Office through the Mathematical Sciences
Institute of Cornell University.
It can be proved that a multigrid W-cycle iteration which uses a sufficiently large
but undetermined number of smoothing steps, a number we shall parametrize with m,
can be used to solve the Morley equations and also the Ciarlet-Raviart equations through
(1.2). The hypothesis that m is sufficiently large is common to many multigrid analyses.
For example, Brenner [9] has proved convergence of a W-cycle iteration for the Morley
equations when m is sufficiently large. And if m is sufficiently large, a W-cycle iteration
for the Ciarlet-Raviart normal equations, Af2 = ,V"CV\ can be proved to converge; see [24].
Furthermore, practical computations demonstrate that m = 1 is large enough for W-cycle
convergence for many problems. However, for the Morley and Ciarlet-Raviart equations,
larger values of m may be required for W-cycle convergence. A numerical example is given
in Section 4 for which W-cycle convergence is not guaranteed unless m > 8. As a value for
m sufficient for convergence cannot be specified a priori, W-cycle iterations are not robust
for these problems.

To obtain a robust multigrid scheme, we will construct a multigrid preconditioner $\mathcal{B}$ for
the Schur complement $\mathcal{A}$. The action of this preconditioner is computed by performing a
single iteration of a particular multigrid scheme. A multigrid preconditioner will similarly
be constructed for the symmetric positive definite Morley system. Conjugate gradient
iterations for the preconditioned problems, for example

\[
\mathcal{B}\mathcal{A}U_h = \mathcal{B}F, \tag{1.3}
\]

are known to converge with a rate which is bounded by a function of the condition number
of $\mathcal{B}\mathcal{A}$, $\kappa_2(\mathcal{B}\mathcal{A})$. The smaller the condition number, the better the bound, as we shall see in
(3.20) of Section 3.2. It will be shown that a “Variable V-cycle” multigrid preconditioner
yields problems (1.3) with small condition number, bounded independently of the mesh
diameter for a scale of underlying finite element meshes. This is an improvement over a
result of Braess and Peisker [4].

The lack of W-cycle robustness observed with the Morley and Ciarlet-Raviart methods
can be understood to be a consequence of certain features: namely, the nonnestedness of
the Morley finite element spaces and the noninheritedness of quadratic forms induced by
equations of the form (1.2). However, a multigrid analysis based on the theory of Bramble,
Pasciak, and Xu [8] can be provided in each case. Accordingly, an “Approximation and
Regularity” property must be proved.

We will assume that problems are posed on convex polygonal domains $\Omega$; i.e., $H^3$-regularity
is obtained for the first biharmonic problem. The multigrid analysis is extended
to nonconvex polygonal domains in [ioj. Finally, we note that in the “inherited form,
nested space” setting the W-cycle is provably robust; see [2. 3. (j. i!i. 20. ill.

This paper is arranged in the following manner. In Section 2, we outline the various
finite element methods considered for the first biharmonic problem. The multigrid
preconditioner is described in Section 3. In the final section, we present the results of several
computations.

Throughout this paper we use C to denote a generic positive constant which is
independent of the mesh parameter h.
2. Finite element methods. Given a convex polygonal domain $\Omega$ in $\mathbb{R}^2$ with boundary
$\partial\Omega$, the first biharmonic problem (with homogeneous boundary data) is:

\[
\Delta^2 u = f \ \text{ in } \Omega, \qquad u = \frac{\partial u}{\partial n} = 0 \ \text{ on } \partial\Omega, \tag{2.1}
\]

where $\Delta$ denotes the Laplacian operator and $\partial/\partial n$ is the normal derivative at the boundary
of $\Omega$. This problem provides a simple model for the displacement of a clamped elastic
plate, or for the stream function of a steady-state planar Stokes flow.

Recall the definition, for nonnegative integer $s$, of the Sobolev spaces containing functions
with square-integrable derivatives,

\[
H^s(\Omega) := \{ v \in L^2(\Omega) : D^\alpha v \in L^2(\Omega) \ \text{for } |\alpha| \le s \}, \tag{2.2}
\]

where $D$ denotes the distributional or weak derivative, and $\alpha$ is a multi-index. The Sobolev
space $H^s(\Omega)$ is a Hilbert space and we shall denote its norm by $\|\cdot\|_s$. We will refer to
additional spaces $H_0^s(\Omega)$ which may be defined as the completions of $C_0^\infty(\Omega)$ with respect
to the norms $\|\cdot\|_s$. (Denote by $C_0^\infty(\Omega)$ the space of infinitely differentiable functions with
compact support contained in $\Omega$.) A useful a priori inequality can be obtained for solutions
to (2.1) when $f$ is given in certain negative norm (dual) spaces $H^{-s}(\Omega) := [H_0^s(\Omega)]'$. In
particular, with $\Omega$ a convex polygonal domain and $f \in H^{-1}(\Omega)$, the unique existence of a
solution $u \in H^3(\Omega) \cap H_0^2(\Omega)$ to (2.1) is known and we have

\[
\|u\|_3 \le C\,\|f\|_{-1}, \tag{2.3}
\]

for a constant $C$ which is independent of $f$; see [13].

2.1. The Ciarlet-Raviart method. Introducing an auxiliary variable $\sigma$ such that

\[
\sigma = -\Delta u \quad \text{and} \quad -\Delta\sigma = f,
\]

and using a standard Green's formula, one may obtain a weak formulation for (2.1).

Find: $\{\sigma, u\} \in H^1(\Omega) \times H_0^1(\Omega)$ such that for $f \in H^{-1}(\Omega)$,

\[
\begin{cases}
(\sigma, v)_{L^2} - D(v, u) = 0 & \forall\, v \in H^1(\Omega), \\
-D(\sigma, w) = -(f, w) & \forall\, w \in H_0^1(\Omega),
\end{cases} \tag{2.4}
\]

where the “Dirichlet form”, $D(\cdot,\cdot)$, and the $L^2$-inner product are given by

\[
D(v, w) := \int_\Omega \nabla v \cdot \nabla w \, dx, \qquad (v, w)_{L^2} := \int_\Omega v\, w \, dx.
\]


This formulation was studied by Ciarlet and Raviart in [11]. It is not difficult to show
that the problem (2.4) has a unique solution $\{\sigma, u\} \in H^1(\Omega) \times H_0^1(\Omega)$ for all $f \in H^{-1}(\Omega)$.
Furthermore, $\sigma = -\Delta u$, and this same $u$ solves (2.1) in an appropriate sense.

Given a regular and quasi-uniform triangulation (in the sense of Ciarlet [10]) $\tau_h$ of $\Omega$,
with mesh diameter $h$, define finite dimensional subspaces $V_h \subset H^1(\Omega)$ and $M_h \subset H_0^1(\Omega)$
from the space

\[
S_h^m := \{ v \in C^0(\Omega) : v|_T \in P^{(m)}(T), \ \forall\, T \in \tau_h \}, \tag{2.5}
\]

where $P^{(m)}(T)$ is the space of polynomials of degree $m$ or less over triangle $T$. Set $V_h = S_h^m$
and $M_h = S_h^m \cap H_0^1(\Omega)$. The Ciarlet-Raviart mixed method approximates the solution
$\{\sigma, u\}$ of (2.4) with the unique solution to the following problem.

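Find: $\{\sigma_h, u_h\} \in V_h \times M_h$ such that for $f \in H^{-1}(\Omega)$,

\[
\begin{cases}
(\sigma_h, v)_{L^2} - D(v, u_h) = 0 & \forall\, v \in V_h, \\
-D(\sigma_h, w) = -(f, w) & \forall\, w \in M_h.
\end{cases} \tag{2.6}
\]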

Choosing bases $\{\phi^i\}$ for $V_h$ and $\{\psi^i\}$ for $M_h$, consider the linear system (2.6) in block
matrix form with the notations $[\mathcal{D}_h]_{ij} = -D(\phi^j, \psi^i)$, $[\mathcal{Q}_h]_{ij} = (\phi^i, \phi^j)_{L^2}$, $[F]_i = (f, \psi^i)$,
and denote the transpose of $\mathcal{D}_h$ by $\mathcal{D}_h^t$. Applying block Gaussian elimination we obtain
the reduced system for (the coefficients of) $u_h$

\[
\mathcal{D}_h \mathcal{Q}_h^{-1} \mathcal{D}_h^t\, U_h = F. \tag{2.7}
\]

As was noted in the Introduction, it is possible to avoid computing the action of the
inverted Gram matrix $\mathcal{Q}_h^{-1}$. In particular, we replace the $L^2$ inner product with a certain
approximating bilinear form $(\cdot,\cdot)_h$. Using the approximate form leads to the new problem.

Find: $\{\sigma_{d,h}, u_{d,h}\} \in V_h \times M_h$ such that for $f \in H^{-1}(\Omega)$,

\[
\begin{cases}
(\sigma_{d,h}, v)_h - D(v, u_{d,h}) = 0 & \forall\, v \in V_h, \\
-D(\sigma_{d,h}, w) = -(f, w) & \forall\, w \in M_h.
\end{cases} \tag{2.8}
\]

We then consider the associated reduced system, perhaps using different computational
bases,

\[
\mathcal{D}_{d,h} \mathcal{Q}_{d,h}^{-1} \mathcal{D}_{d,h}^t\, U_{d,h} = F, \tag{2.9}
\]

where $\mathcal{Q}_{d,h}$ is the Gram matrix associated with $(\cdot,\cdot)_h$. For quadratic Ciarlet-Raviart
spaces we take

\[
(u, v)_h = \sum_{T \in \tau_h} \sum_{\alpha=1}^{6} \omega_{\alpha,T}\, F_{\alpha,T}(u)\, F_{\alpha,T}(v), \tag{2.10}
\]

with $\alpha$ indexing the nodes and the midpoints of the edges of triangle $T$. When $\alpha$ selects
a node $n_\alpha$ we use a weight $\omega_{\alpha,T}$ = § area(T) and define the functional $F_{\alpha,T}(u) = u(n_\alpha)$.
If $\alpha$ indexes a midpoint on an edge of $T$ which joins node $n_{\alpha+}$ to node $n_{\alpha-}$, then we set
$\omega_{\alpha,T}$ = | area(T) and $F_{\alpha,T}(u) = u(n_\alpha) - \tfrac{1}{2}\,(u(n_{\alpha+}) + u(n_{\alpha-}))$. It is shown in [14] that the
problem (2.8) has a unique solution with approximation properties comparable to those of
$\{\sigma_h, u_h\}$, the Ciarlet-Raviart approximate solution. Specifically, we have

\[
\|u - u_{d,h}\|_1 \le C h^2 \|u\|_3 \quad \text{and} \quad \|\sigma - \sigma_{d,h}\|_0 \le C h \|u\|_3 .
\]

Furthermore, in [14] a basis $\{\phi^i\}$ for the space $V_h$ is constructed for which
$[\mathcal{D}_{d,h}]_{ij} = -D(\phi^j, \psi^i)$ is sparse and $[\mathcal{Q}_{d,h}]_{ij} = (\phi^i, \phi^j)_h$ is diagonal, and hence is trivially inverted.
2.2. The Morley method. A natural weak formulation of (2.1) seeks a function
$u \in H_0^2(\Omega)$ such that

\[
\int_\Omega \Delta u \, \Delta v \, dx = \int_\Omega f v \, dx \quad \forall\, v \in H_0^2(\Omega). \tag{2.11}
\]

To obtain a so-called conforming finite element method from this weak formulation requires
finite element spaces which are contained in $H_0^2(\Omega)$; this leads to the use of continuously
differentiable, hence complex, piecewise polynomial spaces.

An alternative nonconforming finite element method has been proposed by Morley [23].
Let $\tau_h$ denote a regular and quasi-uniform triangulation of the domain $\Omega$. The spaces of
Morley are defined so that $v \in M_h$ if and only if:

(a) $v$ restricted to each triangle $T \in \tau_h$ is a quadratic polynomial,

(b) $v$ is continuous at triangle vertices and vanishes at boundary vertices, and

(c) the normal derivative is continuous at triangle edge midpoints and vanishes at
midpoints along $\partial\Omega$.

The Morley method approximates the solution of (2.11) with the solution to the following
problem.

Find: $u_h \in M_h$ such that for $f \in L^2(\Omega)$,

\[
\sum_{T \in \tau_h} \int_T \Delta u_h \, \Delta v \, dx = \int_\Omega f v \, dx \quad \forall\, v \in M_h. \tag{2.12}
\]

Note that the elements of $M_h$ are not continuous at $\tau_h$ edge midpoints. It is not difficult
to see that if $\tau_{h/2}$ is constructed by joining edge midpoints in $\tau_h$, then $M_h$ is not contained
in $M_{h/2}$. Consequently, multigrid methods for Morley discretizations involve nonnested
spaces.

It can be shown that $\sum_{T \in \tau_h} \int_T \Delta(\cdot)\,\Delta(\cdot)\,dx$ is an inner product for the space $M_h$.
Consequently, (2.12) has a unique solution. The Morley approximation $u_h$ is known to
satisfy

\[
\|u - u_h\|_{2,h} \le C h \left[ \|u\|_3 + h \|f\|_0 \right],
\]

where $u$ denotes the solution of (2.1) (or (2.11)), and $\|v\|_{2,h}^2 := \sum_{T \in \tau_h} \int_T \Delta v \, \Delta v \, dx$.

Since $M_h$ contains discontinuous elements, the requirement $f \in L^2(\Omega)$ is necessary
for $\int_\Omega f v \, dx$ in (2.12) to make sense. Let $v^I$ denote the piecewise $\tau_h$-linear interpolant of
$v \in M_h$. Then the Morley method may be extended to allow $f \in H^{-1}(\Omega)$ (since $v^I \in H_0^1(\Omega)$)
as follows.

Find: $u_h \in M_h$ such that for $f \in H^{-1}(\Omega)$,

\[
\sum_{T \in \tau_h} \int_T \Delta u_h \, \Delta v \, dx = \int_\Omega f v^I \, dx \quad \forall\, v \in M_h.
\]

The unique solution $u_h$ of this problem is known to satisfy the error estimate [1]

\[
\|u - u_h\|_{2,h} \le C h \|u\|_3 .
\]

Finally, let $\{\phi^i\}$ denote a computational basis for the space $M_h$. We may write (2.12)
in matrix form

\[
\mathcal{M}_h U_h = F_h ,
\]

where $U_h$ is the coefficient vector for $u_h$ and $[F_h]_i = \int_\Omega f \phi^i \, dx$.
3. Multigrid Algorithms. In this section we will describe several multigrid approaches
and apply them to the finite element methods of the previous section. In particular,
we shall describe a multigrid W-cycle iteration which has as a parameter m the
number of (symmetrically applied) smoothing iterations computed during each iteration.
As this standard W-cycle will diverge unless m is larger than an (a priori) undetermined
value, we propose a second, more robust, multigrid approach. We show that an effective
multigrid preconditioner can be constructed, and that the action of this preconditioner can
be computed as the outcome of an ordinary “Variable V-cycle” multigrid iterative step. We
shall also show that the W-cycle is itself a very simple, in this case too simple, multigrid
preconditioned iteration.

3.1. Overview. Consider a nested sequence of quasi-uniform (in the sense of Ciarlet
[10]) triangulations of a domain $\Omega$ with mesh diameters $\{h_k\}_{k=1}^{j}$ satisfying a growth
condition

\[
h_{k-1} \le \delta\, h_k , \tag{3.1}
\]

and with $\delta$ independent of $k$. Beginning with a coarse triangulation $\tau_{h_1}$ of $\Omega$, such a
sequence may be obtained by joining the midpoints of the edges of mesh $\tau_{h_{k-1}}$ to form mesh
$\tau_{h_k}$. On each mesh we construct a Ciarlet-Raviart or a Morley space $M_{h_k}$ and consider the
associated finite element methods. (In the sequel, subscripts $h_k$ will be replaced by the
subscript $k$.) Given local bases, denoted by $\{\phi_k^i\}$, these methods lead to linear systems

\[
\mathcal{A}_k U_k = F_k . \tag{3.2}
\]

Here, $\mathcal{A}_k$ denotes either the matrix $\mathcal{D}_{d,k}\mathcal{Q}_{d,k}^{-1}\mathcal{D}_{d,k}^t$ or $\mathcal{M}_k$ of Section 2, and $U_k$ denotes the
coefficient vector of the approximate solution $u_{d,k}$, or $u_k$, for the Ciarlet-Raviart, or for
the Morley method, respectively. We wish to obtain for $k = j$ the solution to problem
(3.2), that is, the solution on the finest mesh.

Multigrid analyses commonly study quadratic forms associated with the matrices $\mathcal{A}_k$.
Let $U$ and $V$ denote coefficient vectors of $u$ and $v$, elements of $M_k$ (i.e., $v = \sum_i V_i \phi_k^i$).
We define

\[
A_k(u, v) = (\mathcal{A}_k U, V),
\]

where $(\cdot,\cdot)$ denotes the Euclidean inner product. Let $(\cdot,\cdot)_k$ denote inner products for the
spaces $M_k$ for which the induced norms $|||u|||_{0,k} = (u,u)_k^{1/2}$ are uniformly (with respect to
$k$) equivalent to the $L^2(\Omega)$ norm. The inner products $(\cdot,\cdot)_k$ could be $L^2$ inner products
and need not be related to the approximate $L^2$ inner products mentioned in Section 2.1.
For each space $M_k$, let $\mathbf{A}_k : M_k \to M_k$ denote the symmetric (self-adjoint with respect to
the inner product $(\cdot,\cdot)_k$) positive definite operator satisfying

\[
A_k(u, v) = (\mathbf{A}_k u, v)_k \quad \forall\, u, v \in M_k .
\]

(The self-adjointness of $\mathbf{A}_k$ is a consequence of the symmetry of $\mathcal{A}_k$.) With $f_k \in M_k$ defined
so that $(f_k, v)_k = (F_k, V) = \int_\Omega f v \, dx$ for all $v \in M_k$, we may rewrite the equations (3.2) in
an equivalent operator form

\[
\mathbf{A}_k u_k = f_k . \tag{3.3}
\]


Remark 1. The quadratic forms obtained for the Ciarlet-Raviart method have the
following useful representation,

\[
\begin{aligned}
A_k(u,u)^{1/2} &= (\mathcal{D}_{d,k}\mathcal{Q}_{d,k}^{-1}\mathcal{D}_{d,k}^t U, U)^{1/2}
 = (\mathcal{Q}_{d,k}\mathcal{Q}_{d,k}^{-1}\mathcal{D}_{d,k}^t U,\ \mathcal{Q}_{d,k}^{-1}\mathcal{D}_{d,k}^t U)^{1/2} \\
&= \sup_{v \in V_k \setminus \{0\}} \frac{(\mathcal{Q}_{d,k}(\mathcal{Q}_{d,k}^{-1}\mathcal{D}_{d,k}^t U), V)}{(\mathcal{Q}_{d,k} V, V)^{1/2}}
 = \sup_{v \in V_k \setminus \{0\}} \frac{D(v, u)}{(v, v)_{h_k}^{1/2}},
\end{aligned} \tag{3.4}
\]

making use of the fact that $(\mathcal{Q}_{d,k}\cdot,\cdot)$ induces an inner product for the space $V_k$. It is
not surprising then that these forms are “noninherited”, i.e., that for $k < j$ there is some
$u \in M_k \subset M_j$ such that $A_k(u,u) \ne A_j(u,u)$. □

To define a multigrid algorithm, assume further that one has linear mappings,
“prolongations”, $I_k : M_{k-1} \to M_k$. For example, if the space $M_{k-1}$ is contained in $M_k$, then
$I_k$ may be taken to be the natural injection operator. See [9] for a suitable $I_k$ in the
Morley setting. Let $\mathcal{I}_k$ denote the matrix for $I_k$ given in terms of the computational bases
for $M_k$ and $M_{k-1}$. It can be shown that the operator $P^0_{k-1} : M_k \to M_{k-1}$ induced by the
transposed matrix $\mathcal{I}_k^t$ satisfies

\[
(P^0_{k-1} u, v)_{k-1} = (u, I_k v)_k \quad \forall\, v \in M_{k-1}. \tag{3.5}
\]

Finally, we shall employ linear “smoothing iterations” associated with the problems
(3.2), for $k = 2, \ldots, j$. We postpone the discussion of the suitability of a smoothing
iteration, but now claim that point, line, or block Jacobi or Gauss-Seidel iterations, or the
Richardson iteration, may be effectively used. These smoothing iterations can be expressed
in terms of procedures

\[
X^i = \mathcal{R}_k(X^{i-1}, F_k),
\]

where each $\mathcal{R}_k(\cdot,\cdot)$ satisfies

(1) (consistency) $X = \mathcal{R}_k(X, \mathcal{A}_k X)$ for all $X$, and

(2) (linearity) $\mathcal{R}_k(X, Y) + a\,\mathcal{R}_k(U, V) = \mathcal{R}_k(X + aU, Y + aV)$.

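Alternatively, if we define a matrix $\mathcal{R}_k$ so that $\mathcal{R}_k X = \mathcal{R}_k(0, X)$ for all $X$, then we may
rewrite the smoothing iteration. Combining properties (1) and (2) we have

\[
X^{i-1} = \mathcal{R}_k(0, \mathcal{A}_k X^{i-1}) + \mathcal{R}_k(X^{i-1}, 0),
\]

hence

\[
\begin{aligned}
X^i &= \mathcal{R}_k(X^{i-1}, F_k) \\
    &= \mathcal{R}_k(X^{i-1}, 0) + \mathcal{R}_k(0, F_k) \\
    &= X^{i-1} + \mathcal{R}_k(0, F_k - \mathcal{A}_k X^{i-1}),
\end{aligned}
\]

that is,

\[
X^i = X^{i-1} + \mathcal{R}_k\,[F_k - \mathcal{A}_k X^{i-1}]. \tag{3.6}
\]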
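
As a concrete instance of this rewriting, the sketch below checks that one forward Gauss-Seidel sweep, coded componentwise, agrees with the residual-correction form X + R (F - A X), where R = (D + L)^{-1} is the matrix obtained from a sweep started at zero. The tridiagonal test matrix and the helper name `gauss_seidel_sweep` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def gauss_seidel_sweep(A, X, F):
    """One forward Gauss-Seidel sweep for A X = F, written componentwise."""
    X = X.copy()
    for i in range(len(F)):
        X[i] = (F[i] - A[i, :i] @ X[:i] - A[i, i + 1:] @ X[i + 1:]) / A[i, i]
    return X

n = 5
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # model SPD matrix
rng = np.random.default_rng(1)
X, F = rng.standard_normal(n), rng.standard_normal(n)

# Matrix R with R Y = sweep(0, Y): for Gauss-Seidel this is (D + L)^{-1}.
R = np.linalg.inv(np.tril(A))

lhs = gauss_seidel_sweep(A, X, F)        # procedure form
rhs = X + R @ (F - A @ X)                # residual-correction form
print(np.allclose(lhs, rhs))
# Consistency: the exact solution of A X = G is a fixed point of the sweep.
print(np.allclose(gauss_seidel_sweep(A, X, A @ X), X))
```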

The smoothing procedures $Z = \mathcal{R}_k(X, F)$ induce analogous procedures $z = \mathbf{R}_k(x, f)$ acting
directly on elements of the space $M_k$. Here $X$ and $Z$ denote coefficient vectors of $x \in M_k$
and $z \in M_k$, and $f$ is related to the $i$th component of $F$ by $[F]_i = (f, \phi_k^i)_k$. Consistency,
$x = \mathbf{R}_k(x, \mathbf{A}_k x)$, and the linearity of $\mathbf{R}_k(\cdot,\cdot)$ follow from the same properties of $\mathcal{R}_k(\cdot,\cdot)$.
From $\mathbf{R}_k(\cdot,\cdot)$ we obtain a smoothing operator $\mathbf{R}_k : M_k \to M_k$, $\mathbf{R}_k x = \mathbf{R}_k(0, x)$, and we can
write the smoothing iteration (3.6) in operator form

\[
x^i = x^{i-1} + \mathbf{R}_k (f_k - \mathbf{A}_k x^{i-1}), \tag{3.7}
\]

where $f_k$ satisfies $(f_k, v)_k = (F_k, V)$.

Remark 2. In order that we may later symmetrize our multigrid schemes, we
allow for a somewhat more general smoothing; cf. [8]. For example, if $\mathbf{R}_k$ denotes the
(asymmetric) smoothing operator induced by one particular Gauss-Seidel sweep, then $\mathbf{R}_k^t$
is the operator which corresponds to a Gauss-Seidel sweep in the “reverse” direction. Here,
the superscript $t$ denotes the adjoint or transpose with respect to $(\cdot,\cdot)_k$. We shall define a
Gauss-Seidel operator $\mathbf{R}_k^{(i)}$ so that

\[
\mathbf{R}_k^{(i)} =
\begin{cases}
\mathbf{R}_k & \text{if } i \text{ is odd}, \\
\mathbf{R}_k^t & \text{if } i \text{ is even},
\end{cases}
\]

and consider smoothing iterations

\[
x^i = x^{i-1} + \mathbf{R}_k^{(i)} (f_k - \mathbf{A}_k x^{i-1}). \tag{3.8}
\]

We shall use the notation $X^i = \mathcal{R}_k^{(i)}(X^{i-1}, F_k)$ to denote the action of this alternating
smoother. □
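
The adjoint relation between the two sweep directions is easy to confirm in matrix terms. The sketch below assumes the inner product on coefficient vectors is the Euclidean one and uses a small tridiagonal matrix as a stand-in for the stiffness matrix; the function name `alternating_smoother` is hypothetical.

```python
import numpy as np

n = 6
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # symmetric model matrix

R_forward = np.linalg.inv(np.tril(A))    # forward sweep: (D + L)^{-1}
R_backward = np.linalg.inv(np.triu(A))   # reverse sweep: (D + U)^{-1}

def alternating_smoother(i):
    """Return R^(i): the forward sweep for odd i, the reverse sweep for even i."""
    return R_forward if i % 2 == 1 else R_backward

# For symmetric A the reverse sweep is the transpose of the forward sweep.
print(np.allclose(R_backward, R_forward.T))

# A forward sweep followed by a reverse sweep has error operator
# (I - R^t A)(I - R A), which is self-adjoint in the A-inner product.
I = np.eye(n)
E = (I - alternating_smoother(2) @ A) @ (I - alternating_smoother(1) @ A)
print(np.allclose(A @ E, (A @ E).T))
```

Pairing each sweep with its adjoint is what makes the resulting multigrid operator symmetric, which the conjugate gradient method requires of a preconditioner.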

We may now define a rather general symmetric multigrid process for iteratively solving
(3.2). Given an initial approximation $Z_k^{l-1}$ to the solution $U_k$ of the problem $\mathcal{A}_k U_k = F_k$,
compute an improved approximation, $Z_k^l = \mathrm{Mg}_k(Z_k^{l-1}, F_k)$. The procedure $\mathrm{Mg}_k(\cdot,\cdot)$ is
defined below by the recursive Algorithm 1. Setting $p = 2$ in this algorithm and using
$m(k) = m$ smoothings for each level $k$ yields a multigrid W-cycle. With $p = 1$, a V-cycle is
obtained. It is possible to increase the number of smoothings $m(k)$ as $k$ decreases without
significantly increasing the cost required to compute $\mathrm{Mg}_k(\cdot,\cdot)$. We shall refer to a Variable
V-cycle as that scheme obtained from Algorithm 1 with $p = 1$ and

\[
\beta_0\, m(k) \le m(k-1) \le \beta_1\, m(k) \quad \text{and} \quad 1 < \beta_0 \le \beta_1 .
\]

Observe that the Variable V-cycle with $m(k) = 2^{j-k}$ and the W-cycle with $m = 1$ require
the same number of smoothing iterations for each level, and hence require roughly the same
computational effort to perform.
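
The cost comparison can be verified by counting. The sketch below assumes the recursion structure described in the text: a cycle started on level j calls the next coarser level p times, and each visit to level k performs m(k) smoothings before and m(k) after the coarse grid correction. The function names are illustrative.

```python
def visits_per_level(j, p):
    """Number of times the recursion reaches each level k = j, ..., 2."""
    return {k: p ** (j - k) for k in range(j, 1, -1)}

def smoothing_steps(j, p, m):
    """Total smoothing steps per level for one cycle; m maps level k to m(k).
    Each visit smooths m(k) times before and m(k) times after the correction."""
    return {k: 2 * m(k) * v for k, v in visits_per_level(j, p).items()}

j = 5
w_cycle = smoothing_steps(j, p=2, m=lambda k: 1)
variable_v = smoothing_steps(j, p=1, m=lambda k: 2 ** (j - k))

print(w_cycle)
print(variable_v)
print(w_cycle == variable_v)
```

The W-cycle reaches level k a total of 2^(j-k) times with one smoothing pair per visit, while the Variable V-cycle reaches it once with 2^(j-k) smoothing pairs, so the per-level totals coincide.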

It can be shown that the multigrid procedure is linear and is also consistent with
$X = \mathrm{Mg}_k(X, \mathcal{A}_k X)$. Consider again the multigrid solution of (3.2) which generates iterates

\[
Z^l = \mathrm{Mg}_j(Z^{l-1}, F_j) \tag{3.9}
\]

from an initial guess $Z^0$. Repeating the arguments used to obtain (3.6) and defining
matrices $\mathcal{B}_k$ so that $\mathcal{B}_k X = \mathrm{Mg}_k(0, X)$, we may rewrite (3.9) in the form

\[
Z^l = Z^{l-1} + \mathcal{B}_j (F_j - \mathcal{A}_j Z^{l-1}), \tag{3.10}
\]
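procedure $\mathrm{Mg}_k(Z_k^{l-1}, F_k)$

   if $k = 1$, solve exactly

      return $\mathrm{Mg}_1(Z_1^{l-1}, F_1) = \mathcal{A}_1^{-1} F_1$

   else, define $\mathrm{Mg}_k(Z_k^{l-1}, F_k)$ in terms of $\mathrm{Mg}_{k-1}(\cdot,\cdot)$ as follows:

      0. initialize $X^0 = Z_k^{l-1}$ and $Q^0 = 0$

      1. smooth $m(k)$ times

         $X^i = X^{i-1} + \mathcal{R}_k^{(i)} (F_k - \mathcal{A}_k X^{i-1})$,   $i = 1, \ldots, m(k)$

      2. perform a coarse grid correction

         $Y^{m(k)} = X^{m(k)} + \mathcal{I}_k Q^p$

         where

         $Q^i = \mathrm{Mg}_{k-1}(Q^{i-1}, \mathcal{I}_k^t (F_k - \mathcal{A}_k X^{m(k)}))$,   $i = 1, \ldots, p$

      3. smooth $m(k)$ more times

         $Y^i = Y^{i-1} + \mathcal{R}_k^{(i)} (F_k - \mathcal{A}_k Y^{i-1})$,   $i = m(k)+1, \ldots, 2m(k)$

      return $\mathrm{Mg}_k(Z_k^{l-1}, F_k) = Y^{2m(k)}$


ALGORITHM 1. A symmetric multigrid procedure.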
or in an equivalent operator form

\[
z^l = z^{l-1} + \mathbf{B}_j (f_j - \mathbf{A}_j z^{l-1}). \tag{3.11}
\]

We conclude that the multigrid iteration is simply a linear iterative scheme for solving the
preconditioned system

\[
\mathcal{B}_j \mathcal{A}_j U_j = \mathcal{B}_j F_j \quad \text{or} \quad \mathbf{B}_j \mathbf{A}_j u_j = \mathbf{B}_j f_j . \tag{3.12}
\]

Denoting the $l$th error $e^l = u_j - z^l$, from (3.11) it is clear that

\[
e^l = \mathbf{E}_j e^{l-1}
\]

holds for a linear error reduction operator $\mathbf{E}_j : M_j \to M_j$. In fact,

\[
\mathbf{E}_j = I - \mathbf{B}_j \mathbf{A}_j . \tag{3.13}
\]

If the smoothing iterations are “symmetrically performed”, see Remark 2, then $\mathbf{B}_j$ can be
shown to be symmetric with respect to $(\cdot,\cdot)_j$. Consequently, $\mathbf{E}_j$ is symmetric with respect
to the inner product $(\mathbf{A}_j \cdot, \cdot)_j$. The multigrid iteration (3.9) is contracting provided that
the eigenvalues of the operator $\mathbf{E}_j$ are contained in the interval $(-1, 1)$, or if $\|\mathbf{E}_j\| \le \delta < 1$.
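
The recursive procedure of Algorithm 1 and the resulting matrix B_j can be sketched in a few lines. The code below is an illustration under loudly stated assumptions: it uses a one-dimensional Poisson model problem with nested piecewise-linear spaces (not the biharmonic discretizations of Section 2), alternating Gauss-Seidel sweeps as the smoother, and hypothetical helper names `stiffness`, `prolongation`, `smooth`, and `mg`. In this nested, inherited setting the difficulties discussed in this paper do not arise, so the sketch shows the structure of the algorithm rather than its behavior on the Morley or Ciarlet-Raviart systems.

```python
import numpy as np

def stiffness(k):
    """Piecewise-linear FE stiffness matrix for -u'' on (0,1), h = 2**-k."""
    n = 2 ** k - 1
    return 2.0 ** k * (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1))

def prolongation(k):
    """Linear interpolation from level k-1 to level k (the matrix I_k)."""
    nc, nf = 2 ** (k - 1) - 1, 2 ** k - 1
    P = np.zeros((nf, nc))
    for c in range(nc):
        f = 2 * c + 1                 # fine-grid index of coarse node c
        P[f, c] = 1.0
        P[f - 1, c] = 0.5
        P[f + 1, c] = 0.5
    return P

def smooth(A, X, F, i):
    """R^(i): forward Gauss-Seidel sweep for odd i, reverse sweep for even i."""
    T = np.tril(A) if i % 2 == 1 else np.triu(A)
    return X + np.linalg.solve(T, F - A @ X)

def mg(k, Z, F, p, m):
    """One step of the symmetric multigrid procedure on level k."""
    A = stiffness(k)
    if k == 1:
        return np.linalg.solve(A, F)             # exact coarsest-level solve
    X = Z.copy()
    for i in range(1, m(k) + 1):                 # step 1: pre-smoothing
        X = smooth(A, X, F, i)
    P = prolongation(k)
    r = P.T @ (F - A @ X)                        # restricted residual
    Q = np.zeros(P.shape[1])
    for _ in range(p):                           # step 2: p coarse corrections
        Q = mg(k - 1, Q, r, p, m)
    Y = X + P @ Q
    for i in range(m(k) + 1, 2 * m(k) + 1):      # step 3: post-smoothing
        Y = smooth(A, Y, F, i)
    return Y

j = 4
A_j = stiffness(j)
n = A_j.shape[0]
variable_v = lambda k: 2 ** (j - k)

# Column i of B is Mg_j(0, e_i): the action of one Variable V-cycle (p = 1).
B = np.column_stack([mg(j, np.zeros(n), e, 1, variable_v) for e in np.eye(n)])
eigs = np.linalg.eigvals(B @ A_j).real
print(np.allclose(B, B.T), eigs.min(), eigs.max())
```

Because the post-smoothing sweeps are the adjoints of the pre-smoothing sweeps in reverse order, the assembled B is symmetric, which is the property needed to use it inside a conjugate gradient iteration.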
Equivalently, the linear iterative scheme (3.11) converges provided that the eigenvalues of
$\mathbf{B}_j \mathbf{A}_j$ are contained in $(0, 2)$. For a result in this direction, see Theorem 4.

If the maximum eigenvalue of $\mathbf{B}_j \mathbf{A}_j$ is larger than 2, then (3.11) generally diverges,
although a different iterative scheme for solving (3.12), conjugate gradients for example,
may rapidly converge. This is an important observation for the multigrid iteration when it
is applied to the finite element methods of Section 2. Indeed, the W-cycle with $m = 1$ may
diverge for these methods. However, the problem (3.12) with $\mathbf{B}_j$ obtained from the Variable
V-cycle with $p = 1$ and $m(k) = 2^{j-k}$ has a small condition number, independent of the
mesh diameter $h_k$, and is rapidly solved by a conjugate gradient iteration; cf. Theorem 5.
Even when $\|\mathbf{E}_j\| \le \delta < 1$ so that the multigrid iteration (3.11) converges, the spectral
condition number of $\mathbf{B}_j \mathbf{A}_j$ satisfies

\[
\kappa_2(\mathcal{B}_j \mathcal{A}_j) = \kappa_2(\mathbf{B}_j \mathbf{A}_j) = \frac{\lambda_{\max}(\mathbf{B}_j \mathbf{A}_j)}{\lambda_{\min}(\mathbf{B}_j \mathbf{A}_j)} \le \frac{1 + \delta}{1 - \delta},
\]

and it is often faster to solve (3.12) with a conjugate gradient iteration.
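
A toy computation makes the distinction between the two criteria concrete. The sketch below uses a hypothetical diagonal matrix and a scaled identity as the "preconditioner"; neither comes from the paper. It shows that rescaling B can push the largest eigenvalue of BA past 2, so the linear iteration diverges, while the condition number, which governs conjugate gradients, is unchanged.

```python
import numpy as np

def spectral_summary(B, A):
    """Eigenvalue extremes of BA, the contraction number max|1 - lambda| of
    E = I - BA, and the spectral condition number lambda_max / lambda_min."""
    lam = np.sort(np.linalg.eigvals(B @ A).real)
    delta = max(abs(1.0 - lam[0]), abs(1.0 - lam[-1]))
    return lam[0], lam[-1], delta, lam[-1] / lam[0]

A = np.diag(np.arange(1.0, 11.0))     # hypothetical SPD matrix, eigenvalues 1..10

results = {}
for scale in (0.15, 0.30):            # toy preconditioners B = scale * I
    results[scale] = spectral_summary(scale * np.eye(10), A)
    lam_min, lam_max, delta, kappa = results[scale]
    print(scale, lam_max < 2, delta < 1, kappa)
```

With scale 0.15 the iteration contracts and the condition number respects the bound (1 + delta)/(1 - delta); with scale 0.30 the contraction number exceeds 1 although the condition number is the same.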

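Using the inner product $[\cdot,\cdot] = (\mathcal{B}_j^{-1}\cdot,\cdot)$ in which $\mathcal{B}_j \mathcal{A}_j$ is symmetric, a conjugate
gradient algorithm for $\mathcal{B}_j \mathcal{A}_j U_j = \mathcal{B}_j F_j$ can be derived; see Algorithm 2. (Note that the matrix
$\mathcal{B}_j \mathcal{A}_j$ may not be symmetric.) Algorithm 2 happens to be identical to the preconditioned
conjugate gradient algorithm for the problem $\mathcal{B}_j^{1/2} \mathcal{A}_j \mathcal{B}_j^{1/2} (\mathcal{B}_j^{-1/2} U_j) = \mathcal{B}_j^{1/2} F_j$ described in
[12].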


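$l = 0$;  $X^0 = 0$;  $R^0 = F_j$
while $\|Z^l\| > \epsilon\, \|Z^0\|$

   compute $Z^l = \mathcal{B}_j R^l$
   $l = l + 1$
   if $l = 1$

      $P^1 = Z^0$

   else

      $\beta = (R^{l-1})^t Z^{l-1} \,/\, (R^{l-2})^t Z^{l-2}$
      $P^l = Z^{l-1} + \beta P^{l-1}$

   end

   $\alpha = (R^{l-1})^t Z^{l-1} \,/\, (P^l)^t \mathcal{A}_j P^l$
   $X^l = X^{l-1} + \alpha P^l$

   $R^l = R^{l-1} - \alpha \mathcal{A}_j P^l$

end

ALGORITHM 2. A multigrid-preconditioned conjugate gradient algorithm.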
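
A runnable counterpart of Algorithm 2 might look as follows. This is a sketch, not the author's code: the preconditioner is passed as a callable `apply_B` (a hypothetical name), the stopping test is evaluated after each preconditioned residual is available, and a Jacobi preconditioner on a small tridiagonal matrix stands in for the multigrid preconditioner.

```python
import numpy as np

def pcg(A, F, apply_B, eps=1e-10, max_iter=200):
    """Preconditioned conjugate gradients for A X = F, started from X = 0.
    apply_B(R) returns B R for a symmetric positive definite preconditioner B."""
    X = np.zeros_like(F)
    R = F.copy()                      # residual R^0 = F
    Z = apply_B(R)                    # preconditioned residual Z^0 = B R^0
    P = Z.copy()                      # first search direction
    rz = R @ Z
    z0_norm = np.linalg.norm(Z)
    for _ in range(max_iter):
        if np.linalg.norm(Z) <= eps * z0_norm:
            break
        AP = A @ P
        alpha = rz / (P @ AP)
        X = X + alpha * P
        R = R - alpha * AP
        Z = apply_B(R)
        rz_new = R @ Z
        P = Z + (rz_new / rz) * P     # beta = (R^l)^t Z^l / (R^{l-1})^t Z^{l-1}
        rz = rz_new
    return X

n = 20
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
F = np.ones(n)
X = pcg(A, F, lambda R: R / np.diag(A))   # Jacobi preconditioner as a stand-in
print(np.allclose(A @ X, F))
```

To use a multigrid preconditioner instead, `apply_B` would perform one symmetric multigrid cycle started from a zero initial guess with the residual as right-hand side.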

3.2. Convergence theory. We now outline the convergence theory for the multigrid
schemes of the previous section following the approach taken in [8]. The analysis is
performed using operator notation and is based upon two conditions. Before introducing
the first condition, which concerns the smoothing iteration, it is convenient to define an
error operator $\mathbf{K}_k = I - \mathbf{R}_k \mathbf{A}_k$ and its adjoint with respect to the inner product $(\mathbf{A}_k\cdot,\cdot)_k$,

\[
\mathbf{K}_k^* = I - \mathbf{R}_k^t \mathbf{A}_k .
\]
(C.1) There is a constant $C_R$ independent of $k$ such that the smoothing procedure satisfies

\[
\frac{|||u|||_{0,k}^2}{\lambda_k} \le C_R\, (\bar{\mathbf{R}}_k u, u)_k \quad \forall\, u \in M_k, \tag{3.14}
\]

for both $\bar{\mathbf{R}}_k = (I - \mathbf{K}_k^* \mathbf{K}_k)\mathbf{A}_k^{-1}$ and $\bar{\mathbf{R}}_k = (I - \mathbf{K}_k \mathbf{K}_k^*)\mathbf{A}_k^{-1}$, where $\lambda_k$ is the largest
eigenvalue of $\mathbf{A}_k$, and $|||u|||_{0,k}$ denotes the norm induced by the inner product $(\cdot,\cdot)_k$.

In [7] it is shown that (C.1) is equivalent to the condition that the smoothing iteration
(3.8) converge at a rate exceeding that of a Richardson iteration defined by $\mathbf{R}_k = \omega \lambda_k^{-1} I$
with $\omega = C_R^{-1}$. That paper then proves (C.1) for a class of smoothers defined by subspace
decomposition and which satisfy simple hypotheses. In particular, point, line, and block
Jacobi or Gauss-Seidel iterations satisfy (C.1).

Remark 3. The multigrid algorithms described in [7, 8] use the following iteration
as a smoothing

\[
x^i = x^{i-1} + \mathbf{R}_k^{(i + m(k))} (f_k - \mathbf{A}_k x^{i-1}).
\]

This iteration differs from (3.8) only notationally. In fact, the meaning given to $\mathbf{R}_k^{(1)}$ (e.g.
the sweep direction for the first Gauss-Seidel smoothing) can be chosen, perhaps differently,
for each notation so that the two iterations are identical. □

The second condition is expressed in terms of the adjoint $P_{k-1} : M_k \to M_{k-1}$ of $I_k$
taken with respect to the inner products $A_k(\cdot,\cdot) = (\mathbf{A}_k\cdot,\cdot)_k$ and $A_{k-1}(\cdot,\cdot)$,

\[
A_{k-1}(P_{k-1} u, v) = A_k(u, I_k v) \quad \forall\, v \in M_{k-1}. \tag{3.15}
\]

(C.2) “Approximation and Regularity”: for some $\alpha \in (0, 1]$ with $C_\alpha$ independent of $k$,

\[
|A_k((I - I_k P_{k-1})u, u)| \le C_\alpha^2 \left( \frac{|||\mathbf{A}_k u|||_{0,k}^2}{\lambda_k} \right)^{\alpha} A_k(u, u)^{1-\alpha} \quad \forall\, u \in M_k. \tag{3.16}
\]

The condition (C.2) is typically proved using the approximation properties of the spaces
$M_k$ and the elliptic regularity (2.3) of the underlying partial differential equation.

We consider first a result for the W-cycle. For the Morley method, convergence of the
W-cycle iteration was first proved by Brenner [9].

THEOREM 4. (W-cycle) If the conditions (C.1) and (C.2) are satisfied, then the
m-smoothing W-cycle iteration defined by Algorithm 1 converges for sufficiently large $m$, and
$\mathbf{E}_k = I - \mathbf{B}_k \mathbf{A}_k$ is a contraction, with contraction number (independent of $k$) given by

\[
\frac{M}{M + m^\alpha},
\]

where $M$ is a constant, $M = M(\alpha, C_\alpha, C_R)$. Furthermore, the same conclusion holds if “$m$
is sufficiently large” is replaced by the assumption

\[
A_k(I_k u, I_k u) \le 2 A_{k-1}(u, u) \quad \forall\, u \in M_{k-1}. \tag{3.17}
\]
An explicit expression for $M$ can be found in [6]. For the bilinear forms $A_k(\cdot,\cdot)$ obtained
for the Morley or for the modified Ciarlet-Raviart method, the condition (3.17) fails. And
as previously noted, “m sufficiently large” can mean m > 8 in this context; see Section 4.
Not only will the W-cycle diverge for small m, but numerical results show that for the
W-cycle, the preconditioned operator $\mathbf{B}_j \mathbf{A}_j$ may be indefinite. It is not certain that a conjugate
gradient iteration for (3.12) will then converge. The next theorem will demonstrate that
one does not encounter this difficulty with the Variable V-cycle.

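THEOREM 5. (Variable V-cycle) If the conditions (C.1) and (C.2) are satisfied, then
the Variable V-cycle multigrid algorithm yields preconditioners $\mathbf{B}_k$ such that

\[
\eta_0\, A_k(u, u) \le A_k(\mathbf{B}_k \mathbf{A}_k u, u) \le \eta_1\, A_k(u, u) \quad \forall\, u \in M_k, \tag{3.18}
\]

with

\[
\eta_0 \ge \frac{m(k)^\alpha}{M + m(k)^\alpha}, \qquad \eta_1 \le \frac{M + m(k)^\alpha}{m(k)^\alpha}, \tag{3.19}
\]

where $M$ is a constant, $M = M(\alpha, C_\alpha, C_R)$.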

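As a consequence of this theorem, the spectral condition number of $\mathbf{B}_k \mathbf{A}_k$ satisfies

\[
\kappa_2(\mathbf{B}_k \mathbf{A}_k) \le \frac{\eta_1}{\eta_0} \le \left( \frac{M + m(k)^\alpha}{m(k)^\alpha} \right)^2
\]

and the multigrid-preconditioned conjugate gradient solution of (3.12) converges with an
asymptotic rate of

\[
\frac{\sqrt{\kappa_2(\mathbf{B}_j \mathbf{A}_j)} - 1}{\sqrt{\kappa_2(\mathbf{B}_j \mathbf{A}_j)} + 1} \le \frac{M}{M + 2\,m(j)^\alpha} \tag{3.20}
\]

per iteration.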
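
The chain of bounds can be evaluated for sample constants to see how they fit together. In the sketch below the bounds are taken in the form eta_0 >= m^alpha / (M + m^alpha) and eta_1 <= (M + m^alpha) / m^alpha; the values of M, m, and alpha are purely hypothetical, since the paper does not give them here.

```python
import math

def variable_v_bounds(M, m, alpha):
    """Bounds implied by the Variable V-cycle theorem for constants M, alpha
    and m = m(j) smoothings on the finest level."""
    eta0 = m ** alpha / (M + m ** alpha)      # lower spectral bound
    eta1 = (M + m ** alpha) / m ** alpha      # upper spectral bound
    kappa = eta1 / eta0                       # condition number bound
    rate = (math.sqrt(kappa) - 1) / (math.sqrt(kappa) + 1)
    return eta0, eta1, kappa, rate

M, m, alpha = 4.0, 1, 1.0                     # hypothetical sample values
eta0, eta1, kappa, rate = variable_v_bounds(M, m, alpha)
print(eta0, eta1, kappa, rate)
print(math.isclose(rate, M / (M + 2 * m ** alpha)))
```

Inserting the condition number bound into the conjugate gradient rate reproduces the closed form M / (M + 2 m^alpha), which does not depend on the number of levels.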

A proof of (C.2) for the Ciarlet-Raviart mixed method may be found in [14]. A proof
of (C.2) for the Morley method appears in [5]. The preconditioning properties of the
simple V-cycle are questionable in comparison to those of the Variable V-cycle. Numerical
experiments described in Section 4 suggest that the condition number obtained with the
simple V-cycle may not be bounded independent of the mesh.

4. Numerical Results. According to Theorem 4, convergence of the standard W-cycle
iteration applied to the Ciarlet-Raviart or the Morley method is assured provided
that enough smoothing iterations are performed. Guarantees of convergence are provisional
since (3.17) is not satisfied. In this section, we will present computations which show that
W-cycle iterations with minimal smoothing are not robust here; these iterations commonly
diverge. Rather than abandon multigrid for these finite element methods, we advocate that
Theorem 5 be employed in these situations. According to this theorem, a single Variable
V-cycle generates a robust preconditioner for (3.2). The effectiveness of this preconditioner
when combined with a preconditioned conjugate gradient (PCCG) iteration is illustrated
by a second set of computations.




In the following computations the modification to the Ciarlet-Raviart method and
piecewise-quadratic spaces have been used. Using the discrete forms defined by (2.10)
makes it affordable to apply Gauss-Seidel smoothing iterations to the Ciarlet-Raviart
Schur complement. (This is another good reason to use the modified Ciarlet-Raviart
method. Similar multigrid behavior is observed for the standard Ciarlet-Raviart method
with Richardson smoothing.) We construct a sequence of nested quasi-uniform triangulations
of $\Omega = [0,1] \times [0,1]$ from a coarse mesh by joining triangle midpoints to get finer
meshes. For each computation, the coarsest mesh consists of two triangles.

The first set of computations examines five-level W-cycles as the number of multigrid
smoothing iterations, m, is increased. Eigenvalues $\lambda_{\min}$ and $\lambda_{\max}$ of the "W-cycle
preconditioned" operators $B_k A_k$ are displayed in Table 1 for k = 5 and both Richardson ($\omega = 1.5$)
and Gauss-Seidel smoothing. Recall from Section 3.1 that the error reduction operator for
the standard multigrid iteration satisfies $E_k = I - B_k A_k$, cf. (3.13). Consequently, if
the spectrum of $B_k A_k$ is not contained in the interval (0,2), then the standard multigrid
iteration may diverge. Alternatively, one might consider the multigrid reduction factors
$\delta = \max(|\lambda_{\max} - 1|, |1 - \lambda_{\min}|)$. In practice we find that $\delta = \lambda_{\max} - 1$. (This differs from
the usual inherited form situation in which $\lambda_{\max} \le 1$ and $\delta = 1 - \lambda_{\min}$.) For the modified
Ciarlet-Raviart scheme, we see from Table 1 that if Richardson smoothing is used, one
must take "m > 8" smoothings per five-level W-cycle iteration in order to guarantee
convergence. Clearly Gauss-Seidel smoothing is to be preferred for this problem, yet m > 1
is still necessary for general W-cycle convergence. Similarly, for the Morley discretization,
Richardson smoothing is again unsatisfactory for small m. In this case, it appears that
m = 1 Gauss-Seidel smoothing yields an acceptable W-cycle. However, it is not certain
that this will continue to be true for finer grids or for different domains $\Omega$.
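The divergence criterion just described reduces to a two-line computation on the extremal eigenvalues. The sketch below is illustrative only; the function names are assumptions, and the two eigenvalue pairs in the usage lines are pairs that appear in Table 1, used here without attributing them to a particular method or value of m.

```python
def reduction_factor(lam_min, lam_max):
    """Reduction factor max(|lam_max - 1|, |1 - lam_min|) of E = I - B A."""
    return max(abs(lam_max - 1.0), abs(1.0 - lam_min))


def may_diverge(lam_min, lam_max):
    """True when the spectrum of B A is not contained in the interval (0, 2)."""
    return reduction_factor(lam_min, lam_max) >= 1.0


# An indefinite case (lam_min < 0, lam_max > 2) and a convergent case.
indefinite = may_diverge(-1.16, 2.23)
convergent_delta = reduction_factor(0.119, 1.96)
```

In the convergent case the maximum is attained by lam_max - 1, in line with the observation in the text.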

TABLE 1. W-cycle preconditioned $B_k A_k$, extremal eigenvalues vs. m.


modified  Ciarlet-Raviart 

Morley

_ 

m 

Richardson 

Gauss-Seidel 

Richardson 

Gauss-Seidel 

^max 

^ max 

^ min 

^max 

^min 

^ max 

m 

2.48 

-23.3 

2.23 

-1.16 

2.10 

.211 

m 

wxm 

§1111 

1.96 

.119 

1.79 

.377 

1.65 

.815 

l_Li 

2.29 

-1.46 

1.84 

.520 

■Efflil 

1.55 

81 

01 

2.22 

-.645 

1.72 

.694 

1.49 

.922 

;BI 

2.18 

-.328 

1.64 

.757 

1.65 

.694 

1.45 

.948 

m 

1.56 

.806 

- 

- 

- 

- 

7 

mm 

- 

-- 

- 

- 

1.999 

.284 

1.44 

.874 

- 

- 

- 

Next we examine the effectiveness of the Variable V-cycle ($\beta = 2$) preconditioners $B_k$.
Computed values for extremal eigenvalues and condition numbers of the preconditioned
operators $B_k A_k$, k = 2, ..., 6, are listed in Table 2. According to Theorem 5, the condition
number of $B_k A_k$ is bounded independently of k. Further computations with varying $\beta$
confirm that the slow growth of $\lambda_{\max}$ seen in this table is not inconsistent with this theorem,
but is reminiscent of the upper bound in (3.19),

$$\lambda_{\max}(B_k A_k) \le \eta_1 \le \frac{M + m(k)^\alpha}{m(k)^\alpha},$$

where $m(i) = 2^{k-i}$. For comparison, note that with k = 8 ($h_k = 1/128$), the condition
number of $A_k$, the Schur complement from (2.9), exceeds $10^8$.

From the condition numbers of Table 2 we can estimate reduction factors for Variable
V-cycle preconditioned conjugate gradient iterations. For condition numbers $\kappa_2(B_k A_k) \le 20$
we obtain $\delta_{PCCG} \le .68$ (averaging over 10 iterations). Similarly, $\delta_{PCCG} \le .48$ for the
Morley method when $\kappa_2(B_k A_k) \le 7$. These reductions are significantly better than those
obtained for W-cycle iterations. According to Table 1, the W-cycle reduction factors satisfy
$\delta \le .86$ and $\delta \le .96$ for k = 5. This distinction, which is supported by iteration data
(cf. Table 3), is explained in Section 3.2, see (3.9) - (3.12). It was shown there that the
standard multigrid iteration is nothing more than a preconditioned iteration using a scheme
which is generally less effective than the preconditioned conjugate gradient scheme.
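The quoted reduction factors can be reproduced from the condition numbers. The sketch below assumes that "averaging over 10 iterations" refers to the classical conjugate gradient bound, in which the error after n iterations is at most 2 r^n times the initial error with r = (sqrt(kappa) - 1)/(sqrt(kappa) + 1); that reading is an assumption, although it does return the figures given in the text. Function names are illustrative.

```python
import math


def cg_rate(kappa):
    """Asymptotic conjugate gradient rate (sqrt(kappa) - 1) / (sqrt(kappa) + 1)."""
    s = math.sqrt(kappa)
    return (s - 1.0) / (s + 1.0)


def averaged_reduction(kappa, n_iter=10):
    """Per-iteration factor implied by the bound 2 * rate**n over n_iter steps."""
    return (2.0 ** (1.0 / n_iter)) * cg_rate(kappa)


delta_cr = averaged_reduction(20.0)      # modified Ciarlet-Raviart, kappa <= 20
delta_morley = averaged_reduction(7.0)   # Morley, kappa <= 7
```

Rounded to two digits these evaluate to .68 and .48.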


TABLE 2. Extremal eigenvalues for variable V-cycle $B_k A_k$.

          modified Ciarlet-Raviart                Morley
  k    λmax     λmin     λmax/λmin      λmax     λmin     λmax/λmin
  2    1.22     0.553    2.20           .9999    0.846    1.18
  3    2.18     0.355    6.12           1.47     0.641    2.29
  4    3.38     0.297    11.4           2.05     0.612    3.35
  5    4.45     0.284    15.7           2.75     0.595    4.63
  6    5.07     0.281    18.0           3.40     0.588    5.78

To further illustrate the behavior of the various multigrid schemes we provide Table 3.
In particular, this data demonstrates that, in addition to being robust, the Variable V-cycle
preconditioner is cheaper to use than an m > 1 W-cycle. Recorded in Table 3
are the numbers of iterations required to solve $A_k u_k = f_k$ with relative error less than
$2 \times 10^{-7}$. Relative error was measured in the norm $(A_k B_k A_k \cdot, \cdot)_k^{1/2}$ ($2 \times 10^{-7}$ corresponds
to approximately six correct digits), and iteration counts were averaged for five simple
test problems. Results are listed for both the modified Ciarlet-Raviart and the Morley
methods. In each case Gauss-Seidel smoothing was used. For the W-cycles, results for the
standard multigrid scheme appear first and are followed by a slash and then the results for
the associated preconditioned conjugate gradient (PCCG) scheme. Note that the W-cycle
may be used as a preconditioner provided that m is large enough ($m \ge 2$ for the modified
Ciarlet-Raviart method). In comparing results note that an (m = 1) W-cycle or a ($\beta = 2$)
Variable V-cycle iteration is roughly 50 to 100 percent more costly than an (m = 1) V-cycle
iteration. An m-smoothing W-cycle is roughly m times as expensive as a ($\beta = 2$) Variable
V-cycle.

It is not known if the V-cycle provides a bounded preconditioning for these problems.

TABLE 3. Convergence of multigrid iterative and preconditioned schemes.


modified  Ciarlet-Raviart 

Morley 

variable- V 

W-cycie 

variable-V 

W -cycle 

V -cycle 

PCCG 

standard  multigrid  /  PCCG 

PCCG 

multigrid  /  PCCG 

PCCG 

0  =  2 

m  =  2 

771  =  3 

KB 

CM 

II 

*5. 

m  =  1 

777—1 

6.8 

9.0  /  5.0 

6.6  /  4.2 

6.8 

4.8 

15.2 

53.8  /  10.8 

32.8  /  8.8 

14.8 

9.0 

8.6 

1 

20.8 

21.0 

11.6 

na 

□ 

24.2 

305.6  /  18.8 

72.4  /  12.6 

26.8 

13.4 

79.0  /  i2.0 

14.0 

6 

26.8 

32.8 

16.4 

7 

27.4 

273.4  /  21.8 

72.0  /  14.4 

41.0 

16.4 

187.8  /  14.0 

19.2 

B 

26.6 

207.6  /  21.0 

67.0  /  13.4 

49.6 

16.6 

174.0  /  13.8 

22.0 
ABSTRACT. Simulated annealing algorithms for optimization over continuous spaces
come in two varieties: Markov chain algorithms and modified gradient algorithms.
Unfortunately, there is a gap between the theory and the application of these algorithms: the
convergence conditions cannot be practically implemented. In this paper we suggest a practical
methodology for implementing the modified gradient annealing algorithms based on their
relationship to the Markov chain algorithms.

1. INTRODUCTION. Simulated annealing is a popular approach to global optimization
of functions with multiple local minima. One type of annealing algorithm for continuous
optimization involves simulating a Markov chain using a generalized Metropolis (or
related) method. We refer to these algorithms as Markov chain annealing algorithms
(MCAA's). There is a large amount of theoretical analysis and practical methodology
developed for the MCAA's (Vanderbilt and Louie, 1984; Bohachevsky et al., 1986; Corana
et al., 1987; Brooks and Verdini, 1988; Press and Teukolsky, 1991; Gelfand and Mitter,
1992). However, the feasibility of MCAA's for high-dimensional problems is questionable.

Another type of annealing algorithm for continuous optimization involves modifying
gradient-type search algorithms. Let $U(\cdot)$ be a smooth cost function on $\mathbb{R}^D$. A standard
gradient algorithm for finding a local minimum of $U(\cdot)$ (and hence a global minimum if $U(\cdot)$ is
convex) is given by

$$z_{k+1} = z_k - \mu \nabla U(z_k)$$

where $\mu$ is a step-size parameter. A modified gradient algorithm for finding a global (or near
global) minimum of $U(\cdot)$ is given by

$$X_{k+1} = X_k - \mu \nabla U(X_k) + \sqrt{2T\mu}\, W_k$$

where $\{W_k\}$ is a white Gaussian noise sequence and T is a "temperature" parameter which is
slowly decreased as the algorithm proceeds. The idea behind this algorithm is that by
artificially adding in the noise term (via Monte Carlo simulation) it is possible to escape from
strictly local minima. We refer to this modified gradient algorithm as a gradient annealing
algorithm (GAA). Now there is some theoretical analysis developed for the gradient annealing
algorithm (Kushner, 1987; Gelfand and Mitter, 1991b,c), but no practical methodology
that we are aware of. On the other hand, there may be some hope of using GAA for
high-dimensional problems with smooth well-behaved cost functions, as it attempts to exploit the
smoothness by its use of derivatives. The goal of this paper is to use some theory from Gelfand
and Mitter (1991a) relating the MCAA and GAA, and some practical methodology
from Johnson et al. (1989) for the MCAA, to develop a practical methodology for GAA.
2. MARKOV CHAIN ANNEALING ALGORITHMS. Most of the theory and application
of MCAA deals with discrete (combinatorial) optimization. The literature on MCAA's
for continuous optimization is by and large a straightforward generalization of the discrete
case. It is this point of view we discuss in this section. The discussion is very brief and the
reader is referred to the literature for more details.

Let $U(\cdot)$ be a cost function on $\mathbb{R}^D$. We wish to find an element of $\mathbb{R}^D$ which minimizes
$U(\cdot)$. A general description of the MCAA for solving this problem is as follows (we
only consider the Metropolis procedure here):

Given a current solution $x \in \mathbb{R}^D$ generate a candidate solution $y \in \mathbb{R}^D$.
If $U(y) \le U(x)$ then accept y as the next solution.

If $U(y) > U(x)$ then accept y as the next solution with probability $\exp(-(U(y) - U(x))/T)$
(otherwise the next solution is the current solution x).

Here the candidate solution is usually a probabilistically generated perturbation of the
current solution. Also, the "temperature" parameter T is slowly decreased as the algorithm
proceeds, making transitions to higher cost states less likely. The algorithm stops subject to
some termination criterion.
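The acceptance rule above can be written as a single step function. This is a minimal sketch: the names `metropolis_step` and `propose` are hypothetical, and the candidate generator is left to the caller because the text does not fix one at this point.

```python
import math
import random


def metropolis_step(U, x, T, propose, rng):
    """One Metropolis move at temperature T > 0.

    U       : cost function
    propose : callable (x, rng) -> candidate y (assumed supplied by the caller)
    Returns the next state and a flag telling whether the candidate was accepted.
    """
    y = propose(x, rng)
    dU = U(y) - U(x)
    # Downhill (or equal cost) moves are always accepted; uphill moves are
    # accepted with probability exp(-(U(y) - U(x)) / T).
    if dU <= 0.0 or rng.random() < math.exp(-dU / T):
        return y, True
    return x, False


# Usage with a scalar quadratic cost and a Gaussian perturbation.
rng = random.Random(0)
state, accepted = metropolis_step(
    lambda x: x * x, 1.0, 0.5, lambda x, r: x + r.gauss(0.0, 0.1), rng
)
```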

The MCAA can be precisely formulated as a continuous state Markov chain as follows.
Let q(x,y) be a transition probability density from x to y ($x, y \in \mathbb{R}^D$); q(x,y) is a probability
density for the candidate state y given the current state x. The continuous state annealing
chain $\{Y_k\}$ (at a fixed temperature T) has 1-step transition probability density from x to y
given by

(2.1)  $$p(T,x,y) = s(T,x,y)\, q(x,y) + m(T,x)\, \delta(y - x)$$

where

$$s(T,x,y) = \exp\left( -\frac{[U(y) - U(x)]^+}{T} \right)$$

and m(T,x) is chosen to provide the correct normalization. Here $[\cdot]^+$ denotes positive part
and $\delta(\cdot)$ is a Dirac delta function. For a fixed temperature T this annealing chain $\{Y_k\}$ has a
Gibbs equilibrium distribution with density function

$$\pi(T,x) = \frac{1}{Z(T)} \exp\left( -\frac{U(x)}{T} \right), \qquad Z(T) = \int \exp\left( -\frac{U(x)}{T} \right) dx \;\; (< \infty),$$

and as the temperature T tends to zero we get $\pi(T,\cdot)$ converging to a density $\pi^*(\cdot)$, which is
concentrated on the global minima of $U(\cdot)$. If the rate of temperature decrease is slow
enough, then $\{Y_k\}$ remains near the equilibrium distributions and also concentrates on the
global minima for k large (Gelfand and Mitter, 1992) (we note that the proof of convergence
in the continuous case naturally requires many more technical assumptions and details than
the discrete case).


Unfortunately,  there  is  a  large  gap  between  the  theory  and  application  of  the  MCAA. 
The  main  problem  is  that  the  theoretically  appropriate  rates  of  decrease  for  the  temperature 
are  far  too  slow  for  practical  implementation.  In  practice,  one  needs  a  temperature  schedule, 
a  candidate  generator  and  a  termination  criterion  which  achieve  desirable  tradeoffs  between 
complexity  and  performance. 

A practical methodology for continuous state MCAA's can be adopted with relatively
few changes from the methodology for discrete state MCAA's developed by Johnson et al.
(1989) (refinements of this latter methodology form the basis for most implementations of
MCAA's in both continuous and discrete state-space). A key quantity in this methodology is
the acceptance probability $P_A(T)$ which is estimated by

(2.2)  $$P_A(T) = \frac{N_A(T)}{N(T)}$$

where $N_A(T)$ is the number of moves accepted, and N(T) is the number of moves attempted,
at temperature T. Although there is some motivation for allowing N(T) to increase with
decreasing T, the experiments by Johnson et al. (1989) suggest that there is no real advantage
to doing so, and hence N(T) is fixed at some number N. The methodology proceeds by making
this fixed number of iterations (attempted moves) at each of a sequence of geometrically
decaying temperatures, and the initial and final temperatures are selected by requiring that
the acceptance probability be specified values. For termination in the discrete case it is also
required that the running cost for the best solution has not decreased over the 5 previous
temperature values; in the continuous case considered here we modify this to only require that
the running cost for the best solution has not decreased by more than a small threshold. A
summary description of the algorithm is given below.


Markov chain annealing algorithm methodology. Input parameters: $p_0$ (initial acceptance
probability), $p_F$ (final acceptance probability), $\rho$ (geometric ratio in temperature
schedule), N (number of iterations at any temperature), $\epsilon$ (termination threshold)

1. Find initial temperature $T_0$ such that $P_A(T_0) = p_0$ ($P_A(T_0)$ given by Equation (2.2)).

2. Set j = 0.

3. Run the annealing chain N iterations at temperature $T_j = \rho^j T_0$.

4. Let $Y_j^*$ be the best solution found through temperature $T_j$.
If $P_A(T_j) \le p_F$ and $U(Y_j^*) \ge U(Y_{j-5}^*) - \epsilon$

then terminate the search and output $Y_j^*$ and $U(Y_j^*)$

else set j = j + 1 and go to 3. □
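The four steps above can be sketched as follows. Several details are assumptions made only for this illustration: the rule used in step 1 (doubling a trial temperature until the observed acceptance ratio reaches p0), the single-coordinate Gaussian candidate with fixed standard deviation `sigma`, the default parameter values, and the cap `max_temps` on the number of temperature levels.

```python
import math
import random


def run_chain(U, x, T, n_iter, sigma, rng):
    """Run n_iter Metropolis moves at temperature T.

    Returns (last state, best state seen, accepted / attempted), cf. (2.2).
    """
    accepted, best = 0, x
    for _ in range(n_iter):
        i = rng.randrange(len(x))
        y = list(x)
        y[i] += rng.gauss(0.0, sigma)      # perturb one random coordinate
        dU = U(y) - U(x)
        if dU <= 0.0 or rng.random() < math.exp(-dU / T):
            x = y
            accepted += 1
            if U(x) < U(best):
                best = x
    return x, best, accepted / n_iter


def mcaa(U, x0, p0=0.4, pF=0.02, rho=0.9, N=200, eps=1e-6,
         sigma=0.5, seed=0, max_temps=500):
    rng = random.Random(seed)
    # Step 1 (assumed search rule): raise T0 until the acceptance ratio >= p0.
    T0 = 1e-3
    while run_chain(U, list(x0), T0, N, sigma, rng)[2] < p0:
        T0 *= 2.0
    x, best, history = list(x0), list(x0), []
    T = T0
    for j in range(max_temps):             # steps 2-4
        T = rho ** j * T0                  # geometric temperature schedule
        x, cand, pa = run_chain(U, x, T, N, sigma, rng)
        if U(cand) < U(best):
            best = cand
        history.append(U(best))
        # Stop when acceptance is low and the best cost has stalled over
        # the 5 previous temperature values.
        if pa <= pF and j >= 5 and history[j] >= history[j - 5] - eps:
            break
    return best, U(best), T
```

A call such as `mcaa(lambda x: (x[0]**2 - 1.0)**2 + 0.3*x[0] + x[1]**2, [1.5, 1.0])` runs the sketch on a cost with two local minima; the numbers it returns depend on the assumed parameter values.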

Although there have been some successes reported with MCAA's of the general type
described above, it has been observed that the method is very inefficient for high dimensional
problems, essentially because it does not exploit the smoothness of the cost function.
In the next section, we discuss the GAA which may overcome this inefficiency in some
problems.
3. GRADIENT ANNEALING ALGORITHM. Let $U(\cdot)$ be a smooth cost function (at
least $C^2$) on $\mathbb{R}^D$. We wish to find an element of $\mathbb{R}^D$ which minimizes $U(\cdot)$. Here we
consider GAA as an alternative to the MCAA described in Section 2.

The GAA (with a fixed step size $\mu$ and temperature T) is given by the following
stochastic recursion

(3.1)  $$X_{k+1} = X_k - \mu \nabla U(X_k) + \sqrt{2T\mu}\, W_k$$

where $\{W_k\}$ is a standard D-dimensional white Gaussian noise sequence, artificially added
in (via Monte Carlo simulation) to try to avoid getting trapped in local minima, and the
temperature T (and possibly the step-size $\mu$) is slowly decreased as k gets large. The asymptotic
(large-time) behavior of GAA is similar to that of MCAA. For fixed temperature T and small
step-size $\mu$ the process $\{X_k\}$ nearly has a Gibbs equilibrium distribution with density
function $\pi(T,\cdot)$, and as the temperature T tends to zero we get $\pi(T,\cdot)$ converging to a $\pi^*(\cdot)$ which
is concentrated on the global minima of $U(\cdot)$. If the rates of temperature and step-size
decrease are chosen appropriately, then $\{X_k\}$ remains near the equilibrium distributions and
also concentrates on the global minima for large k (Kushner, 1987; Gelfand and Mitter,
1991b,c).
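A single step of the recursion (3.1) is short enough to write out. The sketch below is illustrative; the function name and the list-of-floats representation of the state are assumptions.

```python
import math
import random


def gaa_step(grad_U, x, T, mu, rng):
    """One step X_{k+1} = X_k - mu * grad U(X_k) + sqrt(2 T mu) * W_k."""
    scale = math.sqrt(2.0 * T * mu)
    g = grad_U(x)
    # W_k has independent standard Gaussian components.
    return [xi - mu * gi + scale * rng.gauss(0.0, 1.0) for xi, gi in zip(x, g)]


# Usage: quadratic cost U(x) = |x|^2 / 2, whose gradient is x itself.
rng = random.Random(0)
x_next = gaa_step(lambda x: list(x), [1.0, -2.0], 0.1, 0.01, rng)
```

With T = 0 the noise term vanishes and the step reduces to the standard gradient iteration. For the one-dimensional cost U(x) = x^2/2 the recursion is linear and its stationary variance is T/(1 - mu/2), which approaches the Gibbs variance T as the step size tends to zero.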

GAA is plagued by the same gap between theory and application as the MCAA's.
Theoretically appropriate rates of decrease for the temperature schedule are too slow for
practical implementation, and no results are available concerning the important case of fixed
step-size which by analogy with standard gradient algorithms is necessary for rapid
convergence. In practice one needs a temperature schedule, a step-size (assumed known and fixed
here) and a termination criterion which achieve desirable tradeoffs between complexity and
performance. Practical implementation of GAA appears not to have received any attention
in the literature.

We shall suggest a practical methodology for GAA based on the methodology for
MCAA's discussed in Section 2, and the relationship between GAA and MCAA which we
shall elaborate on below. We shall show that GAA and a certain class of MCAA interpolated
into continuous time (with step-size/interpolation interval $\mu$) both have a diffusion limit
(as $\mu \to 0$), and these diffusion limits are linearly time-scaled versions of one another.
Hence by taking into account the appropriate time-scaling, we can use the MCAA methodology
as a basis for a GAA methodology. An important feature of this approach is that it
allows us to implicitly associate the idea of acceptance probability with GAA, a critical
quantity in developing temperature schedules and initialization and termination criteria for
most practical annealing schemes.

3.1. Diffusion Limits for MCAA and GAA. We first formulate a MCAA which has the
appropriate structure and scaling to admit a diffusion limit. Referring to the general version
of the MCAA in Section 2 we consider here a transition density $q(\cdot,\cdot)$ which corresponds to
selecting a coordinate direction at random, and then making a Gaussian perturbation along
that coordinate. Let $x_i$ denote the i-th coordinate of $x \in \mathbb{R}^D$. We choose

(3.2)  $$q(x,y) = \frac{1}{D} \sum_{i=1}^{D} N(x_i, a\mu)(y_i) \prod_{j \ne i} \delta(y_j - x_j)$$

where $N(m,\sigma^2)(\cdot)$ denotes a (scalar) Gaussian density with mean m and variance $\sigma^2$. Note
that the variance of the Gaussian perturbation along the selected coordinate is $a\mu$ where a
does not depend on T (so that $q(\cdot,\cdot)$ does not depend on T) and is to be specified. Let $\{Y_k\}$
be a Markov chain with 1-step transition density $p(T,\cdot,\cdot)$ given by Equations (2.1) and (3.2).
Interpolate $\{Y_k = Y_k^\mu : k = 0,1,\ldots\}$ into a continuous-time process $\{Y^\mu(t) : t \ge 0\}$ by

$$Y^\mu(t) = Y_k^\mu, \qquad t \in [k\mu, (k+1)\mu), \quad k = 0,1,\ldots$$

It can be shown (Gelfand and Mitter, 1991a) that $Y^\mu(\cdot)$ has a diffusion limit, i.e.,
$Y^\mu(\cdot) \to Y(\cdot)$ as $\mu \downarrow 0$ (in law), where $Y(\cdot)$ satisfies the Ito equation

$$dY(t) = -\frac{a}{2TD} \nabla U(Y(t))\, dt + \sqrt{\frac{a}{D}}\, dV(t)$$

and $V(\cdot)$ is a standard D-dimensional Wiener process.

Next, consider the GAA $\{X_k\}$ given by Equation (3.1). Interpolate $\{X_k = X_k^\mu : k = 0,1,\ldots\}$
into a continuous-time process $\{X^\mu(t) : t \ge 0\}$ by

$$X^\mu(t) = X_k^\mu, \qquad t \in [k\mu, (k+1)\mu), \quad k = 0,1,\ldots$$

It is easy to show that $X^\mu(\cdot)$ also has a diffusion limit, i.e., $X^\mu(\cdot) \to X(\cdot)$ as $\mu \downarrow 0$ (in law),
where $X(\cdot)$ satisfies the Ito equation

$$dX(t) = -\nabla U(X(t))\, dt + \sqrt{2T}\, dW(t)$$

and $W(\cdot)$ is a standard D-dimensional Wiener process.

Now consider the process defined by linearly scaling time by a factor $\beta > 0$ in the
process $X(\cdot)$:

$$\tilde{X}(t) = X(\beta t)$$

By standard calculations $\tilde{X}(\cdot)$ satisfies the Ito equation

$$d\tilde{X}(t) = -\beta \nabla U(\tilde{X}(t))\, dt + \sqrt{2T\beta}\, d\tilde{W}(t)$$

where $\tilde{W}(\cdot)$ is a standard D-dimensional Wiener process. From the (assumed) uniqueness of
the Ito equation solution, it is seen that if we take

$$\beta = \beta(T) = \frac{a}{2TD}$$

then $Y(\cdot) = \tilde{X}(\cdot)$ (in law), i.e., $Y(\cdot)$ is a linearly time-scaled version of $X(\cdot)$, with scale-factor
$\beta(T)$ depending inversely on the temperature T.
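The "standard calculations" behind the time-scaling can be spelled out in a few lines. The following is a sketch of the usual argument; the tilde notation for the time-changed process and its driving Wiener process is an assumption of this sketch.

```latex
\begin{align*}
\tilde{X}(t) = X(\beta t)
  &= X(0) - \int_0^{\beta t} \nabla U(X(s))\, ds + \sqrt{2T}\, W(\beta t) \\
  &= \tilde{X}(0) - \beta \int_0^{t} \nabla U(\tilde{X}(r))\, dr
     + \sqrt{2T\beta}\, \tilde{W}(t),
  \qquad \tilde{W}(t) := \beta^{-1/2}\, W(\beta t),
\end{align*}
% substitution s = \beta r in the drift integral; \tilde{W} is again a
% standard Wiener process by Brownian scaling.
\begin{align*}
\text{drift:}\quad \beta = \frac{a}{2TD},
\qquad
\text{diffusion:}\quad 2T\beta = \frac{a}{D}
\;\Longleftrightarrow\; \beta = \frac{a}{2TD}.
\end{align*}
```

Both the drift and the diffusion coefficients of the two limits are matched by the same value of the scale factor, which is why a single linear time change suffices.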

3.2. Toward a Methodology for GAA. In view of the limit diffusion behavior exhibited
by both GAA and MCAA, we have that under suitable conditions, GAA is close to a linearly
time-scaled version of MCAA. This suggests that we can use the MCAA methodology to
guide the GAA methodology by correcting for the scaling (this is not to say that GAA and
MCAA perform the same; see the discussion in Section 4).

The idea is the following. Suppose we run the MCAA $\{Y_k\}$ for N iterations at
temperature T, and consider the GAA $\{X_k\}$ also at temperature T. Then since the limit diffusion
$Y(\cdot)$ for $\{Y_k\}$ is a linearly time-scaled version of the limit diffusion $X(\cdot)$ for $\{X_k\}$ with scale
factor $\beta(T)$, the suggestion is to run the GAA $\{X_k\}$ for $N(T) = \beta(T) N$ iterations at
temperature T to compensate for the time scaling. Now we still need to choose the parameter a in
the variance of the MCAA. We do this by choosing $\beta(T_0) = 1$, i.e., we choose the GAA and
MCAA to run at the same time scale at the initial (high) temperature value $T_0$ (this choice is
somewhat arbitrary but avoids introducing additional parameters). Hence $a = 2 T_0 D$ and so
$\beta(T) = T_0/T$ and thus $N(T) = (T_0/T) N$, and we run the GAA $\{X_k\}$ for

$$N_j = \rho^{-j} N_0$$

iterations at temperature $T_j = \rho^j T_0$. From a practical point of view, we may impose a ceiling
on the number of iterations the GAA can make at any temperature.
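The resulting schedule pairs a geometrically decreasing temperature with a geometrically increasing iteration count. A minimal sketch, in which the function name, the rounding to an integer count and the optional `ceiling` argument are assumptions made for the illustration:

```python
def gaa_schedule(T0, N0, rho, n_temps, ceiling=None):
    """Return [(T_j, N_j)] with T_j = rho**j * T0 and N_j = rho**(-j) * N0.

    N_j is rounded to an integer and, if a ceiling is given, capped by it.
    """
    schedule = []
    for j in range(n_temps):
        Tj = rho ** j * T0
        Nj = round(N0 / rho ** j)          # equals (T0 / Tj) * N0
        if ceiling is not None:
            Nj = min(Nj, ceiling)
        schedule.append((Tj, Nj))
    return schedule


# Hypothetical values: halve the temperature at every level, cap at 50 steps.
levels = gaa_schedule(T0=8.0, N0=10, rho=0.5, n_temps=4, ceiling=50)
```

Without a ceiling the product T_j N_j stays equal to T0 N0 at every level, which is the compensation for the time scaling described above.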

The  basic  structure  of  the  MCAA  methodology  now  carries  over  to  a  GAA,  except  that 
the  MCAA  uses  a  fixed  number  of  iterations  at  each  of  a  geometrically  decreasing  sequence 
of  temperatures,  while  the  GAA  uses  a  geometrically  increasing  sequence  of  iterations  at  a 
geometrically  decreasing  sequence  of  temperatures. 

To apply the MCAA methodology to GAA it is desirable to find a good estimate of the
acceptance probability, which is used to determine the initial and final temperatures.
Clearly, it is not desirable to estimate the acceptance probability via a Monte Carlo simulation
of a MCAA (in addition to the GAA). Now in view of the limit diffusion analysis, the
appearance of the gradient term in the GAA can be viewed as a local approximation in a certain
MCAA. This approximation is possible because of the (assumed) smoothness in the cost
function and the smallness of the step size, and should result in significant computational and
performance advantages for GAA. We shall next discuss how to make some other local
approximations in the MCAA to facilitate estimation of the acceptance probability, which
should make for more efficient determination of the initial and final temperatures.

The acceptance probability at temperature T is given by

$$P_A(T) = \int \pi(T,x)\, P_A(T|x)\, dx$$

where

$$P_A(T|x) = \int s(T,x,y)\, q(x,y)\, dy$$

is the conditional probability of accepting a candidate move given the current state is x. We
develop an approximation to $P_A(T)$ as follows. Substituting for $q(\cdot,\cdot)$ from Equation (3.2)
and setting $a = 2 T_0 D$ we can write

$$P_A(T|x) = \frac{1}{D} \sum_{i=1}^{D} P_A(T|x,i)$$

where

$$P_A(T|x,i) = \int s(T,x,y)\, N(x_i, 2 T_0 D \mu)(y_i) \prod_{j \ne i} \delta(y_j - x_j)\, dy$$

is the conditional probability of accepting a candidate move given the current state is x and
coordinate i is selected for perturbation. Fix T, x and i for the moment and let

$$\bar{x} = (x_1, \ldots, x_{i-1}, y_i, x_{i+1}, \ldots, x_D).$$

Then

$$P_A(T|x,i) = \int \exp\left( -\frac{[U(\bar{x}) - U(x)]^+}{T} \right) N(x_i, 2 T_0 D \mu)(y_i)\, dy_i.$$

We estimate $P_A(T|x,i)$ by

$$\hat{P}_A(T|x,i) = \int \exp\left( -\frac{[U_{x_i}(x)(y_i - x_i) + U_{x_i x_i}(x)(y_i - x_i)^2/2]^+}{T} \right) N(x_i, 2 T_0 D \mu)(y_i)\, dy_i.$$

It is possible to work out expressions for $\hat{P}_A(T|x,i)$ in terms of the exponential and error
functions. Finally we estimate $P_A(T)$ by

(3.3)  $$\hat{P}_A(T) = \frac{1}{N(T)} \sum_{k=1}^{N(T)} \hat{P}_A(T|X_k), \qquad \hat{P}_A(T|x) = \frac{1}{D} \sum_{i=1}^{D} \hat{P}_A(T|x,i)$$

where the average in the first equation is computed over the N(T) iterations of GAA at
temperature T.
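The per-coordinate estimate can also be evaluated numerically. The sketch below replaces the closed-form exponential and error-function expressions mentioned in the text with a plain trapezoidal rule; this is a swapped-in technique chosen for brevity, and the function names, grid size and integration width are assumptions.

```python
import math


def accept_prob_coord(g, h, T, var, n=4001, width=8.0):
    """Trapezoidal approximation of the per-coordinate estimate.

    Integrates exp(-[g d + h d^2 / 2]^+ / T) against a N(0, var) density in
    d = y_i - x_i, where g and h stand for the first and second partial
    derivatives of U along coordinate i.
    """
    sd = math.sqrt(var)
    a = -width * sd
    step = 2.0 * width * sd / (n - 1)
    norm = 1.0 / math.sqrt(2.0 * math.pi * var)
    total = 0.0
    for k in range(n):
        d = a + k * step
        increase = max(g * d + 0.5 * h * d * d, 0.0)   # positive part
        f = math.exp(-increase / T) * norm * math.exp(-d * d / (2.0 * var))
        total += f * (0.5 if k in (0, n - 1) else 1.0)
    return total * step


def accept_prob_state(grad, hess_diag, T, T0, mu):
    """Average of the per-coordinate estimates over the D coordinates."""
    D = len(grad)
    var = 2.0 * T0 * D * mu                # proposal variance a * mu, a = 2 T0 D
    return sum(accept_prob_coord(g, h, T, var)
               for g, h in zip(grad, hess_diag)) / D
```

Averaging `accept_prob_state` over the GAA iterates at a given temperature gives the estimate in the first equation of (3.3). For a locally flat cost (zero first and second derivatives) every move is accepted and the function returns a value close to 1.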

A  summary  description  of  the  proposed  methodology  for  the  GAA  is  given  below. 

Gradient Annealing Algorithm Methodology. Input parameters: $p_0$ (initial acceptance
probability), $p_F$ (final acceptance probability), $\rho$ (geometric ratio in temperature schedule),
$N_0$ (number of iterations at initial temperature), $\epsilon$ (termination threshold)

1. Find initial temperature $T_0$ such that $\hat{P}_A(T_0) = p_0$ ($\hat{P}_A(T_0)$ given by Equation (3.3)).

2. Set j = 0.

3. Run the modified gradient algorithm $N_j = \rho^{-j} N_0$ iterations at temperature $T_j = \rho^j T_0$.

4. Let $X_j^*$ be the best solution found through temperature $T_j$.

If $\hat{P}_A(T_j) \le p_F$ and $U(X_j^*) \ge U(X_{j-5}^*) - \epsilon$

then terminate the search and output $X_j^*$ and $U(X_j^*)$

else set j = j+1 and go to 3. □
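Steps 2 to 4 of this methodology can be sketched as a loop. Several things here are assumptions made for illustration only: the initial temperature is passed in rather than found by step 1, the acceptance estimate is supplied as a callable `accept_prob(x, T)` standing for the per-state estimate in (3.3), and the default parameters, the iteration ceiling and the cap `max_temps` are arbitrary.

```python
import math
import random


def gaa(U, grad_U, accept_prob, x0, T0, pF=0.05, rho=0.8, N0=50,
        eps=1e-6, mu=0.01, ceiling=5000, max_temps=60, seed=0):
    rng = random.Random(seed)
    x, best, history = list(x0), list(x0), []
    T = T0
    for j in range(max_temps):
        T = rho ** j * T0                          # T_j = rho^j T0
        Nj = min(round(N0 / rho ** j), ceiling)    # N_j = rho^(-j) N0, capped
        scale = math.sqrt(2.0 * T * mu)
        pa_sum = 0.0
        for _ in range(Nj):
            g = grad_U(x)
            # Recursion (3.1): gradient step plus scaled Gaussian noise.
            x = [xi - mu * gi + scale * rng.gauss(0.0, 1.0)
                 for xi, gi in zip(x, g)]
            pa_sum += accept_prob(x, T)
            if U(x) < U(best):
                best = list(x)
        history.append(U(best))
        # Terminate when the averaged acceptance estimate is low and the best
        # cost has stalled over the 5 previous temperature values.
        if pa_sum / Nj <= pF and j >= 5 and history[j] >= history[j - 5] - eps:
            break
    return best, U(best), T
```

The structure mirrors the MCAA loop except that the number of iterations grows as the temperature falls and no candidate is ever rejected; the acceptance estimate is used only for the stopping test.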

4. CONCLUSIONS. In this paper we have developed a methodology for GAA based
on the relationship between GAA and MCAA. The idea here is that GAA and a certain
MCAA have diffusion limits which are linearly time-scaled versions of each other, which
suggests that a GAA methodology can be obtained from a MCAA methodology by correcting
for the time-scaling. This approach allows us to associate the idea of acceptance probability
with GAA, a quantity which plays a critical role in temperature schedules for most
practical annealing schemes. We also show how to make some local approximations which
facilitate better estimation of the acceptance probability.

The  experimental  evaluation  of  the  proposed  GAA  methodology  is  currently  being 
undertaken.  One  interesting  comparison  would  be  between  GAA  and  MCAA.  Our  intuition 
is  that  GAA  will  do  a  better  job  of  finding  a  global  minimum  than  MCAA  for  sufficiently 
smooth,  well-behaved  cost  functions.  For  this  to  make  sense,  the  implicit  assumption  that 
we  are  making  is  that  GAA  and  MCAA  are  close  enough  to  their  diffusion  limit  to  have  a 
similar  methodology  (i.e.,  structure  of  their  temperature  schedule  and  termination  criterion), 
but  far  enough  from  their  diffusion  limit  to  have  distinctly  different  performance  on  certain 
problems. 
ABSTRACT

Transient  response  analysis  of  a  hollow  long  cylindrical  section  subjected  to  an  internal 
explosive  blast  has  been  conducted  from  a  structural  integrity  standpoint.  The  reflected 
blast  overpressure  upon  the  internal  surface  of  the  cylinder  was  estimated  based  on  a  cube 
root  scaling  law  and  the  modified  Friedlander  exponential  decay.  A  closed  form  solution  of 
the  governing  equation  of  motion  for  the  internally  pressurized  cylinder  was  obtained  subject 
to  initial  conditions.  The  solution  was  optimized  by  a  trial  and  error  approach  to  yield  an 
optimum  time  which  was  employed  to  predict  the  peak  response. 

INTRODUCTION 

There  is  a  continuing  need  for  modeling  the  structural  damage  due  to  explosive  blast 
to  ensure  structural  integrity  and  to  rationally  provide  hardening  for  such  structures.  This 
has  been  a  subject  of  long  standing  interest  for  the  U.S.  Army  and  the  Ballistic  Research 
Laboratory  (BRL).  A  number  of  studies  have  been  performed  and  damage  data  [1-6]  gathered 
over  the  past  several  years.  However,  most  data  available  at  present  are  in  the  form  of  impulse 
correlation  curves  and  residual  deformation  and  relatively  little  has  been  devoted  to  transient 
response  studies  due  to  explosive  blast  effects  on  structures  at  high  strain  rates. 

Recently, computation using hydrodynamic codes for shock wave propagation and structural
loading estimation [7-9] has been reported. Unfortunately the application of such codes
to generate loading and subsequent coupling with structural response prediction codes is
rather expensive and laborious. Such procedures are justifiable when a high degree of accuracy
is needed in modeling complex problems involving large deformation and nonlinearity.
A detailed analysis with coupled codes is beyond the scope of the current investigation due to
time and cost constraints. This investigation involves the development of a simplified model
and formulation of the equation of motion as well as subsequent effort at a closed-form solution
based on simplifying assumptions and previous work on blast loaded plates [10-19] to
obtain optimum structural response and critical time of occurrence for the structure.


ESTIMATION  OF  TRANSIENT  LOADS 


The  transient  loads  were  estimated  under  the  assumption  that  the  detonation  of  the 
explosive would generate a blast wave which would impose a uniform reflected overpressure
on  the  internal  surface  of  a  long  cylindrical  hollow  tube  section  with  open  ends.  Although 
some  variation  in  internal  pressure  peaks,  arrival  and  duration  times  at  different  locations  of 
the  section  can  be  expected  due  to  variation  in  scaled  distance  from  the  point  of  detonation 
to  the  surface  of  the  cylinder,  for  the  sake  of  simplicity,  this  variation  has  been  ignored  in 
the  current  investigation. 

For  the  estimation  of  reflected  overpressure  loading,  a  conservative  cube-root  scaling 
law  [18,19]  is  employed  to  compute  the  scaled  distance,  z,  from  the  charge  location  to  the 
nearest  location  of  the  wall  in  the  form 


$z = R / W_E^{1/3}$  (1)

where $W_E$ is the equivalent charge weight and R is the distance of the nearest point of the
wall from the charge location.

Once  the  scaled  distance  is  known,  the  reflected  parameters  such  as  peak  overpressure, 
impulse,  duration  time  and  time  of  arrival  of  the  blast  wave  could  be  estimated  from  compiled 
air  blast  tables  or  the  Blast  code  [20-23].  The  decay  of  the  reflected  overpressure  is  assumed 
to  obey  the  modified  Friedlander  exponential  decay  equation  which  can  be  written  as 


$P(t) = P_m [1 - t/t_p]\, e^{-a' t/t_p}$  (2)

where  tp  is  the  positive  pressure  phase  duration  of  the  impulse  measured  from  the  time  of 
arrival  of  the  blast  wave  front  at  the  nearest  internal  cylindrical  wall  location,  Pm  is  the  peak 
reflected  pressure  in  excess  of  the  ambient  condition  and  t  is  the  total  elapsed  time. 

The  total  reflected  impulse  imposed  on  the  surface  of  the  cylinder  can  be  obtained  by 
integrating  the  pressure  with  respect  to  time  from  initial  up  to  the  positive  phase  duration 
as  shown  below 


$I_r = \int_0^{t_p} P(t)\, dt$  (3)

where we have tacitly assumed that time zero is the instant when the blast wave reaches
the internal wall, so that arrival times can be ignored. Substituting the expression
for P(t) from equation (2) in the above and performing the indicated integration results
in


$I_r = P_m t_p [a' - 1 + e^{-a'}] / a'^2$  (4)

where $I_r$ is the reflected impulse. Using the value of z from equation (1), one can obtain
the values for $I_r$, $P_m$ and $t_p$ from Reference [21]. The value of a', which is the decay
exponent, can be determined using the above three quantities in equation (4) by a trial and
error implicit solution. When a' is determined, the complete reflected pressure-time loading
history upon the structure is known.
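The implicit solution for a' can be sketched as follows. This is a minimal illustration, not the authors' procedure: bisection stands in for the trial and error search, the function names are hypothetical, and the numbers in the usage note are made up rather than taken from Reference [21]. It relies on the bracketed term in equation (4) decreasing monotonically as a' grows, which holds for positive a'.

```python
import math


def friedlander(t, p_m, t_p, a):
    """Equation (2): reflected overpressure in the positive phase, zero outside it."""
    if t < 0.0 or t > t_p:
        return 0.0
    return p_m * (1.0 - t / t_p) * math.exp(-a * t / t_p)


def reflected_impulse(p_m, t_p, a):
    """Equation (4): closed-form impulse of the pulse over [0, t_p]."""
    return p_m * t_p * (a - 1.0 + math.exp(-a)) / a**2


def solve_decay_exponent(i_r, p_m, t_p, lo=1e-3, hi=50.0, tol=1e-10):
    """Bisection on equation (4) for a', given impulse, peak pressure and duration."""
    def shape(a):
        return (a - 1.0 + math.exp(-a)) / a**2

    target = i_r / (p_m * t_p)
    if not (shape(hi) <= target <= shape(lo)):
        raise ValueError("impulse outside the range reachable by equation (4)")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if shape(mid) > target:
            lo = mid  # impulse still too large: the exponent must increase
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For example, `solve_decay_exponent(reflected_impulse(2.0e6, 1.0e-3, 1.7), 2.0e6, 1.0e-3)` recovers the exponent 1.7 used to generate the impulse.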
Fig. 1. Free body diagram of a pressurized cylindrical section.

FORMULATION  OF  THE  EQUATION  OF  MOTION 

The  equation  of  motion  for  a  hollow  cylindrical  section  subjected  to  an  internal  blast 
from  the  detonation  of  a  symmetrically  located  charge  weight  can  be  obtained  by  considering 
the free-body diagram in Figure 1.

Imposing a balance of forces in the radial direction yields the governing relationship for
the cylindrical section as follows:

$P(t) R \theta - \sigma_{\theta\theta} h \sin(\theta) = M\, d^2 u_r / dt^2$  (5)

where P(t) is the internal transient pressure, R is the internal radius, M is the mass per
unit length given by $M = \rho R \theta h$, $\rho$ is the mass density of the material from which the
cylinder has been fabricated, $d^2 u_r / dt^2$ is the acceleration of the differential section in the
radial direction, $\sigma_{\theta\theta}$ is the circumferential stress in the hoop direction and the remaining
variables are defined in Figure 1. To obtain the transient structural response, the differential
equation above needs to be reduced to a form amenable to a closed-form solution subject to
appropriate initial and boundary conditions.

For a differential circumferential segment, $\sin(\theta)$ approaches $\theta$, which can be replaced
by $d\theta$ in the differential form, and equation (5) can be simplified to the form:

$d^2 u_r / dt^2 + \sigma_{\theta\theta} / (\rho R) = P(t) / (\rho h)$  (6)

In order to solve the above equation, we need to obtain expressions for $\sigma_{\theta\theta}$ and P(t) as
a function of $u_r$.
The strain-displacement relations for the normal components of a long hollow cylinder
with axisymmetric loading are given as

$\epsilon_{rr} = \partial u_r / \partial R$,

$\epsilon_{\theta\theta} = u_r / R$,  (7)

and $\epsilon_{zz} = 0$.

Assuming linear elastic conditions and noting that volumetric stress in the radial direction
is rather small for the long hollow cylinder, an expression for volumetric strain in the
hoop direction can be derived as

$\epsilon_{\theta\theta} = -[(1 - \nu)/\nu]\, \epsilon_{rr}$  (8)

Invoking Hooke's Law, circumferential stress in the cylinder could be obtained as

$\sigma_{\theta\theta} = [E / ((1 + \nu)(1 - 2\nu))]\,[\nu \epsilon_{rr} + (1 - \nu) \epsilon_{\theta\theta}]$  (9)

where $\epsilon_{rr}$ and $\epsilon_{\theta\theta}$ are volumetric strain in the radial and circumferential directions
respectively, $\nu$ is Poisson's ratio and E is the modulus of elasticity of the material. Substituting
$\epsilon_{rr}$ and $\epsilon_{\theta\theta}$ from equations (7,8) in above results in an expression for the circumferential hoop
stress as a function of the radial displacement and the internal averaged radius which could
be given as


$\sigma_{\theta\theta} = E u_r / [R (1 - \nu^2)]$  (10)
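The algebra between equations (9) and (10) is short and can be written out explicitly. The following is a sketch of that step using only the relations already stated: equation (8) rearranged for the radial strain, and the hoop strain from equation (7).

```latex
\begin{aligned}
\epsilon_{rr} &= -\frac{\nu}{1-\nu}\,\epsilon_{\theta\theta}
  && \text{(equation (8), rearranged)} \\
\sigma_{\theta\theta}
  &= \frac{E}{(1+\nu)(1-2\nu)}
     \left[-\frac{\nu^{2}}{1-\nu} + (1-\nu)\right]\epsilon_{\theta\theta}
  && \text{(substituted in equation (9))} \\
  &= \frac{E}{(1+\nu)(1-2\nu)}\cdot\frac{(1-\nu)^{2}-\nu^{2}}{1-\nu}\,
     \epsilon_{\theta\theta}
   = \frac{E}{(1+\nu)(1-\nu)}\,\epsilon_{\theta\theta} \\
  &= \frac{E\,u_r}{R\,(1-\nu^{2})}
  && \text{(using } \epsilon_{\theta\theta} = u_r/R \text{ from equation (7))}
\end{aligned}
```

The factor $(1-2\nu)$ cancels because $(1-\nu)^2 - \nu^2 = 1 - 2\nu$.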

Substitution  of  equation  (2)  and  above  in  the  equation  of  motion  developed  in  the  previous 
section  results  in  a  differential  equation  of  motion  in  the  form 

$d^2 u_r / dt^2 + \omega^2 u_r = (\alpha - \beta t)\, e^{-\gamma t}$  (11)

where,

$\omega^2 = E / [\rho R^2 (1 - \nu^2)]$

$\alpha = P_m / (\rho h)$

$\beta = P_m / (\rho h t_p)$  (12)

$\gamma = a' / t_p$

METHOD  OF  SOLUTION 

The  governing  equation  of  motion  for  the  cylinder  wall  must  be  solved  subject  to  initial 
conditions  that  both  initial  displacement  and  velocity  are  zero  at  time  t  =  0.  Let  us  assume 
a  trial  solution  for  radial  displacement  of  the  form 


$u_r(t) = A \sin \omega t + B \cos \omega t + (C - D t)\, e^{-\gamma t}$  (13)
Substituting the trial solution above in equation (11) and setting the coefficients of the
functions $e^{-\gamma t}$ and $t e^{-\gamma t}$ to zero, the constants C and D can be evaluated to obtain

$u_r(t) = A \sin \omega t + B \cos \omega t + (1/\zeta)[\alpha - 2\gamma\beta/\zeta - \beta t]\, e^{-\gamma t}$  (14)

where,

$\zeta = \gamma^2 + \omega^2$  (15)

Subsequently,  the  initial  conditions  are  applied  in  the  equation  above  to  evaluate  the 
constants  A  and  B  which  yields 


$\zeta u_r(t) = [\delta + \beta/\gamma](\gamma/\omega) \sin \omega t - \delta \cos \omega t + (\delta - \beta t)\, e^{-\gamma t}$  (16)

where,

$\delta = \alpha - 2\gamma\beta/\zeta$  (17)

The above equation gives, in closed form, the time dependent radial displacement
of the cylinder as a function of the geometric and material parameters as well as the
transient internal pressure load due to detonation of an explosive.
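The closed-form displacement can be evaluated directly once the constants are formed. The sketch below assumes the reading of the equations used here, that is ω, α, β and γ from equation (12), ζ = γ² + ω² from equation (15) and δ = α − 2γβ/ζ from equation (17). The function and argument names are hypothetical, and the inputs in the usage note are illustrative values only, not data from the paper.

```python
import math


def response_constants(E, rho, nu, R, h, p_m, t_p, a):
    """Constants of equations (12), (15) and (17) for a wall of thickness h."""
    omega = math.sqrt(E / (rho * R**2 * (1.0 - nu**2)))
    alpha = p_m / (rho * h)
    beta = p_m / (rho * h * t_p)
    gamma = a / t_p
    zeta = gamma**2 + omega**2
    delta = alpha - 2.0 * gamma * beta / zeta
    return omega, alpha, beta, gamma, zeta, delta


def radial_displacement(t, omega, beta, gamma, zeta, delta):
    """Equation (16) divided through by zeta, giving u_r(t)."""
    return ((delta + beta / gamma) * (gamma / omega) * math.sin(omega * t)
            - delta * math.cos(omega * t)
            + (delta - beta * t) * math.exp(-gamma * t)) / zeta
```

A quick consistency check is to confirm numerically that the result starts from rest and satisfies equation (11), for instance with steel-like inputs such as `response_constants(200e9, 7850.0, 0.3, 0.5, 0.02, 5e6, 1e-3, 1.5)`.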

To obtain the peak radial displacement of the cylinder, the time of occurrence of the peak
radial response must be computed. Taking the derivative of the radial displacement with respect
to time and setting it to zero, so that the radial velocity vanishes, locates a stationary
point (an optimum) of the displacement. However, to guarantee that the displacement is in
fact a maximum at that time, the radial acceleration there should also be negative. The
radial acceleration can be obtained by differentiating the velocity again with respect to
time; it vanishes when the radial velocity attains a peak value.

DETERMINATION  OF  PEAK  STRESS 

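Using the definitions of $\sigma_{\theta\theta}$ from equation (10) and $\omega^2$ from (12), it can be easily seen
that

$\sigma_{\theta\theta} = \rho R \omega^2 u_r$  (18)

Substituting $u_r$ from equation (16) in above yields

$\sigma_{\theta\theta} = (\rho R \omega^2 / \zeta)[(\delta + \beta/\gamma)(\gamma/\omega) \sin \omega t - \delta \cos \omega t + (\delta - \beta t)\, e^{-\gamma t}]$  (19)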

The above equation can be used to determine the maximum hoop stress that the cylindrical
wall will experience in case of an accidental explosion. However, prior to peak stress
computation, the time of occurrence of the peak response must be found. Evaluating the
derivative of $\sigma_{\theta\theta}$ with respect to time at the optimum time of occurrence and setting the
resulting expression to zero, we obtain

$(\delta\gamma + \beta) \cos(\omega t_m) + \delta\omega \sin(\omega t_m) - [\gamma(\delta - \beta t_m) + \beta]\, e^{-\gamma t_m} = 0$  (20)

The above equation can be solved implicitly for the optimum time, $t_m$, when the hoop stress
becomes a maximum. To ensure that the stress is indeed a maximum, the left hand side
expression above must be differentiated again with respect to time and evaluated to confirm
that the second derivative of $\sigma_{\theta\theta}$ with respect to time is negative at the optimum time.
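One way to carry out the implicit solution for the optimum time is sketched below. It is an illustration under stated assumptions rather than the authors' procedure: the left hand side of equation (20) is scanned from just after t = 0 for its first change of sign from positive to negative, and that bracket is refined by bisection. A change from positive to negative means the stress rate goes from rising to falling, which is the same condition as a negative second derivative at the root. The function names, step counts and scan window are hypothetical, and the first local maximum found this way is not guaranteed to be the largest one over all time.

```python
import math


def stress_rate_shape(t, omega, beta, gamma, delta):
    """Left hand side of equation (20); proportional to the hoop stress rate."""
    return ((delta * gamma + beta) * math.cos(omega * t)
            + delta * omega * math.sin(omega * t)
            - (gamma * (delta - beta * t) + beta) * math.exp(-gamma * t))


def first_peak_time(omega, beta, gamma, delta, steps_per_period=400, periods=3):
    """Scan for the first + to - sign change of equation (20), then bisect."""
    dt = 2.0 * math.pi / omega / steps_per_period
    t_prev = dt  # skip t = 0, where the expression vanishes identically
    f_prev = stress_rate_shape(t_prev, omega, beta, gamma, delta)
    for k in range(2, steps_per_period * periods + 1):
        t = k * dt
        f = stress_rate_shape(t, omega, beta, gamma, delta)
        if f_prev > 0.0 and f <= 0.0:
            lo, hi = t_prev, t
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if stress_rate_shape(mid, omega, beta, gamma, delta) > 0.0:
                    lo = mid
                else:
                    hi = mid
            return 0.5 * (lo + hi)
        t_prev, f_prev = t, f
    raise ValueError("no maximum found in the scanned window")
```

The returned time can then be substituted into equation (19) to evaluate the stress at that instant.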

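Once the value of the optimum time, $t_m$, is known, it can be substituted in equation
(19) to obtain the maximum hoop stress in the form

$\sigma_{peak} = (\rho R \omega^2 / \zeta)[(\delta + \beta/\gamma)(\gamma/\omega) \sin(\omega t_m) - \delta \cos(\omega t_m) + (\delta - \beta t_m)\, e^{-\gamma t_m}]$  (21)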

Peak radial displacement can be predicted by substituting the optimum time of occurrence
in equation (16) which yields

$(u_r)_{peak} = (1/\zeta)[(\delta + \beta/\gamma)(\gamma/\omega) \sin(\omega t_m) - \delta \cos(\omega t_m) + (\delta - \beta t_m)\, e^{-\gamma t_m}]$  (22)

Once peak radial displacement is known, peak radial velocity and acceleration could be
determined by taking single and double derivatives of the displacement given in equation
(16) and replacing t by $t_m$ in the corresponding terms.
DEVELOPMENT OF A COMPUTATIONAL METHOD FOR CONVENTIONAL
WEAPONS  ANALYSIS  OF  BURIED  STRUCTURES 
Vicksburg, MS 39180-6199


ABSTRACT. Current methods, including Army Technical Manual (TM) 5-855-1,
of designing buried structures to withstand the effects of the detonation
of  an  earth  penetrating  conventional  weapon,  utilize  decoupling  assumptions  in 
determining  the  loads  on  those  structures.  Free-field  stresses  and  velocities 
at  the  structure  location  are  computed  and  then  assumptions  are  made  to 
convert  these  stresses  to  structure  loads.  A  recent  series  of  tests,  CONWEB, 
sponsored  by  the  Defense  Nuclear  Agency,  emphasized  the  need  for  better 
methods  of  predicting  loads  on  structures. 

The  most  serious  drawback  of  any  of  the  current  methods  is  the  decoupling 
of  the  free-field  stress  calculation  from  the  structure  loading  and  structure 
response  calculations.  Therefore,  an  analysis  method  is  needed  in  which  the 
detonation  of  the  explosive,  the  propagation  of  stresses  through  the  soil,  the 
soil-structure-interaction  and  the  structure  response  are  included  in  a  single 
calculation.  The  development  of  this  fully  coupled  analysis  method  is 
presented  in  this  paper. 

The Finite Element (FE) method is an excellent method for attempting to
perform this fully coupled analysis. A constitutive model capable of
functioning in the very high stress region near the charge, and a method of
stabilizing the solution, without adversely affecting the results, are needed.
This model and method were developed and implemented in an FE computer code.

A  nonreflecting  boundary  method  capable  of  functioning  extremely  well  in 
regions  of  nonlinear  response  was  also  developed.  This  boundary  is  used  to 
reduce  the  volume  of  soil  which  needs  to  be  modeled  and  makes  the  fully 
coupled  analysis  method  practical. 

This  paper  presents  comparisons  of  the  results  of  analyses  performed 
using  this  newly  developed  method,  to  free-field  stress  and  velocity  data 
measured  in  tests  in  sand  and  clay  in  the  CONWEB  test  series.  Analyses  using 
this  method  did  a  good  job  of  predicting  those  free-field  stresses  and 
velocities.  Analyses  performed  to  assess  the  new  nonreflecting  boundary 
demonstrated  that  this  boundary  functions  extremely  well.  This  analysis 
method  has  been  implemented  in  an  FE  computer  code  capable  of  modeling  the 
soil-structure-interaction and structure response, and will be extremely
useful  in  developing  new  methods  of  designing  buried  protective  structures. 

The  use  of  this  new  method  will  result  in  tremendous  savings  in  the 
construction  of  this  type  of  structure. 
BACKGROUND. The military has assets which are critical to its
mission and must be capable of surviving a conventional weapons attack.
Quite often this objective is met by placing these assets in a buried
reinforced concrete structure such as the one shown in Figure 1. These
structures are extremely expensive and efforts must be made to minimize
costs.


Figure  1.  Typical  buried  hardened  structure 

Damage to a buried structure from a conventional weapon detonation
is usually highly localized. One means of reducing costs is to accept
moderate to high levels of localized damage, as long as structural
failure does not occur. Accurate design procedures are needed to ensure
against failure without being overly conservative, since overconservatism
increases costs and defeats the purpose of allowing localized damage.

CURRENT ANALYSIS PROCEDURES. Army Technical Manual (TM) 5-855-1
[1] recommends the use of a single-degree-of-freedom (SDOF), spring-mass
system to analyze the structural element being designed.
Procedures are outlined to determine the equivalent SDOF system to model
one-way and two-way slabs with various end fixity conditions. Equations
and figures are presented to determine the capacities of these slabs and
for computing stress and velocity histories in the soil. Methods of
determining the load on, and response of, the structure, as well as
recommendations on acceptable response, are provided. The designer can
select  a  trial  structural  geometry  and  very  quickly  evaluate  it  using 
the  guidance  provided  in  TM5-855-1. 

An  accurate  determination  of  the  structure  load  is  critical  to 
designing  the  structure.  The  structure  loading  is  a  function  of  the 
stresses  propagating  through  the  soil  and  the  interaction  of  the  soil 
with  the  structure.  Thus,  the  structure  loading  is  coupled  to  the 
structure  response. 

TM5-855-1  recommends  a  decoupled  analysis.  The  loads  on  the 
structure  are  determined  by  modifying  the  free-field  stresses  (the 
stresses  which  would  have  been  present  in  the  soil  at  the  structure 
location if the structure were absent) and these loads are applied to the
SDOF model to determine the structure response. A semi-empirical
procedure  is  used  to  determine  the  structure  loading  from  the  free-field 
stresses.  Elements  of  linear  elastic  wave  theory  combined  with 
experimental  data  were  used  to  develop  the  procedure  used  to  compute  the 
structure  load  from  the  free-field  stress.  The  peak  structure  load  is 
[...] linearly to the free-field stress in a time equal to twelve transit
times  of  a  stress  wave  through  the  thickness  of  the  wall  or  roof  slab 
being  analyzed. 

An  Air  Force  protective  construction  manual  [2]  recommends  a 
procedure  very  similar  to  the  one  used  in  TM5-855-1.  The  major 
difference  in  the  two  procedures  is  the  method  of  computing  the  loads 
from  the  free-field  stresses.  The  decoupling  procedure  recommended  in 
this  manual  uses  linear-elastic  1-D  wave  theory  to  determine  the 
structure loads. Assuming continuity of stress and velocity at the
soil-structure interface, the structure interface loading, $\sigma_i$, is given
by:

$\sigma_i = \sigma_{ff} + \rho C (V_{ff} - V_s)$    eqn. 1.

where: $\sigma_{ff}$ is the free-field stress, $\rho C$ is the acoustic impedance of the
soil, and $V_{ff}$ and $V_s$ are the free-field and structure velocities,
respectively. This will be referred to as the soil-medium-interaction
(SMI) method of determining the load on the structure. This equation is
further simplified using the linear elastic 1-D relationship:

$\sigma_{ff} = \rho C V_{ff}$    eqn. 2.

Substituting equation 2 into equation 1 results in the modified SMI
(MSMI) equation:

$\sigma_i = 2 \sigma_{ff} - \rho C V_s$    eqn. 3.

Either  eqn.  1  or  3  is  used  to  compute  the  structure  loading  as  long  as 
the  result  is  a  positive  (compressive)  loading  on  the  structure.  Since 
the  soil-structure  interface  cannot  support  tension,  the  loading  is  set 
equal  to  zero  when  eqn.  1  or  3  predicts  tension. 
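The two decoupled load estimates and the tension cutoff can be sketched in a few lines. This is a minimal illustration with hypothetical function and argument names, taking compression as positive, with the free-field stress, the free-field and structure velocities, and the acoustic impedance of the soil as inputs in consistent units.

```python
def smi_load(sigma_ff, v_ff, v_s, rho_c):
    """Eqn. 1 with the tension cutoff: interface stress, compression positive."""
    return max(0.0, sigma_ff + rho_c * (v_ff - v_s))


def msmi_load(sigma_ff, v_s, rho_c):
    """Eqn. 3 with the tension cutoff; assumes eqn. 2 holds for the free field."""
    return max(0.0, 2.0 * sigma_ff - rho_c * v_s)
```

When the free-field stress and velocity satisfy eqn. 2 exactly, both functions return the same load. They differ only when measured or computed free-field histories depart from the linear elastic 1-D relationship.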

The  SMI,  MSMI,  and  TM5-855-1  procedures  for  computing  structure 
load  have  been  evaluated  [3]  against  experiments  conducted  for  the  Air 
Figure 2. Typical comparison of measured to predicted loads


Figure  3.  Configuration  for  CONWEB  experiments 
Two of these experiments, Figure 3, were identical, except that
one was buried in wet clay while the other was buried in dry sand. The
structure buried in the wet clay was severely damaged (19 in. deflection
on a 43.2 in. span). The damage to the structure in sand was minimal
(1.4 in. deflection).

Figure  4  compares  the  early  time  loading  at  the  centers  of  the 
slabs.  This  figure  shows  that  the  peak  loading  at  the  center  of  the 
slab  buried  in  clay  is  much  higher  than  that  of  the  slab  buried  in  sand. 
The duration of this initial loading is much longer in the sand,
resulting in an initial impulse that is approximately the same as for
the structure buried in clay. Since the initial impulses are
approximately the same, the late-time loading on the slab must be
responsible for the difference in response; late-time loads are
therefore important.

These  analyses  of  experiments  have  indicated  that  late  time  loads 
on  buried  structures  significantly  affect  the  response  of  the  structure 
and  that  neither  method  of  predicting  loads  on  the  structure  adequately 
predicts  these  late-time  loads.  Therefore  an  improved  method  of 
predicting  these  late-time  loads  is  needed.  A  study  was  initiated  to 
determine  a  better  method  of  accounting  for  soil-structure-interaction 
(SSI)  effects  when  predicting  structure  loads  from  free-field  stresses. 


Figure 4. Comparison of early-time loads in clay and sand (horizontal axis: time, msec)


NEED FOR FULLY COUPLED ANALYSIS PROCEDURE. In order to develop a
better method of decoupling the structure response analysis from the
free-field stress analysis, an analysis procedure which does not require
decoupling is needed. One possibility is to use a "soil island". The
structure and a small "island" of soil around it are modeled in a finite
element  (FE)  analysis.  Stresses  and/or  velocities  are  input  on  the 
boundary  of  the  "soil  island". 

This  method  has  the  advantage  that  it  does  not  require  assumptions 
to  decouple  the  soil  from  the  structure  and  it  does  not  require  that  the 
detonation  of  the  explosives  be  included  in  the  analysis.  The  drawback 
to  this  procedure  is  that  it  requires  the  introduction  of  an  artificial 
boundary,  the  soil  island  boundary.  This  boundary  does  not  affect  the 
very early loads on the structure, but does affect the late-time loading
since  the  boundary  must  be  placed  between  the  explosive  and  the 
structure  and  the  explosive  is  very  close  to  the  structure.  Since  the 
purpose  of  this  study  is  to  study  late-time  loads,  the  soil  island 
approach  is  not  appropriate. 

The  only  way  to  avoid  adding  a  boundary  which  affects  the  loading 
on  the  structure  is  to  include  the  detonation  of  the  explosive  and  the 
structure  response  in  the  same  analysis.  Analyses  of  this  type  have  not 
been  reported  in  the  literature;  therefore,  a  fully-coupled  analysis 
procedure  which  includes  the  detonation  of  the  explosive,  the 
propagation  of  stresses  through  the  soil,  the  SSI,  and  the  response  of 
the  structure  was  developed. 

DEVELOPMENT  OF  FULLY-COUPLED  ANALYSIS  PROCEDURE.  The  development 
of  this  fully  coupled  analysis  procedure  is  presented  in  this  paper. 
Before  this  procedure  can  be  used  to  study  SSI,  it  must  be  shown  that  it 
adequately  predicts  free-field  stresses  and  velocities.  Analyses  were 
performed  for  comparison  with  free-field  stress  and  velocity  data 
collected  in  the  CONWEB  experiments  in  wet  clay  and  dry  sand.  These 
comparisons  are  presented  in  this  paper. 

The  analysis  procedure  must  be  capable  of  adequately  modeling  the 
SSI.  When  a  stress  wave  propagates  through  the  soil  to  the  structure 
there  are  several  possible  occurrences.  Initially  the  soil  will  load 
the  structure  and  the  structure  will  start  to  deform.  The  structure 
could  deform  fast  enough  so  that  the  soil  separates  from  the  structure, 
and  thus  the  structure  is  unloaded.  Later  the  soil  may  catch  up  with 
the  structure  and  reload  it.  It  is  also  possible  that  the  soil  may 
slide  on  the  structure,  and  the  soil  may  flow  past  the  structure. 

There  are  FE  procedures  capable  of  modeling  these  effects.  There 
are  also  FE  methods  for  modeling  the  detonation  of  explosives,  the 
propagation  of  the  stresses  through  the  soil  and  the  response  of  the 
structure.  Thus  it  appears  that  the  FE  method  is  ideal  for  attempting 
to  develop  this  fully  coupled  analysis  procedure. 

There  are  several  considerations  specifically  related  to  the  FE 
method  of  solving  this  problem.  Constitutive  models  in  FE  codes  are 
capable  of  modeling  non-linear  behavior  such  as  the  behavior  of  soil. 
These models are not, in general, capable of functioning in the very
high stress region adjacent to the explosive. Therefore it was expected
that the constitutive model would need modification in order to perform
the SSI study.

The  SSI  study  will  involve  the  propagation  of  a  very  high  stress 
wave  through  the  soil.  The  extremely  short  rise  time  associated  with 
this  stress  wave  cannot  be  propagated  through  the  FE  grid  which 
represents  the  soil  continuum  as  a  series  of  elements.  Attempting  to 
propagate this stress wave through the grid leads to numerical
instability.  Some  method  of  stabilizing  the  solution  without  adversely 
affecting  the  response  is  needed.  This  is  typically  accomplished  by 
including  artificial  viscosity  forces.  These  artificial  viscosity 
forces  spread  the  shock  front  over  several  elements  and  damp  out  the 
numerical instabilities. Procedures for stabilizing the solution are
built in to FE codes, but these default procedures were not satisfactory
for an explosive detonating in soil, and a modified method of stabilizing
the solution without adversely affecting the results was developed.

In  order  to  adequately  predict  the  rise  times  on  the  free-field 
stress  histories,  a  fine  grid  will  be  needed.  The  grid  size  must  be 
limited  so  that  the  run  times  will  be  reasonable.  Nonreflecting 
boundaries  are  available  to  reduce  the  required  grid  size.  These  are 
boundaries which are used to model an infinite amount of material beyond
the  boundary. 

A  nonreflecting  boundary  was  developed  by  Lysmer  and  Kuhlemeyer 
[6] for dynamic loadings. A similar procedure developed by Underwood
and  Geers  [7]  is  suitable  for  both  static  and  dynamic  loadings.  They 
demonstrated  that  these  boundaries  are  extremely  effective  when  the 
boundary  is  placed  in  a  region  of  linear  elastic  response.  In  another 
study,  [8],  they  showed  that  these  boundary  techniques  are  not  effective 
when  the  boundary  is  placed  in  a  region  of  highly  nonlinear  response. 

The  behavior  of  the  soil  will  be  highly  nonlinear  in  a  large 
region  around  the  charge,  and  it  will  be  impractical  to  include  all  of 
the  soil  in  the  SSI  analysis.  Therefore,  a  nonreflecting  boundary 
method  capable  of  functioning  in  the  nonlinear  response  region  of  the 
soil  was  developed. 

The  large  deformation,  large  strain,  explicit,  three-dimensional, 
FE code, DYNA3D [9] contains the Jones Wilkins Lee (JWL) [10] equation
of state which can be used to model the detonation of the explosives.
The Cap model [11, 12] is available to model the soil, nonreflecting
boundaries  are  available,  and  an  interface  routine  which  allows 
separation,  recontacting,  and  sliding  with  friction  of  the  soil  on  the 
structure  is  available.  These  features  make  DYNA3D  an  excellent 
candidate  for  attempting  the  SSI  analyses;  therefore,  it  was  selected 
for  this  study. 

DEVELOPMENT OF HIGH-PRESSURE CONSTITUTIVE MODEL FOR SOIL. Before
attempting  the  SSI  analyses  it  should  be  demonstrated  that  the  method  is 
capable  of  adequately  predicting  free-field  stresses  and  velocities. 
This requires that a constitutive model capable of functioning in the
very  high  stress  region  near  the  charge  and  a  method  of  stabilizing  the 
solution  be  developed  and  implemented  in  DYNA3D. 

The  Cap  model  is  an  excellent  model  for  predicting  stress  wave 
propagation  through  soil,  but  the  Cap  model  in  DYNA3D  would  not  converge 
to  a  solution  for  simple  test  problems  using  the  material  properties  of 
the CONWEB clay material. Therefore, another Cap model was obtained
[13].


This  new  Cap  model  functioned  well  for  both  the  CONWEB  clay  and 
sand  materials,  but  would  not  function  in  the  very  high  stress  region 
near the charge. In this new Cap model the volumetric stress-strain
relationship  is  divided  into  the  elastic  and  plastic  parts.  The  change 
in  elastic  volumetric  strain  is  the  change  in  pressure  divided  by  the 
constant  bulk  modulus,  K.  At  extremely  high  pressures  in  soils,  the 
bulk modulus increases significantly with increasing pressures.
Ignoring this increase will cause the model to predict unrealistically
high  volumetric  strains.  Therefore  the  Cap  model  had  to  be  modified  so 
that  the  bulk  modulus  varied  with  pressure. 

Static test data [14] at very high pressures have shown that the
bulk  modulus  is  adequately  represented  by  the  following  function: 

$K = K_0 + K_1 P^{K_2}$    eqn. 4.

Where $K_0$, $K_1$, and $K_2$ are material constants and P is the pressure. The
new Cap model was modified to use this function for the bulk modulus.
Analyses  were  then  performed  to  ensure  that  the  modified  Cap  model 
combined  with  the  JWL  equation  of  state  would  satisfactorily  predict 
free-field  stresses  and  velocities. 

DEVELOPMENT OF STABILIZATION METHOD. In the initial attempts to
validate  the  Cap  model,  the  FE  analyses  were  unstable.  DYNA3D  [15]  uses 
linear  and  quadratic  artificial  viscosity  forces  to  spread  the  shock 
front  over  several  elements.  An  artificial  force,  f,  is  introduced  for 
elements  which  are  compressing.  This  is  the  method  developed  by  von 
Neumann and Richtmyer [16] and it has been demonstrated that this method
stabilizes  the  solution,  but  does  not  perturb  the  solution  away  from  the 
shock  front.  The  artificial  force  is  given  by: 

$f = \rho\, l\, |\dot{\epsilon}_{kk}|\, (Q_1\, l\, |\dot{\epsilon}_{kk}| + Q_2\, c)$    eqn. 5.

Where $Q_1$ and $Q_2$ are dimensionless constants, $\rho$ is the mass density, l
is the characteristic length of the element, c is the sound speed for
the material, and $|\dot{\epsilon}_{kk}|$ is the magnitude of the volumetric strain rate.
For relatively uniform grids, it is appropriate to use the cube root of
the element volume for l, and this is done in DYNA3D. $Q_1$ and $Q_2$ default
to 1.5 and 0.06, respectively, in DYNA3D.
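The artificial viscosity term can be illustrated with a short function. This is a sketch under stated assumptions, not the DYNA3D source: eqn. 5 is read as the product of density, characteristic length and strain rate magnitude with a quadratic term weighted by the first constant and a linear term weighted by the second; compression is taken as a negative volumetric strain rate; and the function and argument names are hypothetical. The default coefficients are the values quoted above.

```python
def characteristic_length(volume):
    """Cube root of the element volume, as used for relatively uniform grids."""
    return volume ** (1.0 / 3.0)


def artificial_viscosity(rho, length, sound_speed, strain_rate, q1=1.5, q2=0.06):
    """Eqn. 5 for one element; zero unless the element is compressing.

    strain_rate is the volumetric strain rate, negative in compression
    (a sign convention assumed for this sketch).
    """
    if strain_rate >= 0.0:
        return 0.0
    rate = abs(strain_rate)
    return rho * length * rate * (q1 * length * rate + q2 * sound_speed)
```

Because the expression is linear in the two coefficients taken together, multiplying both by a common factor multiplies the artificial force by the same factor.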
Figure 5. Grid for explosives and surrounding soil


In the SSI analyses a 2-D cylindrical grid, as shown in Figure 5,
will  be  used  to  model  the  charge  and  surrounding  soil.  Because  of  the 
shape  of  the  elements  and  the  reduction  of  stresses  as  the  shock 
propagates  away  from  the  charge,  it  is  more  appropriate  to  use  viscosity 
forces  which  are  proportional  to  the  radial  dimension  of  the  element, 
rather  than  the  cube  root  of  the  volume. 

Using the cube root of the volume will result in artificial
viscosity forces which are too low near the charge. This effect can be
offset by using values of the constants $Q_1$ and $Q_2$ which are higher than
the default values. At distances farther away from the charge the
artificial viscosity forces will be too high, and will affect the
predicted stress and velocity histories. This effect can be reduced by
modifying the method of computing the characteristic length of the
element, or by using artificial viscosity terms which vary with distance
from the charge. The latter method was easier to implement and was
therefore used.
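
As an illustration, eqn. 5 and the two choices of characteristic length discussed above (the cube root of the element volume, or the radial dimension of the element) can be sketched in Python. This is a minimal sketch and not DYNA3D code: the function names and the numerical inputs are hypothetical, and only the default constants 1.5 and 0.06 are taken from the text.

```python
def artificial_viscosity_force(rho, length, strain_rate, sound_speed,
                               q1=1.5, q2=0.06):
    """Evaluate eqn. 5: f = rho * l * |e| * (Q1 * l * |e| + Q2 * c).

    rho         -- mass density
    length      -- characteristic length l of the element
    strain_rate -- volumetric strain rate (only its magnitude is used)
    sound_speed -- sound speed c of the material
    q1, q2      -- dimensionless constants (defaults quoted in the text)
    """
    e = abs(strain_rate)
    return rho * length * e * (q1 * length * e + q2 * sound_speed)


def cube_root_length(volume):
    """Characteristic length taken as the cube root of the element volume."""
    return volume ** (1.0 / 3.0)


# Hypothetical element (unitless numbers chosen only for illustration):
# its volume is 8.0 but its radial dimension is only 0.5.
f_volume = artificial_viscosity_force(rho=2.0, length=cube_root_length(8.0),
                                      strain_rate=-10.0, sound_speed=100.0)
f_radial = artificial_viscosity_force(rho=2.0, length=0.5,
                                      strain_rate=-10.0, sound_speed=100.0)
```

Because both terms of eqn. 5 grow with the characteristic length, the choice of length directly scales the stabilizing force for a given strain rate.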

COMPARISONS TO EXPERIMENTAL FREE-FIELD DATA. One-dimensional
spherical analyses were performed for comparison with the free-field
data in the CONWEB experiments in clay and sand. A complete description
of these analyses has been reported [17]. In these experiments free-field
stresses and accelerations were measured at 3, 4, 5, 6, and 7 ft
from the explosive. The acceleration histories were integrated to
obtain the free-field velocities.

Initially the analyses in clay were attempted using the default
values of the artificial viscosity coefficients, but these analyses were
unstable. The lowest viscosity coefficients which produced a stable
solution were 5 times the default values. The predicted stresses for
this analysis are compared to data in Figure 6. This figure shows that
stresses are predicted extremely well at the 3 and 4 ft ranges. At the
other ranges the peak stresses are well predicted, but the rise times
are too long and the late-time stresses are too high.

Figure  7  compares  the  free-field  velocities  to  the  data  for  this 
experiment.  The  velocity  histories  also  agree  reasonably  well  with  the 
data.  The  late-time  velocities  are  slightly  overpredicted  and  the  peak 
velocities  at  the  farther  ranges  are  overpredicted.  Rise  times  to 
maximum  velocities  are  overpredicted.  The  overpredicted  stresses  at  the 
farther  ranges  are  due  largely  to  the  free  surface  which  is  present  in 
the  experiment,  but  not  included  in  the  analysis. 

The  large  rise  times  of  the  stress  and  velocity  histories,  as 
compared  to  the  data,  are  due  to  the  method  which  DYNA3D  uses  to 
stabilize  the  solution.  Rise  times  which  are  too  long  could  affect  the 
loads  transmitted  to  the  structure;  therefore  it  is  important  to  model 
them  reasonably  well. 

This  analysis  was  repeated  using  the  newly  developed  method  of 
stabilizing  the  solution.  This  did  not  affect  the  stress  and  velocity 
histories  at  3  and  4  ft  but  did  affect  those  at  the  farther  ranges.  The 
stress and velocity histories at the 5, 6, and 7 ft ranges are compared
to the original analysis in Figures 8 and 9, respectively. These
figures  show  that  the  stress  histories  do  not  change  significantly, 
except  for  the  rise  times  which  are  significantly  improved  by  using  the 
new  stabilization  method.  The  latter  analysis  does  a  much  better  job  of 
matching  the  rise  times  of  the  experimental  data. 

Analyses  were  also  performed  for  comparison  with  the  CONWEB  sand 
experiment.  These  analyses  could  not  be  performed  using  the  default 
viscosities.  Viscosity  coefficients  of  3  times  the  default  values  were 
needed  to  stabilize  the  solution.  Free-field  stress  and  velocity 
histories  were  predicted  reasonably  well.  In  general,  rise  times  were 
too long and arrival was too late. Late-time stresses were
overpredicted and late-time velocities were underpredicted. The late-time
overpredicted stresses and underpredicted velocities are due
largely to the effects of the free surface and the finite test pit.
These effects were not included in the analysis.

Arrival  times  and  rise  times  at  the  6  and  7  ft  ranges  were 
improved  significantly  by  using  the  modified  method  of  stabilizing  the 
solution.  This  does  increase  the  predicted  peak  pressures  and 
velocities  but  the  overall  effect  of  using  the  newly  developed 
stabilization  procedure  is  to  improve  agreement  between  the  analysis  and 
the experimental data.

Figure 6. Stresses in clay (continued). Panel c: range = 5 ft; axis: radial stress.

Figure 7. Velocities in clay (continued and concluded). Panel c: range = 5 ft; axis: radial velocity, ips.

NONREFLECTING BOUNDARIES. Figure 10 shows the results of several
analyses performed using a 1 in. radius charge in clay in a 1-D
cylindrical geometry. In one case the grid boundary was placed at 40 ft
from the charge, and the effects of this boundary do not appear in the
time period shown. In the other histories, three different types of
boundaries were placed at 10 ft from the center of the charge. This
figure shows that using the nonreflecting boundary installed in DYNA3D
produces about the same stress history as providing a fixed boundary at
the same location.

This  boundary  is  based  on  the  elastic  properties  of  the  material 
and  is  obviously  not  effective  when  placed  in  a  nonlinear  response 
region.  The  nonreflecting  boundary  in  DYNA3D  is  based  on  the  work  of 
Lysmer  and  Kuhlemeyer  [7],  and  amounts  to  placing  viscous  springs  at  the 
boundary  to  simulate  the  material  which  would  have  been  on  the  other 
side  of  the  boundary.  The  constants  for  these  viscous  springs  are  based 
on  the  elastic  properties  of  the  material. 


Figure 10. Effect of boundaries on stress history (curve label: nonreflecting; horizontal axis: time, msec)

Figure  11  shows  the  volumetric  stress-strain  relationship  for  the 
CONWEB  clay.  The  solid  curve  in  this  figure  includes  both  elastic  and 
inelastic  response.  This  curve  shows  a  significant  stiffening  of  the 
material at strains above 4 percent. At this point, all of the air voids
have been compressed out of the soil and the soil behaves as an elastic
material. The material will unload parallel to the elastic curve. The
dotted curve shows the elastic pressure-volumetric strain relationship
at low pressures. At strains above 4 percent the total strain curve is
parallel to the elastic strain curve.


The nonreflecting boundary is based on this elastic curve, and the
boundary is too stiff until the strains reach 4 percent. For a
monotonically decreasing stress pulse similar to the ones shown in
Figure 6, propagating into an infinite amount of soil, there will always
be material in front of the shock which is strained to less than 4
percent; therefore the nonreflecting boundary will always be too stiff.

Figure 11. Hydrostatic response of clay

The  nonreflecting  boundary  was  modified  to  reflect  the  total 
stress-strain  behavior  of  the  material  instead  of  the  elastic  behavior. 
This  was  done  by  using  a  stress-strain  relationship  like  the  one  shown 
as  the  dashed  curve  in  Figure  11.  This  was  accomplished  by  modifying 
the  subroutine  which  provides  the  material  property  information  to  the 
nonreflecting  boundary  subroutine.  Using  this  method,  the  nonreflecting 
boundary  problem  can  be  corrected  without  affecting  the  material 
constants  used  for  the  wave  propagation  analysis. 
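
The effect of this change can be illustrated with a minimal Python sketch. It assumes the standard Lysmer-Kuhlemeyer normal dashpot, in which the boundary traction is the impedance rho*c times the normal velocity, with c computed from a modulus and the density. The function names, the sign convention, and the unitless numbers below are hypothetical and are not taken from DYNA3D; they only show that a modulus read from the softer total stress-strain curve yields a less stiff boundary than the elastic modulus does.

```python
import math


def dashpot_coefficient(rho, modulus):
    """Viscous boundary coefficient rho * c, with c = sqrt(modulus / rho)."""
    return math.sqrt(rho * modulus)


def boundary_traction(rho, modulus, normal_velocity):
    """Traction supplied by the dashpot; it opposes the boundary velocity."""
    return -dashpot_coefficient(rho, modulus) * normal_velocity


# Hypothetical, unitless values chosen only to show the trend.
rho = 2.0
elastic_modulus = 8.0  # slope of the elastic curve
total_modulus = 2.0    # softer slope of the total curve at small strain

stiff = dashpot_coefficient(rho, elastic_modulus)  # elastic-based boundary
soft = dashpot_coefficient(rho, total_modulus)     # total-curve boundary
```

In this sketch the only thing that changes between the two boundaries is the modulus handed to the boundary routine, which mirrors the idea of leaving the wave propagation material constants untouched.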

Because  this  problem  involves  a  single  outgoing  monotonically 
decaying  stress  pulse,  this  nonreflecting  boundary  should  perform 
satisfactorily.  A  comparison  of  analysis  results  with  the  boundary  far 
away  to  those  using  the  modified  nonreflecting  boundary  is  presented  in 
Figures  12  and  13  for  stresses  and  velocities,  respectively.  These 
figures  show  that  the  modified  nonreflecting  boundary  is  extremely 
effective. 

Similar results were obtained when the modified nonreflecting
boundary was used in the sand backfill material.

Figure 12. Stress histories using new nonreflecting boundary (axis: stress, psi)

Figure 13. Velocity histories using new nonreflecting boundary (axis: velocity, ips)


SUMMARY.  Accurate  design  procedures  are  needed  so  that  the  costs 
of  buried  hardened  structures  can  be  reduced.  The  most  serious  drawback 
of  current  analysis  methods  is  the  method  used  to  decouple  structural 
response  calculations  from  free-field  stress  calculations.  A  fully 
coupled  analysis  procedure  which  includes  the  detonation  of  the 
explosive,  the  propagation  of  stresses  through  soil,  the  soil-structure 
interaction,  and  the  response  of  the  structure  was  needed. 

It  was  decided  that  the  FE  method  was  appropriate  for  attempting 
these  SSI  analyses.  There  were  several  problems  with  existing  FE 
methodology  which  had  to  be  addressed  before  the  FE  method  could  be  used 
for  the  SSI  study.  Typical  constitutive  models  for  soil  do  not  function 
well  in  the  extremely  high  stress  region  adjacent  to  the  charge.  A 
modified  Cap  model  capable  of  performing  immediately  adjacent  to  the 
charge  was  developed.  The  same  constitutive  model  can  also  be  used  at 
much  lower  stress  levels  far  away  from  the  charge. 

The methods typically used in FE codes to stabilize the solution
for shock wave propagation were not adequate for computing stress wave
propagation in cylindrical and spherical grid geometries near the model
of an explosive detonation. A method which stabilized the solution
without adversely affecting the problem solution was developed.

Analyses  were  performed  to  assess  the  modified  Cap  model  and 
stabilization  method.  The  analysis  results  using  these  modifications 
agreed  extremely  well  with  test  data. 

A nonreflecting boundary is required to reduce the grid size so
that run times will be reasonable. Currently available boundaries
function well in regions of elastic response but do not function well in
regions of significant nonlinear inelastic response. In the SSI study,
the boundaries must be placed in regions of significant nonlinear
response. Therefore, a nonreflecting boundary routine which functions
well in regions of nonlinear behavior was developed. Analyses performed
using this modified boundary showed that the stress and velocity
histories were very similar to analyses in which the boundary was far
away.


CONCLUSIONS.  The  modified  Cap  model  along  with  the  JWL  equation 
of  state  and  the  new  method  of  stabilizing  the  solution  functioned 
extremely  well.  Predicted  stresses  and  velocities  agreed  reasonably 
well  with  experimental  data  in  both  wet  clay  and  dry  sand.  The  newly 
developed  nonreflecting  boundary  worked  extremely  well  when  placed  in  a 
region  of  highly  nonlinear  response  in  the  soil.  These  methods  were 
verified in wet clay and in dry sand. The response of most soils will
fall between these two. Therefore, these methods should perform equally
well for these intermediate soils.

With  the  new  methods  and  procedures  presented  in  this  paper,  a 
fully  coupled  analysis,  including  the  detonation  of  the  charge,  the 
propagation  of  stresses  through  the  soil,  the  soil-structure  interaction 
and  the  response  of  the  structure  is  possible.  This  fully  coupled 
analysis  procedure  can  be  used  to  develop  better  methods  of  predicting 
loads on buried structures. The use of more accurate procedures for
predicting loads on structures will result in tremendous cost savings in
the construction of hardened structures.


This method can be used to analyze structures when other methods,
such as the soil island approach, are not appropriate. The soil island
approach can only be used if the bomb is relatively far away from the
structure, or if late-time loads are unimportant. The newly developed
method can be used to model both early- and late-time behavior. The new
method is appropriate for charge standoffs of only several charge
diameters. Therefore this method is applicable for almost all
conventional weapons analyses except for contact bursts. Since this is
an FE method, a wide variety of structure geometries can be modeled.

Abstract

The  numerical  modeling  of  an  explosion  and  the  effects  of  its  pressure  loading  on 
surrounding  structures  involve  several  different  length  scales.  The  failure  of  a  grid  to 
resolve  these  length  scales  can  lead  to  an  inaccurate  solution  which  is  characterized 
by  grid  orientation  effects  and  nonphysical  oscillations  near  wavefronts.  An  adaptive 
grid  which  concentrates  grid  points  in  regions  where  solution  gradients  are  large  can 
minimize  these  types  of  errors.  As  an  application  of  the  use  of  adaptive  grids,  an 
automatic  rezone  scheme  has  been  developed  for  the  two-dimensional  Cartesian  and 
axisymmetric Eulerian finite-difference algorithms in the HULL hydrodynamics code.

The  rezone  algorithm  is  completely  automated  with  the  grid  point  locations  based 
on  values  of  the  pressure  and  its  derivatives.  Sample  computations  demonstrate  how 
adaptive  rezoning  can  be  used  to  calculate  more  realistic  solutions  without  increasing 
the total number of grid points. The examples include the simulation of atmospheric,
underwater,  and  underground  explosions. 

1  INTRODUCTION 

The  success  or  failure  of  a  numerical  simulation  often  depends  on  the  grid  that  is  used.  The 
failure  of  a  grid  to  resolve  all  of  the  significant  length  scales  in  a  problem  will  lead  to  both  an 
inaccurate  and  an  unrealistic  solution.  This  is  especially  true  in  the  modeling  of  explosions 
where  large  solution  gradients  must  be  resolved  by  the  numerical  algorithm.  An  insufficient 
number  of  grid  points  in  the  region  near  a  shock  wave  leads  to  either  oscillations  or  smearing 
in  the  solution.  A  grid  with  uniform  spacing  in  each  coordinate  direction  could  be  used 
for  solving  problems  of  this  type,  but  that  would  be  wasteful  since  there  are  typically  large 
regions  where  the  solution  is  nearly  constant.  It  would  be  desirable  to  have  a  grid  with  a 
high  concentration  of  grid  points  in  regions  where  the  solution  gradients  are  large  and  very 
few  points  where  the  solution  is  nearly  constant.  However,  this  cannot  be  done  a  priori 
since  the  shock  wave  location  is  determined  by  the  solution.  What  is  needed  is  an  adaptive 
rezoning  capability  that  regenerates  the  grid  during  the  computational  procedure  based  on 
the  current  values  of  the  numerical  solution. 

The  purpose  of  this  report  is  to  apply  one-dimensional  concepts  of  equidistribution 
to  rezone  Cartesian  grids  for  solving  multi-dimensional  problems.  The  equidistribution 
principle  has  been  used  extensively  for  solving  one-dimensional  problems.  One  of  the  earlier 
surveys appeared in the paper by Russell and Christiansen [1]. With a multi-dimensional
Cartesian  grid  system,  the  rezoning  can  be  done  directly  without  iteration,  which  is  often 
not  practical  when  applying  analogous  methods  to  general  curvilinear  coordinate  systems. 
The adaptive rezoning method has been used in the solution of several problems involving
the numerical simulation of explosions. The method has worked well on these problems,
which  were  solved  using  the  HULL  hydrodynamics  code,  and  could  be  applied  to  other 
problems  where  the  grid  must  be  composed  of  horizontal  and  vertical  lines.  The  HULL 
code  has  several  existing  rezone  options,  but  none  of  them  was  applicable  to  the  solution  of 
explosion  problems. 

2  REZONE  ALGORITHM 

Let us begin by considering a two-dimensional problem which is to be solved using a
cell-centered finite volume scheme. The finite difference grid is composed of vertical and
horizontal lines

$$ x = x_i, \quad i = 0, 1, \ldots, i_{max} $$
$$ y = y_j, \quad j = 0, 1, \ldots, j_{max}. $$

The spatial intervals are

$$ \Delta x_i = x_i - x_{i-1}, \quad i = 1, \ldots, i_{max} $$
$$ \Delta y_j = y_j - y_{j-1}, \quad j = 1, \ldots, j_{max}. $$

The values of the numerical solution are defined at the cell centers so that for a typical
solution variable $p$, the value in cell $i,j$ is

$$ p_{i,j} = p\left( \tfrac{1}{2}(x_i + x_{i-1}),\ \tfrac{1}{2}(y_j + y_{j-1}) \right). $$


Now  a  new  grid  is  to  be  constructed  based  on  these  solution  values  defined  on  an  existing 
grid.  An  adaptive  grid  must  sense  variations  in  the  solution  and  adjust  the  grid  accordingly. 
Since there may be several solution variables, more than one adaptive grid could be
constructed. However, in explosives problems, the pressure is a key variable in assessing blast
effects.  Thus,  the  rezoning  algorithm  used  in  the  later  examples  is  based  on  the  variations 
in  the  pressure. 

Since the same scheme is used to redistribute points in each coordinate direction, only
the redistribution along the x-axis will be explained in detail. Now any distribution of
grid points can be viewed as being defined by a continuous mapping from the parametric
interval $[0, i_{max}]$ onto the coordinate interval $[x_0, x_{i_{max}}]$. The parametric variable will be
denoted by $\xi$ and the grid points $x_i$ are the images of the points $\xi = i$ under the mapping
from parametric to coordinate variables. Under this terminology, a uniform grid would be
produced whenever the mapping is a linear function. But linear functions are the general
solution of the second order differential equation

$$ \frac{d^2 x}{d\xi^2} = 0. $$

The particular solution which gives rise to the uniform grid would also satisfy the boundary
conditions

$$ x(0) = x_0, \qquad x(i_{max}) = x_{i_{max}}. $$

By considering more general boundary value problems, methods can be developed for
generating nonuniform grids. To be more specific, nonuniform grids will be constructed as
solutions of the second order equation

$$ \frac{d}{d\xi}\left( w\, \frac{dx}{d\xi} \right) = 0, $$

where $w = w(\xi)$ is a weight function used to control the distribution of grid points. This
equation can be integrated once to give the first order equation

$$ w\, \frac{dx}{d\xi} = \text{constant}. $$

The following rezone scheme will be based on a discretization of this first order equation.

Having  motivated  the  concept  of  grid  distribution  from  a  continuous  model,  the  discrete 
problem  of  generating  the  grid  point  locations  will  now  be  addressed.  The  first  consideration 
will  be  the  selection  of  a  weight  function  to  be  used  to  control  the  grid  spacing.  Each  weight 
function  will  control  spacing  in  only  one  direction  and  must  be  a  function  of  a  single  variable. 
Therefore we will start out by defining a function $P$ by the formula

$$ P_i = \max \{ p_{i,j} : 1 \le j \le j_{max} \}. \qquad (1) $$

In order that the grid spacing be influenced by the values of both $p$ and its derivatives, the
weight function will be given as

$$ w_i = c_0 |P_i| + c_1 |P'_i| + c_2 |P''_i|, \qquad (2) $$

where $P'$ and $P''$ denote the finite difference approximations of first and second derivatives.
The coefficients $c_0$, $c_1$, and $c_2$ are included to add flexibility to the scheme. If $c_0$ is the dominant
coefficient, then the grid spacing will be smallest where the function $P$ is greatest. The $c_1$
term causes grid points to cluster where the derivative is largest, such as near shock waves.
The $c_2$ term would cluster points where large changes in the derivative occur, such as near
oscillations in the numerical solution. Nearly all numerical algorithms work best when there
is a smooth change in grid spacing. Since $w$ is defined in terms of $P$ and its differences, it
may change drastically from point to point and from time step to time step. In such cases
it is advisable to smooth the function $P$ before computing $w$. A diffusive smoother is used
which is based on the numerical solution of the heat equation on a nonuniform grid.

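The smoothing formula is motivated by considering again
functions of a continuous variable. Suppose that $P(x)$ is the initial value for the solution of
the heat equation

$$ \frac{\partial u}{\partial t} = \frac{\partial^2 u}{\partial x^2}. $$

This equation can be written in terms of the parametric variable $\xi$ as

$$ \left( \frac{\partial x}{\partial \xi} \right)^{3} \frac{\partial u}{\partial t} = \frac{\partial x}{\partial \xi} \frac{\partial^2 u}{\partial \xi^2} - \frac{\partial^2 x}{\partial \xi^2} \frac{\partial u}{\partial \xi}. \qquad (3) $$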
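
The change of variables can be checked with the chain rule. The short derivation below is a sketch that uses the shorthand $x_\xi = \partial x / \partial \xi$ and $u_\xi = \partial u / \partial \xi$, which is notation introduced here only for brevity:

```latex
\frac{\partial u}{\partial x} = \frac{u_\xi}{x_\xi},
\qquad
\frac{\partial^2 u}{\partial x^2}
  = \frac{1}{x_\xi} \frac{\partial}{\partial \xi}\!\left( \frac{u_\xi}{x_\xi} \right)
  = \frac{x_\xi\, u_{\xi\xi} - x_{\xi\xi}\, u_\xi}{x_\xi^{3}} .
```

Substituting this into $u_t = u_{xx}$ and multiplying through by $x_\xi^3$ gives $x_\xi^3 u_t = x_\xi u_{\xi\xi} - x_{\xi\xi} u_\xi$, which is the parametric form of the heat equation.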

Now consider the discretization of the above parabolic equation with the time step chosen
locally so that the maximum stable time step is used at each point. This is the procedure
that was used to arrive at the smoothing formula used in our algorithm. The value of $P$
on the right hand side of the following equation is the input grid function and the value on
the left is the smoothed value computed by taking one time step in the numerical solution
of equation (3).

$$ \tilde{P}_i = P_i + \tau \left( P_{i+1} + P_{i-1} - 2 P_i - \tfrac{1}{2} \sigma \left( P_{i+1} - P_{i-1} \right) \right) $$

where

$$ \sigma = 2\, \frac{\Delta x_{i+1} - \Delta x_{i-1}}{\Delta x_{i+1} + \Delta x_{i-1} + 2 \Delta x_i}. $$

The parameter $\tau$ is chosen from the stability analysis. Note that $\sigma$ is due to the
nonuniformity of the grid spacing. After each smoothing sweep, the endpoint values are reset to the
values of their interior neighbors.

$$ P_1 = P_2, \qquad P_{i_{max}} = P_{i_{max}-1}. $$


At least one smoothing step is recommended; however, it should be noted that repeated
smoothing will reduce the function $P$ defined in (1), and hence $w$, to a constant which
would result in a uniform grid. The smoothing formula used here has worked better than
the usual averaging techniques since it takes into consideration the grid distribution.
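
One smoothing sweep of this kind can be sketched in Python with NumPy. This is a minimal sketch under stated assumptions, not the HULL implementation: the function name `smooth_once` is hypothetical, the value `tau=0.25` is an arbitrary illustrative choice (the text only says the parameter comes from the stability analysis), and the factor of one half on the nonuniformity term follows from central differencing of the parametric heat equation at the cell centers.

```python
import numpy as np


def smooth_once(P, dx, tau=0.25):
    """Apply one diffusive smoothing sweep to cell values on a nonuniform grid.

    P   -- cell values P_1 ... P_n (n >= 3)
    dx  -- cell widths, same length as P
    tau -- relaxation parameter (hypothetical value, see lead-in)
    """
    P = np.asarray(P, dtype=float)
    dx = np.asarray(dx, dtype=float)
    out = P.copy()

    # Nonuniformity factor for every interior cell.
    sigma = 2.0 * (dx[2:] - dx[:-2]) / (dx[2:] + dx[:-2] + 2.0 * dx[1:-1])

    # One explicit step of the heat equation written in the parametric variable.
    out[1:-1] = P[1:-1] + tau * (
        P[2:] + P[:-2] - 2.0 * P[1:-1] - 0.5 * sigma * (P[2:] - P[:-2])
    )

    # Reset the endpoint values to those of their interior neighbors.
    out[0] = out[1]
    out[-1] = out[-2]
    return out


# A single spike on a uniform grid is spread to its neighbors.
smoothed = smooth_once([0.0, 0.0, 4.0, 0.0, 0.0], [1.0, 1.0, 1.0, 1.0, 1.0])
```

With this form of the nonuniformity term, an interior value that already varies linearly with the cell-center coordinate is left unchanged, which is the behavior expected from a discrete heat equation.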

The weight function defined in (2) is for rezoning in the x-direction and will henceforth
be denoted as $w_x$. The value of the weight function at $\xi = i$ is indicated by $w_{x,i}$. If x is
replaced by y and i is replaced by j, an analogous weight function $w_y$ can be derived for
rezoning in the y-direction. Now the maximum weight function value in both directions can
be computed as

$$ W_{max} = \max \left\{ \max_i w_{x,i},\ \max_j w_{y,j} \right\}. \qquad (4) $$

A parameter $\omega > 1$ will be introduced which will determine the degree to which the grid
adapts to the weight functions. In order to see how this is done, consider the grid function

$$ W_i = 1 + (\omega - 1)\, \frac{w_{x,i}}{W_{max}}. \qquad (5) $$

We are going to rezone so that the intervals $\Delta x_i$ for the new grid will satisfy

$$ \Delta x_i\, W_i = C_x \qquad (6) $$

where $C_x$ is a constant to be determined. Now it can be seen that if $w_x$ is near the maximum
value $W_{max}$, then $W_i$ is approximately equal to $\omega$ and $\Delta x_i \approx C_x / \omega$. On the other hand, if
$w_x$ is much less than $W_{max}$, then $W_i$ is nearly 1 and $\Delta x_i \approx C_x$. Therefore, when $w_x$ assumes
values near both extremes, the quantity $\omega$ is approximately the ratio of the maximum to
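minimum grid spacing. To determine the constant $C_x$, note that

$$ x_{i_{max}} - x_0 = \sum_{i=1}^{i_{max}} \Delta x_i = C_x \sum_{i=1}^{i_{max}} \frac{1}{W_i} $$

which implies that

$$ C_x = \left( x_{i_{max}} - x_0 \right) \Big/ \sum_{i=1}^{i_{max}} \frac{1}{W_i}. \qquad (7) $$

The new grid intervals along the x-axis can be computed from (6) as

$$ \Delta x_i = \frac{C_x}{W_i} $$

with $C_x$ given in (7) and $W_i$ given in (5), and the coordinates along the axis given as

$$ x_i = x_0 + \sum_{k=1}^{i} \Delta x_k. $$

A new distribution of grid points along the y-axis can also be computed using the following
analogous formulas.

$$ y_j = y_0 + \sum_{k=1}^{j} \Delta y_k, \qquad \Delta y_j = \frac{C_y}{W_j}, $$

$$ C_y = \left( y_{j_{max}} - y_0 \right) \Big/ \sum_{j=1}^{j_{max}} \frac{1}{W_j}, \qquad W_j = 1 + (\omega - 1)\, \frac{w_{y,j}}{W_{max}}. $$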
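
The rezone steps along one axis (weight function, grid function, normalizing constant, new intervals, new coordinates) can be sketched in Python with NumPy. This is a minimal sketch under stated assumptions rather than the HULL implementation: the function names are hypothetical, the derivatives in the weight function are approximated with `numpy.gradient` in index space (the text does not specify the difference formulas), and an all-zero weight is treated as a request for a uniform grid.

```python
import numpy as np


def weight_function(P, c0=1.0, c1=1.0, c2=0.0):
    """Weight w_i = c0|P_i| + c1|P'_i| + c2|P''_i| (differences in index space)."""
    P = np.asarray(P, dtype=float)
    d1 = np.gradient(P)   # assumed first-difference approximation
    d2 = np.gradient(d1)  # assumed second-difference approximation
    return c0 * np.abs(P) + c1 * np.abs(d1) + c2 * np.abs(d2)


def rezone_axis(x, w, omega=5.0, w_max=None):
    """Return new grid-line coordinates along one axis.

    x     -- old grid lines x_0 ... x_n (length n + 1)
    w     -- weights for the n cells
    omega -- approximate ratio of maximum to minimum spacing
    w_max -- maximum weight over both directions; defaults to max(w)
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    if w_max is None:
        w_max = w.max()
    if w_max > 0.0:
        W = 1.0 + (omega - 1.0) * w / w_max     # grid function
    else:
        W = np.ones_like(w)                     # no variation: uniform grid
    C = (x[-1] - x[0]) / np.sum(1.0 / W)        # normalizing constant
    dx_new = C / W                              # new intervals
    return x[0] + np.concatenate(([0.0], np.cumsum(dx_new)))


# Four cells on [0, 4]; only the third cell carries weight.
new_x = rezone_axis([0.0, 1.0, 2.0, 3.0, 4.0], [0.0, 0.0, 1.0, 0.0], omega=5.0)
```

In this example the weighted cell shrinks to one fifth of the width of the others while the endpoints of the axis stay fixed. Passing the same `w_max` to the calls for both axes reproduces the shared maximum over the two directions.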

3  COMPUTATIONAL  RESULTS 


The adaptive rezoning procedure was used in the solution of several different types of
problems involving the modeling of explosions. In all cases the grid was dynamically coupled
with the solution so that the grid moved at every tenth time step. The interval of ten time
steps between updates of the grid was chosen mostly by experience. It is often enough so
that the grid is able to keep up with the solution. It is possible to rezone at each time step,
but that is not advisable for two reasons. First of all, the computation time increases
significantly, and secondly, the rezone procedure tends to smooth the solution so that pressure
peaks are excessively damped. The rezone procedure was most successful in improving the
qualitative nature of the numerical solution. It greatly reduced the oscillations (or ringing)
in the solution without the addition of artificial viscosity. The adaptive grid algorithm was
implemented with $\omega = 5$ to give a ratio of five for the maximum to minimum grid spacing.
This had the effect of reducing the grid spacing in the neighborhood of the shock by a factor
of approximately three.

The gridding scheme was capable of producing extremely fine grids near the shock, but
the HULL code documentation [2] recommended that the grid aspect ratio not exceed three.
This ratio was exceeded on some of our computations with no noticeable loss of accuracy,
but in some cases erroneous results were obtained when the aspect ratio was extremely
large. The weight function was computed with $c_0 = c_1 = 1$ and $c_2 = 0$, since it is doubtful
that the second derivatives can be reliably approximated for these types of problems. Only
one smoothing iteration was used.

The first example is the computation of the underwater explosion of an infinite
cylindrical charge of TNT as depicted in Figure 1. This was solved as a one-dimensional problem
with a 500 by 2 grid. Figures 2 and 3 may be used to compare the solutions at t = 0.01
seconds computed using a uniform and an adaptive grid. The location of every tenth cell is
indicated  along  the  top  borders  of  the  plots.  The  adaptive  grid  clearly  generated  a  smoother 
and  more  realistic  pressure  profile.  However,  neither  peak  pressure  was  near  the  theoretical 
value  in  Cole  [3].  It  would  have  taken  three  times  as  many  grid  points  with  a  uniform  grid 
to  achieve  the  same  resolution  near  the  shock  wave  as  was  achieved  using  the  adaptive  grid. 
The  lower  peak  pressure  with  the  adaptive  grid  is  primarily  due  to  the  smoothing  effect  of 
the  interpolation  between  grids. 

The  next  example  is  the  solution  of  an  axisymmetric  problem.  A  charge  of  TNT  in  the 
shape  of  a  cylinder  is  detonated  underwater.  The  initial  stage  of  the  solution,  illustrated 
in  the  axial  and  radial  coordinate  system,  appears  in  Figure  4.  The  solutions  computed  on 
the  uniform  and  adaptive  grids  are  compared  at  two  different  times.  Figure  5  is  a  plot  of 
the  solution  on  the  uniform  grid  at  an  early  time  when  the  shock  wave  is  near  the  charge. 
Figure  6  is  the  solution  on  the  adaptive  grid  at  the  same  time.  Note  the  high  concentration 
of  grid  points  near  the  charge  that  is  needed  at  the  early  stage  of  the  computations  in 
order  to  keep  oscillations  from  initiating  and  propagating.  Figures  7  and  8  are  plots  of 
solutions  on  the  uniform  and  adaptive  grids  at  a  later  time  when  the  shock  wave  is  near 
the  outer  boundary  of  the  computational  region.  Oscillations  in  the  solution  computed  on 
the  uniform  grid  can  be  observed  especially  near  the  axis  of  symmetry.  The  solution  also 
fails  to  develop  into  a  spherically  symmetric  solution  as  discussed  by  Cole  [3].  On  the  other 
hand,  the  solution  computed  on  the  adaptive  grid  develops  into  a  nearly  spherical  solution 
with  very  little  indication  of  grid  orientation  effects. 

The  adaptive  rezoning  scheme  has  also  been  used  in  the  computation  of  airblasts.  A 
spherical  charge  of  HMX  explosive  is  detonated  in  the  atmosphere  above  the  soil  surface  as 
illustrated  in  Figure  9.  The  solutions  on  the  uniform  and  adaptive  grids  before  the  blast 
front  reaches  the  surface  are  plotted  in  Figures  10  and  11.  The  grid  for  this  problem  is 
sufficiently  fine  so  that  both  solutions  appear  quite  reasonable.  However,  with  the  uniform 
grid  there  is  a  noticeable  perturbation  in  what  should  be  circular  contour  lines  near  the 
axis of symmetry. Both solutions were continued and correctly modeled the reflection of
the shock wave off the soil surface. The solution on the uniform grid is plotted in Figure
12.  The  solution  on  the  adaptive  grid  appears  in  Figure  13.  Note  the  concentration  of  grid 
points  at  the  surface.  This  is  due  to  the  large  pressure  gradient  between  the  air  and  soil 
which  did  not  allow  the  grid  to  follow  the  reflected  shock. 

The  final  example  is  the  computation  of  an  underground  explosion.  A  spherical  charge 
of  nitromethane  is  buried  below  the  surface  of  the  soil  as  indicated  in  Figure  14.  The 
computed  solutions  on  a  uniform  and  adaptive  grid  are  plotted  in  Figures  15  and  16.  The 
coarse  grid  effects  of  the  uniform  grid  are  clearly  evident  in  the  noncircular  contour  lines. 
Both  of  these  solutions  were  continued,  but  neither  grid  was  capable  of  correctly  modeling 
the  shock  wave  as  it  passed  through  the  air/soil  interface. 


4  CONCLUSIONS 


It  has  been  demonstrated  that  adaptive  rezoning  can  significantly  improve  the  quality  of 
numerical solutions computed on a Cartesian grid system. Grid orientation effects were
noticeably reduced in the examples included in this report. For problems involving explosions,
the  adaptive  grid  algorithm  can  be  automated  so  that  the  grid  points  are  concentrated  in 
regions  of  high  pressure  gradient. 

It  should  be  noted  that  the  capability  of  rezoning  is  limited  by  the  fact  that  the  HULL 
code  uses  only  Cartesian  grids.  The  ability  to  refine  the  grid  locally  would  be  a  convenient 
feature  in  adapting  the  grid  to  the  solution,  but  that  would  require  a  major  restructuring 
of  the  code. 

Figure 1. Initial configuration for an underwater explosion of an infinite cylindrical charge of TNT.
Figure 2. Pressure values calculated on a uniform grid at t=10 milliseconds.
Figure 3. Pressure values calculated on an adaptive grid at t=10 milliseconds.
Figure 4. Initial configuration for an underwater explosion of a finite cylindrical charge of TNT.
Figure 5. Pressure contours and velocity vectors calculated on a uniform grid at t=200 microseconds.
Figure 6. Pressure contours and velocity vectors calculated on an adaptive grid at t=200 microseconds.
Figure 7. Pressure contours and velocity vectors calculated on a uniform grid at t=800 microseconds.
Figure 8. Pressure contours and velocity vectors calculated on an adaptive grid at t=800 microseconds.
Figure 9. Initial configuration for an atmospheric explosion of a spherical charge of HMX.
Figure 10. Pressure contours and velocity vectors calculated on a uniform grid at t=500 microseconds.
Figure 11. Pressure contours and velocity vectors calculated on an adaptive grid at t=500 microseconds.
Figure 12. Pressure contours and velocity vectors calculated on a uniform grid at t=2 milliseconds.
Figure 13. Pressure contours and velocity vectors calculated on an adaptive grid at t=2 milliseconds.
Figure 14. Initial configuration for an underground explosion of a spherical charge of nitromethane.
Figure 15. Pressure contours and velocity vectors calculated on a uniform grid at t=400 microseconds.
Figure 16. Pressure contours and velocity vectors calculated on an adaptive grid at t=400 microseconds.

Abstract

The idea of using estimation algebras to construct finite dimensional nonlinear filters was first proposed
in Brockett and Clark [1], Brockett [2] and Mitter [3]. In his famous talk at the International Congress of
Mathematicians in 1983, Brockett proposed to classify all finite dimensional estimation algebras. An affirmative
solution to Brockett's problem will allow us to construct all possible finite dimensional recursive filters from
the Lie algebraic point of view.

In 1990, the first named author [21] considered a general class of nonlinear filtering systems which
includes both Kalman-Bucy filtering systems and Benes filtering systems as special cases. A simple algebraic
necessary and sufficient condition was established for an estimation algebra of this class of filtering systems
to be finite dimensional. Consequently, he rigorously constructed a new class of finite dimensional filters
which includes both Kalman-Bucy filters and Benes filters as special cases. Note that the method used in
[21] computes the fundamental solution of the D-M-Z equation and hence it also solves the filtering problem
with non-Gaussian initial conditions. In [5,22], the concept of an estimation algebra with maximal rank
was introduced. This is the most important general subclass of estimation algebras because there is no
assumption on the drift term of the nonlinear filtering system. The first named author and Chiou have
already classified all maximal rank finite dimensional estimation algebras with state space dimension at most
2 [5,22]. In this report we continue the project and study the case of state space dimension 3. Consequently,
we have shown that, at least for low dimensional state space, the finite dimensional filters constructed in [21]
are the most general filters from the Lie algebraic point of view.


1.  Problem  formulation 

The filtering problem considered here is based on the following signal observation model:

(1.1)  $$\begin{cases} dx(t) = f(x(t))\,dt + g(x(t))\,dv(t), & x(0) = x_0 \\ dy(t) = h(x(t))\,dt + dw(t), & y(0) = y_0 \end{cases}$$

in which $x$, $v$, $y$ and $w$ are respectively $\mathbb{R}^n$, $\mathbb{R}^p$, $\mathbb{R}^m$ and $\mathbb{R}^m$ valued processes, and $v$ and $w$ have components
which are independent, standard Brownian processes. We further assume that $n = p$, that $f$ and $h$ are $C^\infty$ smooth,
and that $g$ is an orthogonal matrix. We will refer to $x(t)$ as the state of the system at time $t$ and to $y(t)$ as
the observation at time $t$.

Let $\rho(t,x)$ denote the conditional density of the state given the observation $\{y(s) : 0 \le s \le t\}$. It is
well known (see [8], for example) that $\rho(t,x)$ is given by normalizing a function $\sigma(t,x)$ which satisfies the
following Duncan-Mortensen-Zakai equation:

(1.2)  $$d\sigma(t,x) = L_0\sigma(t,x)\,dt + \sum_{i=1}^m L_i\sigma(t,x)\,dy_i(t), \qquad \sigma(0,x) = \sigma_0$$

where

$$L_0 = \frac{1}{2}\sum_{i=1}^n \frac{\partial^2}{\partial x_i^2} - \sum_{i=1}^n f_i\frac{\partial}{\partial x_i} - \sum_{i=1}^n \frac{\partial f_i}{\partial x_i} - \frac{1}{2}\sum_{i=1}^m h_i^2$$

and for $i = 1, \dots, m$, $L_i$ is the zero order differential operator of multiplication by $h_i$. $\sigma_0$ is the probability
density of the initial point $x_0$. In this paper, we will assume $\sigma_0$ is a $C^\infty$ function.

Definition : If $X$ and $Y$ are differential operators, the Lie bracket of $X$ and $Y$, $[X, Y]$, is defined by

$$[X, Y]\varphi = X(Y\varphi) - Y(X\varphi)$$

for any $C^\infty$ function $\varphi$.
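
To make the definition concrete, the following minimal sketch evaluates a Lie bracket on polynomials in one variable, with $X = d/dx$ (a first order operator) and $Y$ = multiplication by $x$ (a zero order operator). The list-of-coefficients representation and the helper names `deriv`, `mul_x`, `sub` and `bracket` are assumptions made for this illustration; the paper itself works with operators in $n$ variables.

```python
def deriv(p):
    """d/dx of a polynomial given as coefficients [c0, c1, c2, ...]."""
    return [k * p[k] for k in range(1, len(p))] or [0]

def mul_x(p):
    """Multiplication by x."""
    return [0] + list(p)

def sub(p, q):
    """Difference p - q, with trailing zero coefficients removed."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    r = [a - b for a, b in zip(p, q)]
    while len(r) > 1 and r[-1] == 0:
        r.pop()
    return r

def bracket(X, Y):
    """Return the operator [X, Y] = XY - YX."""
    return lambda p: sub(X(Y(p)), Y(X(p)))

phi = [3, 0, -2, 5]            # 3 - 2x^2 + 5x^3
comm = bracket(deriv, mul_x)   # [d/dx, x]
result = comm(phi)
```

Here `result` equals `phi`, reflecting the identity $[d/dx, x]\varphi = \varphi$; this is the one-variable analogue of $[D_i, x_j] = \delta_{ij}$, which is what makes $D_j = [L_0, x_j + c_j]$ available in a maximal rank estimation algebra.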


Definition : The estimation algebra $E$ of a filtering problem (1.1) is defined to be the Lie algebra generated
by $\{L_0, L_1, \dots, L_m\}$, or $E = \langle L_0, L_1, \dots, L_m \rangle_{L.A.}$. If in addition there exists a potential function $\varphi$ such
that $f_i = \partial\varphi/\partial x_i$ for all $1 \le i \le n$, then the estimation algebra is called exact.

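Define

$$D_i = \frac{\partial}{\partial x_i} - f_i, \qquad \eta = \sum_{i=1}^n \frac{\partial f_i}{\partial x_i} + \sum_{i=1}^n f_i^2 + \sum_{i=1}^m h_i^2.$$

Then

$$L_0 = \frac{1}{2}\Big(\sum_{i=1}^n D_i^2 - \eta\Big).$$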


We  need  the  following  basic  results  for  later  discussion. 

Theorem 1. (Ocone) Let $E$ be a finite dimensional estimation algebra. If a function $\xi \in E$, then $\xi$ must
be at most a quadratic polynomial.

Ocone's theorem ([12], see [6] for an extension) says that $h_1, \dots, h_m$ in a finite dimensional estimation
algebra are polynomials of degree $\le 2$.

Definition : The estimation algebra $E$ of a filtering problem (1.1) is said to be the estimation algebra
with maximal rank if $x_i + c_i \in E$ for all $1 \le i \le n$, where $c_i$ is a constant.

Remark : If $m \ge n$ and $h_i(x) = \sum_{j=1}^n a_{ij}x_j + c_i$ for $1 \le i \le m$, with $A = (a_{ij})_{n \times n}$ invertible, then $E$ is of
maximal rank.

The  following  theorem  in  [20]  plays  a  fundamental  role  in  the  classification  of  all  finite  dimensional 
estimation  algebras. 

Theorem 2. Let $E$ be a finite dimensional estimation algebra of (1.1) satisfying $\omega_{ij} = \frac{\partial f_j}{\partial x_i} - \frac{\partial f_i}{\partial x_j}$, where
$\omega_{ij}$ are constants for all $1 \le i, j \le n$. Then $h_1, \dots, h_m$ are polynomials of degree at most one. Furthermore,
if $E$ is of maximal rank, then $\eta$ is a polynomial of degree at most 2 and $E$ is a real Lie algebra spanned by
$\{1, x_i, D_i, L_0 : 1 \le i \le n\}$.


2. Classification of finite dimensional maximal rank estimation algebras
§2.1 Classification Theorem

Main Theorem : Suppose that the state space of the filtering system (1.1) is of dimension three. If $E$
is the finite dimensional estimation algebra with maximal rank, then $E$ is a real Lie algebra of dimension 8
with basis given by $\{1, x_1, x_2, x_3, D_1, D_2, D_3, L_0\}$.

In light of Theorem 2 above, it suffices for us to establish that $\omega_{ij}$ = constant for all $i, j$. Since $\omega_{ji} = [D_i, D_j]$
must be in $E$, $\omega_{ij}$ is a polynomial of degree at most two by Theorem 1.

First of all, we will show that the $\omega_{ij}$'s are polynomials of degree at most one.

Let $w_{ij}$ = degree 2 homogeneous part of $\omega_{ij}$:

$$w_{12} = a_{12}x_1^2 + b_{12}x_2^2 + c_{12}x_3^2 + d_{12}x_1x_2 + e_{12}x_1x_3 + f_{12}x_2x_3$$

$$w_{13} = a_{13}x_1^2 + b_{13}x_2^2 + c_{13}x_3^2 + d_{13}x_1x_2 + e_{13}x_1x_3 + f_{13}x_2x_3$$

$$w_{23} = a_{23}x_1^2 + b_{23}x_2^2 + c_{23}x_3^2 + d_{23}x_1x_2 + e_{23}x_1x_3 + f_{23}x_2x_3$$
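$$\frac{\partial f_2}{\partial x_1} - \frac{\partial f_1}{\partial x_2} = \omega_{12} = w_{12} + p_{12}$$

$$\frac{\partial f_3}{\partial x_1} - \frac{\partial f_1}{\partial x_3} = \omega_{13} = w_{13} + p_{13}$$

$$\frac{\partial f_3}{\partial x_2} - \frac{\partial f_2}{\partial x_3} = \omega_{23} = w_{23} + p_{23}$$

where $p_{ij}$ is the linear part of $\omega_{ij}$. Then we have the following basic cyclic relationship, which is also true for
general state dimension $n \ge 3$:

(2.1)  $$\frac{\partial \omega_{23}}{\partial x_1} - \frac{\partial \omega_{13}}{\partial x_2} + \frac{\partial \omega_{12}}{\partial x_3} = 0$$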
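
The cyclic relationship can be checked directly from the definition $\omega_{ij} = \partial f_j/\partial x_i - \partial f_i/\partial x_j$, assuming only that $f$ is smooth so that mixed partial derivatives commute. A short derivation for $n = 3$:

```latex
\begin{align*}
\frac{\partial\omega_{23}}{\partial x_1}
 - \frac{\partial\omega_{13}}{\partial x_2}
 + \frac{\partial\omega_{12}}{\partial x_3}
&= \frac{\partial}{\partial x_1}\Big(\frac{\partial f_3}{\partial x_2} - \frac{\partial f_2}{\partial x_3}\Big)
 - \frac{\partial}{\partial x_2}\Big(\frac{\partial f_3}{\partial x_1} - \frac{\partial f_1}{\partial x_3}\Big)
 + \frac{\partial}{\partial x_3}\Big(\frac{\partial f_2}{\partial x_1} - \frac{\partial f_1}{\partial x_2}\Big) \\
&= 0 .
\end{align*}
```

Each second derivative of $f_1$, $f_2$, $f_3$ appears twice with opposite signs. Taking the degree-one part of this identity gives the same relation for the homogeneous quadratic parts $w_{ij}$, which is the form used in Lemmas 1 and 2.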

Lemma 1: $W = (w_{ij})_{3 \times 3} = (0)_{3 \times 3}$ if any two of the $w_{ij}$'s with $i < j$ are 0.

Proof : There are $\binom{3}{2} = 3$ cases: (i) $w_{13} = w_{23} = 0$, (ii) $w_{23} = w_{12} = 0$, (iii) $w_{12} = w_{13} = 0$. Writing

$$\Omega = \begin{pmatrix} 0 & \omega_{12} & \omega_{13} \\ -\omega_{12} & 0 & \omega_{23} \\ -\omega_{13} & -\omega_{23} & 0 \end{pmatrix},$$

then $\Omega\nabla\omega_{jk}$ is a vector with entries in $E$ (by considering $[[L_0, D_i], \omega_{jk}]$). For case (i), it suffices for us
to consider the operators $w_{12}\,\partial/\partial x_2$ and $w_{12}\,\partial/\partial x_1$ only. Operating them on $w_{12}$ and using Ocone's result, we conclude
that $W$ does not depend on $x_1$, $x_2$, i.e. $w_{12} = c_{12}x_3^2$, $w_{13} = w_{23} = 0$. The above cyclic relationship (2.1)
implies that $2c_{12}x_3$ is a constant, hence $c_{12}$ must be zero. Cases (ii) and (iii) can be treated similarly and we
are done, i.e. $W = (0)_{3 \times 3}$ as desired.

Q.E.D.

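Lemma 2: If one of the $w_{ij}$'s is zero where $i \ne j$, we have the following relationships among the coefficients
in the $w_{ij}$'s:

(i) $w_{23} = 0 \Rightarrow \frac{\partial w_{13}}{\partial x_2} - \frac{\partial w_{12}}{\partial x_3} = 0$ and $d_{13} = e_{12}$, $2b_{13} = f_{12}$, $f_{13} = 2c_{12}$;

(ii) $w_{12} = 0 \Rightarrow \frac{\partial w_{23}}{\partial x_1} - \frac{\partial w_{13}}{\partial x_2} = 0$ and $d_{13} = 2a_{23}$, $2b_{13} = d_{23}$, $f_{13} = e_{23}$;

(iii) $w_{13} = 0 \Rightarrow \frac{\partial w_{23}}{\partial x_1} + \frac{\partial w_{12}}{\partial x_3} = 0$ and $e_{12} = -2a_{23}$, $2c_{12} = -e_{23}$, $f_{12} = -d_{23}$.

Proof : These are easily verified by equation (2.1). Q.E.D.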

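§2.2 $\Omega$ is linear

For the proof of $W = (0)_{3 \times 3}$ in general, we define the operators:

$$\mathcal{D}_1 := w_{12}\frac{\partial}{\partial x_2} + w_{13}\frac{\partial}{\partial x_3}$$

$$\mathcal{D}_2 := -w_{12}\frac{\partial}{\partial x_1} + w_{23}\frac{\partial}{\partial x_3}$$

$$\mathcal{D}_3 := -w_{13}\frac{\partial}{\partial x_1} - w_{23}\frac{\partial}{\partial x_2}$$

Since $E$ contains $Y_j = [L_0, D_j]$, so $[Y_j, \omega_{kl}] = \sum_{i=1}^3 \omega_{ji}\frac{\partial}{\partial x_i}(\omega_{kl}) \in E$, i.e. $\sum_{i=1}^3 \omega_{ji}\frac{\partial}{\partial x_i}(\omega_{kl})$ are polynomials
of degree at most two. We may set the coefficients of terms with degree 3 equal to zero. But these terms arise
from degree 2 terms of the $\omega_{ij}$'s only, therefore it suffices for us to consider $w_{ij}$ instead of $\omega_{ij}$ in the following
calculation.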
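Let

$\mathcal{D}_i w_{jk}\,300$ := coefficient of $x_1^3$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,030$ := coefficient of $x_2^3$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,003$ := coefficient of $x_3^3$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,210$ := coefficient of $x_1^2x_2$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,201$ := coefficient of $x_1^2x_3$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,120$ := coefficient of $x_1x_2^2$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,021$ := coefficient of $x_2^2x_3$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,102$ := coefficient of $x_1x_3^2$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,012$ := coefficient of $x_2x_3^2$ of $\mathcal{D}_i(w_{jk})$

$\mathcal{D}_i w_{jk}\,111$ := coefficient of $x_1x_2x_3$ of $\mathcal{D}_i(w_{jk})$

Objective : To show that all coefficients of the $w_{ij}$'s are zero, i.e. the system of 90 equations has only the trivial
solution.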
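
Any of the 90 coefficients can be checked mechanically. The sketch below is a small, self-contained polynomial calculator (monomials of $x_1, x_2, x_3$ stored as exponent triples) that applies the three first-order operators of this section to a quadratic form and reads off one cubic coefficient. The numerical values assigned to the 18 coefficients are arbitrary test data, and the helper names (`quad`, `diff`, `mul`, `add`, `D`) are assumptions made for this illustration.

```python
from itertools import product

def quad(a, b, c, d, e, f):
    """a*x1^2 + b*x2^2 + c*x3^2 + d*x1*x2 + e*x1*x3 + f*x2*x3."""
    return {(2, 0, 0): a, (0, 2, 0): b, (0, 0, 2): c,
            (1, 1, 0): d, (1, 0, 1): e, (0, 1, 1): f}

def diff(p, var):
    """Partial derivative with respect to x_(var+1)."""
    out = {}
    for mono, coef in p.items():
        if mono[var] > 0:
            m = list(mono)
            m[var] -= 1
            out[tuple(m)] = out.get(tuple(m), 0) + coef * mono[var]
    return out

def mul(p, q):
    """Product of two polynomials."""
    out = {}
    for (m1, c1), (m2, c2) in product(p.items(), q.items()):
        m = tuple(i + j for i, j in zip(m1, m2))
        out[m] = out.get(m, 0) + c1 * c2
    return out

def add(p, q, sign=1):
    """Return p + sign*q."""
    out = dict(p)
    for m, c in q.items():
        out[m] = out.get(m, 0) + sign * c
    return out

def D(i, w12, w13, w23, target):
    """Apply the first-order operator D_i of Section 2.2 to `target`."""
    if i == 1:   # w12 d/dx2 + w13 d/dx3
        return add(mul(w12, diff(target, 1)), mul(w13, diff(target, 2)))
    if i == 2:   # -w12 d/dx1 + w23 d/dx3
        return add(mul(w23, diff(target, 2)), mul(w12, diff(target, 0)), -1)
    # i == 3: -w13 d/dx1 - w23 d/dx2
    neg = add({}, mul(w13, diff(target, 0)), -1)
    return add(neg, mul(w23, diff(target, 1)), -1)

# Arbitrary integer test values for the 18 coefficients (assumed).
c12 = dict(a=2, b=-3, c=5, d=7, e=-1, f=4)
c13 = dict(a=1, b=6, c=3, d=-2, e=8, f=-5)
c23 = dict(a=-4, b=2, c=9, d=-6, e=1, f=3)
w12, w13, w23 = quad(**c12), quad(**c13), quad(**c23)

# Coefficient of x3^3 in D_1(w12), to compare with c12*(f12 + 2*c13).
coeff_003 = D(1, w12, w13, w23, w12).get((0, 0, 3), 0)
expected = c12["c"] * (c12["f"] + 2 * c13["c"])
```

For these test values `coeff_003` and `expected` agree, matching the first coefficient used in the argument that follows. The same function can be pointed at any other operator, target and monomial to check an entry of the appendix.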

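Observe that $\mathcal{D}_1 w_{12}\,003 = c_{12}(f_{12} + 2c_{13})$.

Suppose $c_{12} \ne 0$. $\Rightarrow f_{12} = -2c_{13}$, $\mathcal{D}_1 w_{13}\,030 = b_{13}(f_{13} + 2b_{12})$.

Suppose $b_{13} \ne 0$. $\Rightarrow f_{13} = -2b_{12}$ $\Rightarrow$ $\mathcal{D}_2 w_{12}\,003 = c_{12}(2c_{23} - e_{12})$, $\mathcal{D}_3 w_{13}\,030 = -b_{13}(d_{13} + 2b_{23})$
$\Rightarrow e_{12} = 2c_{23}$, $d_{13} = -2b_{23}$ $\Rightarrow$ $\mathcal{D}_2 w_{23}\,300 = a_{23}(e_{23} - 2a_{12})$, $\mathcal{D}_3 w_{23}\,300 = -a_{23}(d_{23} + 2a_{13})$.

Suppose $a_{23} \ne 0$. $\Rightarrow e_{23} = 2a_{12}$, $d_{23} = -2a_{13}$ $\Rightarrow$ $\mathcal{D}_1 w_{23}\,030 = 2b_{12}b_{23} + b_{13}f_{23}$, $\mathcal{D}_3 w_{12}\,030 = -2b_{12}b_{23} - b_{13}d_{12}$.
Adding these two gives $b_{13}(f_{23} - d_{12}) = 0$. Since $b_{13} \ne 0$ is assumed, we have $f_{23} = d_{12}$.

Suppose $d_{12} \ne 0$. $\Rightarrow b_{12}, b_{23}, c_{13}, c_{23}, a_{12}, a_{13} \ne 0$ since $\mathcal{D}_1 w_{23}\,030 = 2b_{12}b_{23} + d_{12}b_{13}$, $\mathcal{D}_1 w_{23}\,003 = c_{12}d_{12} + 2c_{23}c_{13}$,
$\mathcal{D}_2 w_{23}\,210 = -2a_{12}a_{13} - d_{12}a_{23}$. Moreover, from $\mathcal{D}_1 w_{12}\,021$, $\mathcal{D}_1 w_{23}\,120$, $\mathcal{D}_2 w_{12}\,201$, and
$\mathcal{D}_2 w_{12}\,021$ we can obtain the following proportional relationships: $a_{23}/a_{12} = c_{23}/c_{12}$, $a_{13}/a_{12} = b_{13}/b_{12} = c_{13}/c_{12}$
and $a_{23}/a_{13} = b_{23}/b_{13} = c_{23}/c_{13}$.

Observe that $w_{23} = a_{23}x_1^2 + b_{23}x_2^2 + c_{23}x_3^2 - 2a_{13}x_1x_2 + 2a_{12}x_1x_3 + d_{12}x_2x_3$. Using the expressions
$-d_{12}/2 = a_{12}a_{13}/a_{23} = b_{12}b_{23}/b_{13} = c_{13}c_{23}/c_{12}$, it can be shown that $a_{23}w_{23} = (a_{23}x_1 - a_{13}x_2 + a_{12}x_3)^2$. Similarly,
$a_{23}^2w_{12} = a_{12}(a_{23}x_1 - a_{13}x_2 + a_{12}x_3)^2$ and $a_{23}^2w_{13} = a_{13}(a_{23}x_1 - a_{13}x_2 + a_{12}x_3)^2$. The cyclic relation (2.1)
implies that

$$0 = \frac{\partial w_{23}}{\partial x_1} - \frac{\partial w_{13}}{\partial x_2} + \frac{\partial w_{12}}{\partial x_3} = 2(a_{23}x_1 - a_{13}x_2 + a_{12}x_3)\Big(1 + \frac{a_{13}^2}{a_{23}^2} + \frac{a_{12}^2}{a_{23}^2}\Big),$$

contradicting that $a_{12}, a_{13}, a_{23} \ne 0$. Hence $d_{12} \ne 0$ under $c_{12}, b_{13}, a_{23} \ne 0$ is impossible.

Then we are working with $d_{12} = 0$ under the assumptions that $c_{12}, b_{13}, a_{23} \ne 0$. $\Rightarrow$ $\mathcal{D}_2 w_{12}\,111 = -4a_{13}c_{12}$
$\Rightarrow a_{13} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{23}\,120 = 2a_{12}b_{13}$, $\mathcal{D}_2 w_{12}\,210 = -2a_{23}c_{13}$, $\mathcal{D}_2 w_{13}\,300 = a_{23}e_{13}$, $\mathcal{D}_3 w_{13}\,300 = -a_{23}d_{13}$,
$\mathcal{D}_3 w_{13}\,210 = -2a_{23}b_{13}$, $\mathcal{D}_3 w_{23}\,030 = -2b_{23}^2$ $\Rightarrow a_{12} = c_{13} = e_{13} = d_{13} = b_{13} = b_{23} = 0$ $\Rightarrow$ $\mathcal{D}_2 w_{12}\,201 = 2a_{23}c_{12}$, contradicting that
$a_{23} \ne 0$ and $c_{12} \ne 0$. So it is impossible for $a_{23} \ne 0$ under $c_{12}, b_{13} \ne 0$.

Then $a_{23} = 0$ under the assumptions that $c_{12}, b_{13} \ne 0$. $\mathcal{D}_2 w_{12}\,300 = -2a_{12}^2$, $\mathcal{D}_3 w_{13}\,300 = -2a_{13}^2$
$\Rightarrow a_{12} = 0$, $a_{13} = 0$ $\Rightarrow$ $\mathcal{D}_2 w_{23}\,201 = e_{23}^2$, $\mathcal{D}_3 w_{23}\,210 = -d_{23}^2$ $\Rightarrow e_{23} = d_{23} = 0$ $\Rightarrow$ $\mathcal{D}_2 w_{23}\,003 = 2c_{23}^2$,
$\mathcal{D}_3 w_{23}\,030 = -2b_{23}^2$ $\Rightarrow c_{23} = b_{23} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{23}\,030 = b_{13}f_{23}$, $\mathcal{D}_2 w_{12}\,003 = -c_{12}e_{12}$, $\mathcal{D}_2 w_{12}\,120 = -d_{12}^2$,
$\mathcal{D}_2 w_{13}\,003 = -e_{13}c_{12}$, $\mathcal{D}_3 w_{13}\,120 = -d_{13}^2$ $\Rightarrow f_{23} = e_{12} = d_{12} = e_{13} = d_{13} = 0$. Now $w_{23} \equiv 0$. (2.1) gives $c_{13} = -b_{13}$.
Then $\mathcal{D}_1 w_{12}\,030 = 2b_{12}^2 - 2b_{13}c_{13} = 2(b_{12}^2 + b_{13}^2)$, so $b_{13} = 0$. Contradiction. So $b_{13} \ne 0$ cannot come
together with $c_{12} \ne 0$.

Then $b_{13} = 0$ under the assumption $c_{12} \ne 0$. $\Rightarrow$ $\mathcal{D}_1 w_{12}\,030 = 2b_{12}^2$, $\mathcal{D}_3 w_{23}\,030 = -2b_{23}^2$ $\Rightarrow b_{12} = 0$,
$b_{23} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{13}\,021 = f_{13}^2$, $\mathcal{D}_3 w_{13}\,120 = -d_{13}^2$ $\Rightarrow f_{13} = 0$, $d_{13} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{12}\,012 = 2c_{13}^2$, $\mathcal{D}_3 w_{13}\,300 = -2a_{13}^2$
$\Rightarrow c_{13} = 0$, $a_{13} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{12}\,210 = d_{12}^2$, $\mathcal{D}_1 w_{13}\,201 = e_{13}^2$, $\mathcal{D}_2 w_{23}\,021 = f_{23}^2$, $\mathcal{D}_3 w_{23}\,210 = -d_{23}^2$ $\Rightarrow d_{12} = e_{13} =
f_{23} = d_{23} = 0$ $\Rightarrow w_{13} \equiv 0$. By Lemma 2, $e_{23} = -2c_{12}$. Together with $\mathcal{D}_2 w_{23}\,003 = -e_{23}c_{12} + 2c_{23}^2$ we obtain
$0 = 2(c_{12}^2 + c_{23}^2)$, contradicting $c_{12} \ne 0$. Then $c_{12}$ cannot be non-zero.

Now we know that $c_{12} = 0$, with no other restrictions. Then $\mathcal{D}_1 w_{13}\,003 = 2c_{13}^2$, $\mathcal{D}_2 w_{23}\,003 = 2c_{23}^2$
$\Rightarrow c_{13} = 0$, $c_{23} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{12}\,012 = f_{12}^2$, $\mathcal{D}_2 w_{12}\,102 = -e_{12}^2$ $\Rightarrow f_{12} = 0$, $e_{12} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{12}\,030 = 2b_{12}^2$,
$\mathcal{D}_2 w_{12}\,300 = -2a_{12}^2$ $\Rightarrow b_{12} = a_{12} = 0$ $\Rightarrow$ $\mathcal{D}_1 w_{12}\,210 = d_{12}^2$, $\mathcal{D}_1 w_{13}\,201 = e_{13}^2$, $\mathcal{D}_1 w_{13}\,021 = f_{13}^2$, $\mathcal{D}_2 w_{23}\,201 =
e_{23}^2$, $\mathcal{D}_2 w_{23}\,021 = f_{23}^2$ $\Rightarrow d_{12} = e_{13} = f_{13} = e_{23} = f_{23} = 0$ $\Rightarrow w_{12} \equiv 0$. Lemma 2 gives $d_{13} = 2a_{23}$, $d_{23} = 2b_{13}$.
Together with $\mathcal{D}_3 w_{13}\,300 = -2a_{13}^2 - a_{23}d_{13}$ and $\mathcal{D}_3 w_{23}\,030 = -d_{23}b_{13} - 2b_{23}^2$, we can conclude that $w_{13} \equiv
w_{23} \equiv 0$ also.

$\Omega = (\omega_{ij})_{3 \times 3}$ is shown to be linear.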
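§2.2.1 Appendix

There are 90 quadratic equations arising from the coefficients:

$\mathcal{D}_1 w_{12}$
300, 030, 003:  $a_{12}d_{12} + a_{13}e_{12}$,  $2b_{12}^2 + b_{13}f_{12}$,  $c_{12}f_{12} + 2c_{13}c_{12}$
210:  $d_{12}^2 + 2a_{12}b_{12} + a_{13}f_{12} + d_{13}e_{12}$
201:  $a_{12}f_{12} + e_{12}d_{12} + 2a_{13}c_{12} + e_{13}e_{12}$
120:  $3b_{12}d_{12} + b_{13}e_{12} + d_{13}f_{12}$
021:  $3b_{12}f_{12} + 2b_{13}c_{12} + f_{13}f_{12}$
102:  $c_{12}d_{12} + e_{12}f_{12} + c_{13}e_{12} + 2e_{13}c_{12}$
012:  $f_{12}^2 + 2c_{12}b_{12} + c_{13}f_{12} + 2f_{13}c_{12}$
111:  $2d_{12}f_{12} + 2b_{12}e_{12} + 2d_{13}c_{12} + e_{13}f_{12} + f_{13}e_{12}$

$\mathcal{D}_1 w_{13}$
300, 030, 003:  $a_{12}d_{13} + a_{13}e_{13}$,  $2b_{12}b_{13} + b_{13}f_{13}$,  $2c_{13}^2 + f_{13}c_{12}$
210:  $2a_{12}b_{13} + d_{12}d_{13} + a_{13}f_{13} + d_{13}e_{13}$
201:  $a_{12}f_{13} + d_{13}e_{12} + 2a_{13}c_{13} + e_{13}^2$
120:  $b_{12}d_{13} + 2d_{12}b_{13} + b_{13}e_{13} + d_{13}f_{13}$
021:  $b_{12}f_{13} + 2b_{13}f_{12} + 2b_{13}c_{13} + f_{13}^2$
102:  $d_{13}c_{12} + f_{13}e_{12} + 3c_{13}e_{13}$
012:  $2b_{13}c_{12} + f_{13}f_{12} + 3c_{13}f_{13}$
111:  $d_{12}f_{13} + 2b_{13}e_{12} + d_{13}f_{12} + 2d_{13}c_{13} + 2e_{13}f_{13}$

$\mathcal{D}_1 w_{23}$
300, 030, 003:  $a_{12}d_{23} + a_{13}e_{23}$,  $2b_{12}b_{23} + b_{13}f_{23}$,  $c_{12}f_{23} + 2c_{13}c_{23}$
210:  $2a_{12}b_{23} + d_{12}d_{23} + a_{13}f_{23} + d_{13}e_{23}$
201:  $a_{12}f_{23} + e_{12}d_{23} + 2a_{13}c_{23} + e_{13}e_{23}$
120:  $b_{12}d_{23} + 2d_{12}b_{23} + b_{13}e_{23} + d_{13}f_{23}$
021:  $b_{12}f_{23} + 2f_{12}b_{23} + 2b_{13}c_{23} + f_{13}f_{23}$
102:  $c_{12}d_{23} + e_{12}f_{23} + c_{13}e_{23} + 2e_{13}c_{23}$
012:  $2c_{12}b_{23} + f_{12}f_{23} + c_{13}f_{23} + 2f_{13}c_{23}$
111:  $d_{12}f_{23} + 2e_{12}b_{23} + f_{12}d_{23} + 2d_{13}c_{23} + e_{13}f_{23} + f_{13}e_{23}$

$\mathcal{D}_2 w_{12}$
300, 030, 003:  $-2a_{12}^2 + a_{23}e_{12}$,  $-b_{12}d_{12} + f_{12}b_{23}$,  $-c_{12}e_{12} + 2c_{23}c_{12}$
210:  $-3a_{12}d_{12} + a_{23}f_{12} + e_{12}d_{23}$
201:  $-3a_{12}e_{12} + 2a_{23}c_{12} + e_{23}e_{12}$
120:  $-2a_{12}b_{12} - d_{12}^2 + b_{23}e_{12} + f_{12}d_{23}$
021:  $-b_{12}e_{12} - d_{12}f_{12} + 2c_{12}b_{23} + f_{12}f_{23}$
102:  $-2c_{12}a_{12} - e_{12}^2 + c_{23}e_{12} + 2e_{23}c_{12}$
012:  $-c_{12}d_{12} - e_{12}f_{12} + c_{23}f_{12} + 2c_{12}f_{23}$
111:  $-2e_{12}d_{12} - 2a_{12}f_{12} + 2c_{12}d_{23} + e_{23}f_{12} + e_{12}f_{23}$

$\mathcal{D}_2 w_{13}$
300, 030, 003:  $-2a_{12}a_{13} + a_{23}e_{13}$,  $-b_{12}d_{13} + b_{23}f_{13}$,  $2c_{13}c_{23} - e_{13}c_{12}$
210:  $-a_{12}d_{13} - 2d_{12}a_{13} + a_{23}f_{13} + d_{23}e_{13}$
201:  $e_{13}e_{23} - a_{12}e_{13} - 2a_{13}e_{12} + 2a_{23}c_{13}$
120:  $-2b_{12}a_{13} - d_{12}d_{13} + b_{23}e_{13} + d_{23}f_{13}$
021:  $f_{13}f_{23} - b_{12}e_{13} - d_{13}f_{12} + 2b_{23}c_{13}$
102:  $2e_{23}c_{13} + e_{13}c_{23} - 2a_{13}c_{12} - e_{13}e_{12}$
012:  $2c_{13}f_{23} + f_{13}c_{23} - d_{13}c_{12} - e_{13}f_{12}$
111:  $e_{13}f_{23} + f_{13}e_{23} - d_{12}e_{13} - d_{13}e_{12} - 2a_{13}f_{12} + 2d_{23}c_{13}$

$\mathcal{D}_2 w_{23}$
300, 030, 003:  $-2a_{12}a_{23} + a_{23}e_{23}$,  $-b_{12}d_{23} + b_{23}f_{23}$,  $-e_{23}c_{12} + 2c_{23}^2$
210:  $-a_{12}d_{23} - 2d_{12}a_{23} + a_{23}f_{23} + d_{23}e_{23}$
201:  $e_{23}^2 - a_{12}e_{23} - 2a_{23}e_{12} + 2a_{23}c_{23}$
120:  $-2b_{12}a_{23} - d_{12}d_{23} + b_{23}e_{23} + d_{23}f_{23}$
021:  $f_{23}^2 - b_{12}e_{23} - f_{12}d_{23} + 2b_{23}c_{23}$
102:  $-2a_{23}c_{12} - e_{23}e_{12} + 3c_{23}e_{23}$
012:  $-c_{12}d_{23} - e_{23}f_{12} + 3c_{23}f_{23}$
111:  $-d_{12}e_{23} - e_{12}d_{23} - 2a_{23}f_{12} + 2d_{23}c_{23} + 2e_{23}f_{23}$

$\mathcal{D}_3 w_{12}$
300, 030, 003:  $-2a_{12}a_{13} - d_{12}a_{23}$,  $-2b_{12}b_{23} - d_{12}b_{13}$,  $-c_{13}e_{12} - c_{23}f_{12}$
210:  $-d_{12}d_{23} - 2a_{12}d_{13} - d_{12}a_{13} - 2b_{12}a_{23}$
201:  $-2a_{12}e_{13} - a_{13}e_{12} - a_{23}f_{12} - d_{12}e_{23}$
120:  $-2b_{12}d_{23} - d_{12}b_{23} - d_{12}d_{13} - 2a_{12}b_{13}$
021:  $-2b_{12}f_{23} - f_{12}b_{23} - b_{13}e_{12} - d_{12}f_{13}$
102:  $-e_{13}e_{12} - 2c_{13}a_{12} - c_{23}d_{12} - e_{23}f_{12}$
012:  $-f_{12}f_{23} - c_{13}d_{12} - f_{13}e_{12} - 2c_{23}b_{12}$
111:  $-d_{12}f_{23} - f_{12}d_{23} - d_{12}e_{13} - d_{13}e_{12} - 2a_{12}f_{13} - 2b_{12}e_{23}$

$\mathcal{D}_3 w_{13}$
300, 030, 003:  $-2a_{13}^2 - a_{23}d_{13}$,  $-b_{13}d_{13} - 2b_{23}b_{13}$,  $-c_{13}e_{13} - f_{13}c_{23}$
210:  $-3a_{13}d_{13} - 2a_{23}b_{13} - d_{23}d_{13}$
201:  $-3a_{13}e_{13} - a_{23}f_{13} - d_{13}e_{23}$
120:  $-d_{13}^2 - 2b_{13}a_{13} - b_{23}d_{13} - 2d_{23}b_{13}$
021:  $-b_{13}e_{13} - d_{13}f_{13} - b_{23}f_{13} - 2b_{13}f_{23}$
102:  $-e_{13}^2 - 2a_{13}c_{13} - d_{13}c_{23} - f_{13}e_{23}$
012:  $-d_{13}c_{13} - e_{13}f_{13} - 2b_{13}c_{23} - f_{13}f_{23}$
111:  $-2d_{13}e_{13} - 2a_{13}f_{13} - d_{23}f_{13} - 2b_{13}e_{23} - d_{13}f_{23}$

$\mathcal{D}_3 w_{23}$
300, 030, 003:  $-2a_{13}a_{23} - a_{23}d_{23}$,  $-2b_{23}^2 - d_{23}b_{13}$,  $-c_{13}e_{23} - c_{23}f_{23}$
210:  $-d_{23}^2 - a_{13}d_{23} - 2a_{23}d_{13} - 2a_{23}b_{23}$
201:  $-a_{13}e_{23} - 2a_{23}e_{13} - a_{23}f_{23} - d_{23}e_{23}$
120:  $-2a_{23}b_{13} - d_{23}d_{13} - 3b_{23}d_{23}$
021:  $-b_{13}e_{23} - d_{23}f_{13} - 3b_{23}f_{23}$
102:  $-2a_{23}c_{13} - e_{13}e_{23} - d_{23}c_{23} - e_{23}f_{23}$
012:  $-f_{23}^2 - d_{23}c_{13} - f_{13}e_{23} - 2b_{23}c_{23}$
111:  $-d_{13}e_{23} - d_{23}e_{13} - 2a_{23}f_{13} - 2d_{23}f_{23} - 2b_{23}e_{23}$

For example, the expression $c_{12}(f_{12} + 2c_{13})$ is $\mathcal{D}_1 w_{12}\,003$.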

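§2.3 $\Omega$ is constant

Our next task is to show that $\Omega$ is in fact a constant matrix.

Since $E$ is finite dimensional with maximal rank, there exist constants $c_i$ such that $x_i + c_i \in E$ for
$i = 1, 2, 3$. Then we have the following elements in $E$: $D_j = [L_0, x_j + c_j]$, $\omega_{ji} = [D_i, D_j]$, and

$$[[L_0, D_j], D_k] = \sum_{i=1}^3 \Big(\omega_{ji}\omega_{ki} - \frac{\partial \omega_{ji}}{\partial x_k}D_i\Big) - \frac{1}{2}\frac{\partial^2 \eta}{\partial x_j \partial x_k}.$$

As $\omega_{ji}$ is linear, we may infer that

$$\sum_{i=1}^3 \omega_{ji}\omega_{ki} - \frac{1}{2}\frac{\partial^2 \eta}{\partial x_j \partial x_k} \in E.$$

More compactly we write

$$\Omega\Omega^T - \frac{1}{2}\mathrm{Hess}(\eta) \quad \text{or} \quad \Omega^2 + \frac{1}{2}\mathrm{Hess}(\eta),$$

which is a matrix with entries in $E$. Ocone's result says that these entries are at most quadratic polynomials.
So $\eta$ is a polynomial of degree at most 4.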
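
The double bracket used in this subsection can be derived in two steps. The sketch below assumes $L_0 = \frac{1}{2}(\sum_i D_i^2 - \eta)$, $[D_i, D_j] = \omega_{ji}$, and $[\varphi, D_k] = -\partial\varphi/\partial x_k$ for a function $\varphi$; the sums run over $i = 1, 2, 3$.

```latex
\begin{align*}
[L_0, D_j]
 &= \tfrac{1}{2}\sum_i \big(D_i\,\omega_{ji} + \omega_{ji}\,D_i\big)
    + \tfrac{1}{2}\frac{\partial\eta}{\partial x_j}
  = \sum_i \omega_{ji} D_i
    + \tfrac{1}{2}\sum_i \frac{\partial\omega_{ji}}{\partial x_i}
    + \tfrac{1}{2}\frac{\partial\eta}{\partial x_j}, \\[4pt]
[[L_0, D_j], D_k]
 &= \sum_i \Big(\omega_{ji}\,\omega_{ki}
      - \frac{\partial\omega_{ji}}{\partial x_k}\,D_i\Big)
    - \tfrac{1}{2}\sum_i \frac{\partial^2\omega_{ji}}{\partial x_i\,\partial x_k}
    - \tfrac{1}{2}\frac{\partial^2\eta}{\partial x_j\,\partial x_k}.
\end{align*}
```

When the $\omega_{ji}$ are linear, the second derivatives of $\omega_{ji}$ vanish and the coefficients $\partial\omega_{ji}/\partial x_k$ are constants, so the terms $(\partial\omega_{ji}/\partial x_k)D_i$ already lie in $E$. What remains is $\sum_i \omega_{ji}\omega_{ki} - \frac{1}{2}\partial^2\eta/\partial x_j\partial x_k$, the $(j,k)$ entry of $\Omega\Omega^T - \frac{1}{2}\mathrm{Hess}(\eta)$; since $\Omega$ is skew-symmetric, this equals $-(\Omega^2 + \frac{1}{2}\mathrm{Hess}(\eta))$.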

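If $E$ possesses a nontrivial quadratic polynomial, the argument is very involved and we will not discuss it
here. We consider the case that $E$ has at most linear polynomials.

Write $P_i$ = {polynomials with degree at most $i$}.

Let $\Omega = Ax_1 + Bx_2 + Cx_3$ (mod $P_0$), where $A = (a_{ij})_{3 \times 3}$, $B = (b_{ij})_{3 \times 3}$, $C = (c_{ij})_{3 \times 3}$ are skew-symmetric
matrices (from here on $a_{ij}$, $b_{ij}$, $c_{ij}$ denote the entries of these matrices, not the coefficients of §2.2). We make use of
$\Omega^2 + \frac{1}{2}\mathrm{Hess}(\eta) = 0$ mod $P_1$ to infer that $A = B = C = (0)_{3 \times 3}$ as follows. Let

$$H \equiv \Omega^2 = H_{11}x_1^2 + H_{22}x_2^2 + H_{33}x_3^2 + H_{12}x_1x_2 + H_{13}x_1x_3 + H_{23}x_2x_3$$

$$= A^2x_1^2 + B^2x_2^2 + C^2x_3^2 + (AB + BA)x_1x_2 + (AC + CA)x_1x_3 + (BC + CB)x_2x_3$$

We consider terms in $\eta$ and relationships derived from $\Omega^2 + \frac{1}{2}\mathrm{Hess}(\eta) = 0$ mod $P_1$ in terms of entries in the $H_{ij}$
matrices. Coefficient of $x_1^2x_2^2$ in $-\eta$ = $H_{11}[2,2] = H_{22}[1,1] = \frac{1}{2}H_{12}[1,2]$. Similarly, $H_{11}[3,3] = H_{33}[1,1] =
\frac{1}{2}H_{13}[1,3]$ ($H_{ij}[p,q]$ means the $(p,q)$-entry of matrix $H_{ij}$). We have:

(2.2)  $$a_{12}^2 + a_{23}^2 = b_{12}^2 + b_{13}^2 = \frac{1}{2}(a_{13}b_{23} + a_{23}b_{13})$$

(2.3)  $$a_{13}^2 + a_{23}^2 = c_{12}^2 + c_{13}^2 = -\frac{1}{2}(a_{12}c_{23} + a_{23}c_{12})$$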


Suppose $A$, $B$, $C$ are not all zero matrices; without loss of generality we may assume that $A \ne 0_{3 \times 3}$.
Consider the terms $x_1^2 \sum_{j,k \ne 1} k_{jk}x_jx_k$ in $\eta$. There exists an orthogonal transformation $R$ leaving $x_1$ fixed and
changing $\sum_{j,k \ne 1} k_{jk}x_jx_k$ to the canonical form $k_2\tilde{x}_2^2 + k_3\tilde{x}_3^2$, where $\tilde{x} = Rx$. So without loss of generality we may
assume from the beginning that $\eta$ does not contain the term $x_1^2x_2x_3$, which implies that $H_{11}[2,3] = 0$. Hence

$$a_{12}a_{13} = 0.$$

Suppose $a_{13} = 0$. (2.2) implies

$$a_{12}^2 + b_{12}^2 + b_{13}^2 + a_{23}^2 = a_{23}b_{13} \le \frac{1}{2}(a_{23}^2 + b_{13}^2)$$

$$\Rightarrow 2b_{12}^2 + b_{13}^2 + 2a_{12}^2 + a_{23}^2 \le 0$$

$$\Rightarrow a_{12} = a_{23} = 0$$

So $A \equiv 0_{3 \times 3}$. Contradiction. Similarly, $a_{12} = 0$ will lead to a contradiction also. Hence $A$, $B$, $C$ are
simultaneously zero matrices.


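For reference we list out the $H_{ij}$ matrices below:

$$H_{11} = -\begin{pmatrix} a_{12}^2 + a_{13}^2 & a_{13}a_{23} & -a_{12}a_{23} \\ a_{13}a_{23} & a_{12}^2 + a_{23}^2 & a_{12}a_{13} \\ -a_{12}a_{23} & a_{12}a_{13} & a_{13}^2 + a_{23}^2 \end{pmatrix}$$

$$H_{22} = -\begin{pmatrix} b_{12}^2 + b_{13}^2 & b_{13}b_{23} & -b_{12}b_{23} \\ b_{13}b_{23} & b_{12}^2 + b_{23}^2 & b_{12}b_{13} \\ -b_{12}b_{23} & b_{12}b_{13} & b_{13}^2 + b_{23}^2 \end{pmatrix}$$

$$H_{33} = -\begin{pmatrix} c_{12}^2 + c_{13}^2 & c_{13}c_{23} & -c_{12}c_{23} \\ c_{13}c_{23} & c_{12}^2 + c_{23}^2 & c_{12}c_{13} \\ -c_{12}c_{23} & c_{12}c_{13} & c_{13}^2 + c_{23}^2 \end{pmatrix}$$

$$H_{12} = -\begin{pmatrix} 2a_{12}b_{12} + 2a_{13}b_{13} & a_{13}b_{23} + a_{23}b_{13} & -a_{12}b_{23} - a_{23}b_{12} \\ a_{13}b_{23} + a_{23}b_{13} & 2a_{12}b_{12} + 2a_{23}b_{23} & a_{12}b_{13} + a_{13}b_{12} \\ -a_{12}b_{23} - a_{23}b_{12} & a_{12}b_{13} + a_{13}b_{12} & 2a_{13}b_{13} + 2a_{23}b_{23} \end{pmatrix}$$

$$H_{13} = -\begin{pmatrix} 2a_{12}c_{12} + 2a_{13}c_{13} & a_{13}c_{23} + a_{23}c_{13} & -a_{12}c_{23} - a_{23}c_{12} \\ a_{13}c_{23} + a_{23}c_{13} & 2a_{12}c_{12} + 2a_{23}c_{23} & a_{12}c_{13} + a_{13}c_{12} \\ -a_{12}c_{23} - a_{23}c_{12} & a_{12}c_{13} + a_{13}c_{12} & 2a_{13}c_{13} + 2a_{23}c_{23} \end{pmatrix}$$

$$H_{23} = -\begin{pmatrix} 2b_{12}c_{12} + 2b_{13}c_{13} & b_{13}c_{23} + b_{23}c_{13} & -b_{12}c_{23} - b_{23}c_{12} \\ b_{13}c_{23} + b_{23}c_{13} & 2b_{12}c_{12} + 2b_{23}c_{23} & b_{12}c_{13} + b_{13}c_{12} \\ -b_{12}c_{23} - b_{23}c_{12} & b_{12}c_{13} + b_{13}c_{12} & 2b_{13}c_{13} + 2b_{23}c_{23} \end{pmatrix}$$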
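
The entries of the $H_{ij}$ matrices can be spot-checked numerically. The sketch below builds two skew-symmetric $3 \times 3$ matrices from arbitrary integer test values (an assumption for illustration), forms $A^2$ and $AB + BA$ by plain matrix multiplication, and compares them with the closed forms for $H_{11}$ and for the $(1,2)$ entry of $H_{12}$ that enters (2.2). The helper names are hypothetical.

```python
def skew(p, q, r):
    """Skew-symmetric matrix with (1,2), (1,3), (2,3) entries p, q, r."""
    return [[0, p, q], [-p, 0, r], [-q, -r, 0]]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matadd(X, Y):
    return [[X[i][j] + Y[i][j] for j in range(3)] for i in range(3)]

a12, a13, a23 = 2, -3, 5      # arbitrary test values (assumed)
b12, b13, b23 = 7, 1, -4
A, B = skew(a12, a13, a23), skew(b12, b13, b23)

H11 = matmul(A, A)                          # coefficient matrix of x1^2
H12 = matadd(matmul(A, B), matmul(B, A))    # coefficient matrix of x1*x2

# Closed form of H11 = A^2 for a skew-symmetric A.
H11_listed = [[-(a12**2 + a13**2), -a13 * a23, a12 * a23],
              [-a13 * a23, -(a12**2 + a23**2), -a12 * a13],
              [a12 * a23, -a12 * a13, -(a13**2 + a23**2)]]
```

In particular `H11[1][1]` is $-(a_{12}^2 + a_{23}^2)$ and `H12[0][1]` is $-(a_{13}b_{23} + a_{23}b_{13})$, the two quantities related by (2.2).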

Abstract

This paper presents a hybrid technique for optimal discrete-time control of a continuous-time
Turret-gun system. The optimal regional-pole placement technique is utilized to design
a continuous-time linear state-feedback control law. This law is then converted to an
equivalent discrete-time control law for digital implementation aided by the recently
developed digital redesign technique and ideal state reconstructor. A preload compensation
is also added to the digital controller for reducing the effect of the turret motor friction.
A digital simulation of the designed nonlinear Turret-gun system is presented.

1.  Introduction 

The Turret-gun system contains hard nonlinearities such as Coulomb friction, backlash,
and saturation, and it is subject to external disturbances such as the firing disturbances
and base motion, as well as parameter variations caused by thermal effects on gun
barrel and torsional stiffness. The primary objective of the Turret-gun servo control system
is to rapidly stabilize and accurately point the gun to a target position in the presence
of the above nonlinearities, disturbances, and model uncertainties. This requires that the
designed servo control system has the properties of robust, rapid and accurate tracking,
and disturbance and noise rejection. To develop, demonstrate, and validate the advanced
algorithms for the Turret-gun control system, the advanced weapon tracking testbed (ATB-1000)
was designed at the Army Research Development Engineering Center to provide
a realistic simulation of the Turret-gun system in a laboratory environment [1]. The detailed
linear and nonlinear models, developed by Integrated System Inc. (ISI), are used as the
basis for this study.

† Supported by the U.S. Army Research Office under contract DAAL-03-91-G0106 and
the NASA-Johnson Space Center under grant NAG-9-380.

The  primary  objective  of  this  paper  is  to  develop  a  digitally  implementable  optimal 
controller  for  the  continuous-time  nonlinear  Turret-gun  system  so  that  the  responses  of  the 
system  converge  at  an  appropriate  speed  and  any  vibrating  modes  are  well  damped.  The 
hybrid  controller  design  methodology  developed  in  this  paper  can  be  briefly  described  as 
follows. 

A cascaded internal model (which contains an integrator) and a feedback integrator
are inserted into the continuous-time linearized model to reduce nonlinear effects and firing
disturbances. Based on the above integrated linear model, a linear quadratic regulator
approach [2] is utilized to design a robust optimal state-feedback control law. This law
optimally places the closed-loop poles of the above integrated linear model within the common
region of an open sector and to the left hand side of a line parallel to the imaginary
axis in the complex s-plane. Thus, the designed system responses converge at an appropriate
speed and any vibrating modes are well damped. Moreover, for digital control of the
Turret-gun system, recently developed digital redesign techniques [3] are applied for digital
redesign of the inserted continuous-time internal model and the developed continuous-time
optimal state-feedback control law. In order to implement these digitally redesigned controllers
without constructing a digital observer, a new ideal state reconstructor is developed
for the estimation of the discretized states under noise disturbances and nonlinear effects.


2.  Design  formulation 

A linear model obtained from the given nonlinear model of the ATB-1000 test fixture
is described by the following state space equation,

$$\dot{x}_o = A_o x_o + B_o u_o, \qquad y_o = C_o x_o \qquad (1)$$

where $x_o \in \mathbb{R}^{10 \times 1}$, $u_o \in \mathbb{R}$, $y_o \in \mathbb{R}^{6 \times 1}$, and $A_o$, $B_o$, $C_o$ are matrices with appropriate
dimensions. The effect of nonlinearities is simply ignored in this model, but these will be
considered in the formulation of the controller structure. The six outputs of the system are
turret motor yaw, turret motor rate, inertial wheel yaw, strain gauge 1, strain gauge 2, and
tip acceleration, respectively. Although there are other measurable outputs on the ATB-1000
test fixture, they are not practically accessible for control of an actual Turret-gun
system.

In order to reduce the effect of nonlinearities and to increase the robustness of the
designed control system, an internal model which contains an integrator is inserted before
the system. The steady-state error of the barrel tip position caused by nonlinearities is
eliminated by another integrator appended to the third output of $y_o$, i.e., the inertia wheel
yaw. Note that the inertia wheel yaw is used here instead of the barrel tip position due to
the non-availability of a tip sensor. With the addition of these two integrators, the overall
system is augmented to a 12th order system as shown in Fig. 1. The augmented state
space equation is given as

$$\dot{x} = Ax + Bu, \qquad y = Cx \qquad (2)$$
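where

$$x = \begin{bmatrix} x_{i1} \\ x_o \\ x_{i2} \end{bmatrix} \in \mathbb{R}^{12 \times 1}, \qquad u \in \mathbb{R}, \qquad y \in \mathbb{R}$$

$x_{i1}$ = output of the first integrator

$x_{i2}$ = output of the second integrator

$$A = \begin{bmatrix} 0 & 0_{1 \times 10} & 0 \\ B_o & A_o & 0_{10 \times 1} \\ 0 & -C_{o3} & 0 \end{bmatrix} \in \mathbb{R}^{12 \times 12}, \qquad B = \begin{bmatrix} 1 \\ 0_{10 \times 1} \\ 0 \end{bmatrix} \in \mathbb{R}^{12 \times 1}, \qquad C = [\,0 \;\; 0_{1 \times 10} \;\; 1\,] \in \mathbb{R}^{1 \times 12}$$

The $r$ in Fig. 1 is the reference input or the command input for the inertia wheel yaw
(which is identical to the tip position in steady state). Since $r$ is considered as an external
disturbance in the design of the optimal state feedback control law, it does not appear in
(2).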
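
The block structure of the augmented model can be sketched in a few lines. The function below assembles $(A, B, C)$ from a plant $(A_o, B_o, C_o)$ under the state ordering [first integrator, plant states, second integrator]; the function name `augment`, the `out_row` argument, and the small 2-state plant used as input are assumptions for illustration only and are not the ATB-1000 model.

```python
def augment(Ao, Bo, Co, out_row=2):
    """Build (A, B, C) of the augmented system from the plant (Ao, Bo, Co).

    State ordering: [first integrator, plant states, second integrator].
    out_row is the zero-based row of Co fed to the second integrator.
    """
    n = len(Ao)
    N = n + 2
    A = [[0.0] * N for _ in range(N)]
    for i in range(n):
        A[i + 1][0] = Bo[i]                 # first integrator drives the plant
        for j in range(n):
            A[i + 1][j + 1] = Ao[i][j]
    for j in range(n):
        A[N - 1][j + 1] = -Co[out_row][j]   # integrate minus the selected output
    B = [1.0] + [0.0] * (N - 1)             # input enters the first integrator
    C = [0.0] * (N - 1) + [1.0]             # output is the second integrator state
    return A, B, C

# Hypothetical 2-state plant with three outputs (not the ATB-1000 model).
Ao = [[0.0, 1.0], [-4.0, -0.5]]
Bo = [0.0, 2.0]
Co = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
A, B, C = augment(Ao, Bo, Co)
```

For the 10th order plant of (1) the same construction yields a 12th order system with two additional eigenvalues at the origin, one for each integrator.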


The eigenvalues of the system in (2) are {0.0, 0.0, 0.0, -4.50, -2.11 ± j27.57, -5.76 ±
j59.61, -2.52 ± j138.43, -5.77 ± j384.88}. Thus, the eigenvalues have a large spread.
Although the optimal regional-pole placement method can be directly used to optimally
place the closed-loop poles in the previously mentioned region, a rather large feedback gain
would be obtained as a result of the large eigenvalue spread. This is undesirable due to the
presence of nonlinearities and unmodelled dynamics. Since the state feedback gain is to be
connected to the observed or reconstructed states, any deviation of the observed or reconstructed
state from the actual state will significantly affect the control effort, which would
saturate the actuator and, even worse, destabilize the closed-loop system. Therefore, it is
advantageous to decompose this large-scale and stiff system into a completely decoupled
multi-time scale structure, so that each subsystem has its own distinct characteristics and
can be designed accordingly.

The multi-stage design algorithm developed by Tsai, et al. [2] is modified and utilized
here to design the system in (2). The design procedures can be described as follows:
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Step  1.  The  eigenvalues  of  the  augmented  system  in  (2)  are  divided  into  three  groups 
that  are  located  in  three  circular  rings,  [0,ri],[rj,r2],[r2,r3],  as  shown  in  Fig.  2,  where 
ri  =  15,  r2  =  100, r3  =  400.  The  first  circular  ring  contains  the  eigenvalues  {0,0,0,  —4.5}; 
the  second  circular  ring  {—2.11  ±  j'27.57,  — 5.76  ±  j59.61};  and  the  third  circular  ring 
{—2.52  ±  j  138.43, —5.77  ±  j384.88}.  A  block  modal  matrix  is  constructed  by  using  the 
matrix  sign  algorithm  [3]  to  decompose  the  original  system  into  three  decoupled  subsystems 
corresponding  to  the  three  groups  of  eigenvalues.  The  modal  matrix  is  given  as 

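$$M_s = [S_3 \;\; S_2 \;\; S_1] \qquad (3)$$

where

$$S_i = \mathrm{ind}\left[\mathrm{sign}^{+}_{(r_{i-1},\,r_i)}(h(A))\right] \in \mathbb{R}^{12\times n_i}, \quad 1 \le i \le 3, \quad n_1 = 4,\; n_2 = 4,\; n_3 = 4,$$

$$\mathrm{sign}^{+}_{(r_{i-1},\,r_i)}(h(A)) = \tfrac{1}{2}\left[\mathrm{sign}_{(r_{i-1})}(h(A)) - \mathrm{sign}_{(r_i)}(h(A))\right],$$

$$h(A) = (A - r_i I_n)(A + r_i I_n)^{-1}.$$

In the above definitions, ind(·) represents the collection of the linearly independent column
vectors of (·), and $r_0 = 0$, $\mathrm{sign}_{(0)}(h(A)) = I_n$. Then

$$A_d = M_s^{-1} A M_s = \mathrm{block\ diag}(A_{d3}, A_{d2}, A_{d1}), \qquad
B_d = M_s^{-1} B = \begin{bmatrix} B_{d3} \\ B_{d2} \\ B_{d1} \end{bmatrix} \qquad (4)$$

where $A_{di} \in \mathbb{R}^{4\times 4}$, $B_{di} \in \mathbb{R}^{4\times 1}$.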
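
As a small illustration of the grouping in Step 1, the following Python sketch sorts the
open-loop eigenvalues into the three rings by modulus. The eigenvalues and radii are the
ones quoted in Step 1; the function name `group_by_ring` and the convention that a value on
a boundary falls into the inner ring are assumptions made here. The sketch does not perform
the matrix sign decomposition itself.

```python
# Ring radii r_0 = 0, r_1 = 15, r_2 = 100, r_3 = 400 from Step 1.
RADII = [0.0, 15.0, 100.0, 400.0]

# Open-loop eigenvalues as listed in Step 1 (complex pairs written out).
EIGENVALUES = [
    0.0, 0.0, 0.0, -4.5,
    complex(-2.11, 27.57), complex(-2.11, -27.57),
    complex(-5.76, 59.61), complex(-5.76, -59.61),
    complex(-2.52, 138.43), complex(-2.52, -138.43),
    complex(-5.77, 384.88), complex(-5.77, -384.88),
]


def group_by_ring(eigenvalues, radii):
    """Return one list per ring [r_{i-1}, r_i], filled by eigenvalue modulus."""
    groups = [[] for _ in range(len(radii) - 1)]
    for lam in eigenvalues:
        for i in range(len(radii) - 1):
            if abs(lam) <= radii[i + 1]:  # first ring whose outer radius covers |lam|
                groups[i].append(lam)
                break
        else:
            raise ValueError(f"eigenvalue {lam} lies outside the outermost ring")
    return groups


groups = group_by_ring(EIGENVALUES, RADII)
print([len(g) for g in groups])  # subsystem orders n_1, n_2, n_3
```

Each ring collects four eigenvalues here, which matches the subsystem orders quoted with (3).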

Step 2: Set $i = 1$, $A = A_d = \mathrm{block\ diag}[A_3, A_2, A_1]$, $B = B_d = [B_3^T, B_2^T, B_1^T]^T$,
$M_1 = I_n$, and the feedback gain $K_c = 0_{m\times n}$.

Step 3: The subsystem considered for design at this stage is $(A_i, B_i)$. Design this subsystem
by using the optimal pole placement method [2]. Let the immediate optimal feedback gain
be $K_i$ and the corresponding continuous-time closed-loop system be $(A_{ci}, B_i)$.

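Step 4: Update

$$K_c := K_c + [0_{m\times(n-n_i)},\; K_i]\, M_1^{-1} \qquad (5a)$$

$$A := A - B\,[0_{m\times(n-n_i)},\; K_i] =
\begin{bmatrix} \hat{A}_i & W_i \\ 0_{n_i\times(n-n_i)} & A_{ci} \end{bmatrix} \qquad (5b)$$

where $\hat{A}_i$ is the block-diagonal matrix formed by the blocks other than the one being
designed, and $W_i = -\hat{B}_i K_i$, with $\hat{B}_i$ the first $n - n_i$ rows of $B$. Here $n_i$ is the order of
the subsystem that is being designed at this stage. The dimensions of the matrices $\hat{A}_i$ and
$W_i$ are $(n - n_i)\times(n - n_i)$ and $(n - n_i)\times n_i$, respectively.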

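Step 5: Block-diagonalize the partially designed system A and move the last block of A in
(5b) (viz., $A_{ci}$) to the first block, via a transformation matrix $M_2$, which is given as

$$M_2 = \begin{bmatrix} \hat{L}_i & I_{n-n_i} \\ I_{n_i} & 0_{n_i\times(n-n_i)} \end{bmatrix}, \qquad
M_2^{-1} = \begin{bmatrix} 0_{n_i\times(n-n_i)} & I_{n_i} \\ I_{n-n_i} & -\hat{L}_i \end{bmatrix} \qquad (6a)$$

The matrix $\hat{L}_i \in \mathbb{R}^{(n-n_i)\times n_i}$ can be solved from the following Lyapunov equation:

$$\hat{A}_i \hat{L}_i - \hat{L}_i A_{ci} + W_i = 0_{(n-n_i)\times n_i} \qquad (6b)$$

The transformed system is

$$A := M_2^{-1} A M_2 = \begin{bmatrix} A_{ci} & 0_{n_i\times(n-n_i)} \\ 0_{(n-n_i)\times n_i} & \hat{A}_i \end{bmatrix} \qquad (6c)$$

$$B := M_2^{-1} B = \left(B_i^T,\; (\hat{B}_i - \hat{L}_i B_i)^T\right)^T \qquad (6d)$$

where $B = (\hat{B}_i^T, B_i^T)^T$. Accumulate the transformations in $M_1 := M_1 M_2$.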
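
To make the block swap in Step 5 concrete, here is a minimal Python sketch for the smallest
possible case, in which every block is a scalar ($n = 2$, $n_i = 1$). The numerical values and
the helper names (`matmul`, `solve_l_scalar`, `step5_scalar`) are hypothetical; a real design
would solve the matrix equation (6b) with a linear-algebra library instead.

```python
def matmul(x, y):
    """Multiply two small matrices stored as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def solve_l_scalar(a_hat, a_c, w):
    """Solve a_hat*L - L*a_c + w = 0, i.e. (6b) for scalar blocks (a_hat != a_c)."""
    return -w / (a_hat - a_c)


def step5_scalar(a_hat, w, a_c):
    """Apply A := M2^{-1} A M2 to A = [[a_hat, w], [0, a_c]] as in (6a)-(6c)."""
    l_hat = solve_l_scalar(a_hat, a_c, w)
    a = [[a_hat, w], [0.0, a_c]]
    m2 = [[l_hat, 1.0], [1.0, 0.0]]        # M2 from (6a) with n_i = 1
    m2_inv = [[0.0, 1.0], [1.0, -l_hat]]   # its inverse, also from (6a)
    return matmul(m2_inv, matmul(a, m2))


# Hypothetical values: undesigned block -1, coupling 2, closed-loop block -5.
transformed = step5_scalar(-1.0, 2.0, -5.0)
print(transformed)
```

The result is diagonal with the closed-loop block first, which is the structure stated in (6c).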

Step 6: Set $i := i + 1$. If $i > k$ (where k is the number of time scales; in this specific case,
$k = 3$), then go to Step 7; else, go to Step 3.

Step 7: The state-feedback gain for the original system is obtained as $K_c := K_c M_s^{-1}$.

The final result is as follows: with the design parameters $h_1 = 15$, $h_2 = 20$,
$h_3 = 15$, the state-feedback gain is computed as

K_c = [391.0  23.0  144.0  23.0  250.0  11.0  -43.0  23.0  241.0  19.0  328.3  -41071.0]     (7)

The closed-loop eigenvalues are {-25.5, -30.0, -30.0, -30.0, -37.89 ± j27.57,
-66.40 ± j66.40, -27.48 ± j138.43, -24.23 ± j384.88}.
3. Digital implementation
3.1. Digital redesign

In order to implement the designed continuous-time state-feedback control law in the
sampled-data environment of the ATB-1000 test fixture, the digital redesign technique [3]
is used to convert the continuous-time control gain $K_c$ in (7) to an equivalent discrete-time
control gain $K_d$. The state-feedback gain $K_c$ can be converted [3] by

$$K_d = \tfrac{1}{2}\left(I_m + \tfrac{1}{2} K_c H\right)^{-1} K_c\,(I_n + G) \qquad (8)$$

where $K_c$ is given in (7), $G = e^{AT}$, $H = \int_0^T e^{A\tau}\,d\tau\,B$, and $T = 0.0025$ sec. The gain $K_d$ is
computed as

$$K_d = [K_{d1}, K_{do}, K_{d2}] \qquad (9)$$

where

K_d1 = 292.89

K_do = [5.7936, 83.24, 21.30, 164.88, 11.77, -10.91, 12.20, 208.01, -21.46, 1750.4]

K_d2 = -24429.0
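
The conversion in (8) can be tried out on a scalar plant, where every matrix reduces to a
number. The following Python sketch does this under stated assumptions: the plant values
a = -1, b = 1 and the gain k_c = 2 are hypothetical, and H is taken as the input matrix
$\int_0^T e^{a\tau}\,d\tau\, b$. It only illustrates the formula and is not the 12-state design of the paper.

```python
import math


def digital_redesign_scalar(a, b, k_c, t):
    """Scalar version of (8): K_d = 0.5 * (1 + 0.5*k_c*H)^-1 * k_c * (1 + G)."""
    g = math.exp(a * t)                          # G = e^{aT}
    h = b * t if a == 0 else (g - 1.0) / a * b   # H = integral of e^{a tau} d tau, times b
    return 0.5 * k_c * (1.0 + g) / (1.0 + 0.5 * k_c * h), g, h


# Hypothetical scalar plant dx/dt = a*x + b*u with continuous gain k_c.
a, b, k_c, t = -1.0, 1.0, 2.0, 0.0025
k_d, g, h = digital_redesign_scalar(a, b, k_c, t)

discrete_pole = g - h * k_d                    # closed-loop pole of the sampled system
continuous_pole = math.exp((a - b * k_c) * t)  # sampled pole of the continuous design
print(k_d, discrete_pole, continuous_pole)
```

For a short sampling period the discrete gain stays close to the continuous one, and the two
closed-loop poles nearly coincide, which is the sense in which the gains are "equivalent".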

The digital control law is then given as

$$u_d(kT) = -K_d x_d(kT) = -K_{d1} x_{d11}(kT) - K_{do} x_{do}(kT) - K_{d2} x_{d12}(kT) \qquad (10)$$

Furthermore, the inserted integrators can be discretized by using the Tustin approximation:

$$\frac{1}{s} \approx \frac{T}{2}\,\frac{1 + z^{-1}}{1 - z^{-1}}$$
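
Written as a difference equation, the Tustin integrator is
$y_k = y_{k-1} + \tfrac{T}{2}(u_k + u_{k-1})$. A minimal Python sketch follows; the function name and
the zero initial conditions are assumptions made for the example.

```python
def tustin_integrator(samples, t):
    """Integrate a sampled signal with the trapezoidal (Tustin) rule.

    Implements y_k = y_{k-1} + (T/2) * (u_k + u_{k-1}) with y_{-1} = u_{-1} = 0.
    """
    y, prev_u, out = 0.0, 0.0, []
    for u in samples:
        y += 0.5 * t * (u + prev_u)
        prev_u = u
        out.append(y)
    return out


# Unit-step input sampled at T = 0.0025 s, the sampling period used in the text.
response = tustin_integrator([1.0, 1.0, 1.0, 1.0], 0.0025)
print(response)
```

After the first half-step contribution, the output grows by T per sample for a unit-step input.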

3.2. Digital state reconstructor using recursive weighted least-squares (RWLS)
algorithm

$x_{d11}(kT)$ and $x_{d12}(kT)$ in (10) are outputs of the inserted integrators and are directly
accessible. However, the state $x_{do}$ is not directly measurable. Thus, a state reconstructor
is developed in this paper to reconstruct the actual state $x_{do}(kT)$ from the measurable
outputs and inputs of the system in (1).

Let the discretized state-space equation for the system in (1) be

$$x_{do,k+1} = G x_{do,k} + H u_{d,k}, \qquad y_{do,k} = C x_{do,k} \qquad (11)$$

Then, the following RWLS algorithm can be used to estimate the digital state in (11):

$$\hat{x}_{k+1} = G \hat{x}_k + H u_k + K_k (y_k - C \hat{x}_k) \qquad (12a)$$

$$K_k = G P_k C^T (\lambda I_p + C P_k C^T)^{-1} \qquad (12b)$$

$$P_{k+1} = \frac{1}{\lambda}\,(G - K_k C)\, P_k G^T \qquad (12c)$$

where $\hat{x}_k$ is the estimate of the digital state $x_{do}$ at $t = kT$, and $\lambda$ is a forgetting factor.
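
The recursion (12a)-(12c) is easiest to follow for a first-order system, where G, H, C, P and
K are all scalars. The Python sketch below uses hypothetical values (g = 0.9, h = 1, c = 1,
forgetting factor 0.95, noise-free measurements); the function names and the initial values
of the estimate and of P are assumptions for illustration only.

```python
def simulate(g, h, c, x0, inputs):
    """Generate the true states and outputs of x_{k+1} = g*x_k + h*u_k, y_k = c*x_k."""
    states, outputs, x = [x0], [], x0
    for u in inputs:
        outputs.append(c * x)
        x = g * x + h * u
        states.append(x)
    return states, outputs


def rwls_scalar(g, h, c, lam, inputs, outputs, x_hat0=0.0, p0=1.0):
    """Scalar form of the RWLS state reconstructor (12a)-(12c)."""
    x_hat, p, estimates = x_hat0, p0, [x_hat0]
    for u, y in zip(inputs, outputs):
        gain = g * p * c / (lam + c * p * c)               # (12b)
        x_hat = g * x_hat + h * u + gain * (y - c * x_hat)  # (12a)
        p = (g - gain * c) * p * g / lam                    # (12c)
        estimates.append(x_hat)
    return estimates


inputs = [1.0] * 200
states, outputs = simulate(0.9, 1.0, 1.0, 5.0, inputs)       # true initial state 5
estimates = rwls_scalar(0.9, 1.0, 1.0, 0.95, inputs, outputs)  # estimate starts at 0
print(abs(states[-1] - estimates[-1]))
```

In this noise-free example the estimation error shrinks at every step, so the reconstructed
state converges to the true one even though the initial guess is wrong.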

The algorithm presented here closely resembles the Kalman filtering algorithm.
The RWLS algorithm is simpler in terms of the required knowledge of noise
characteristics, and thus provides a practical alternative to Kalman filtering.

3.3.  Friction  compensation 

It is observed that the Coulomb friction present in the turret motor has a significant
undesirable effect on the overall system performance. The Coulomb friction makes the
system response rather sluggish, and the response takes much longer to settle.
Simply increasing the feedback gain does not solve the problem because of the
limitation of the actuator. In this paper, an inverse nonlinear technique is utilized to deal
with this problem. Specifically, a nonlinear compensation term is added to the input of the
turret motor. This compensation term takes the form $u_{comp}(kT) = f_c\,\mathrm{sign}(y_{do2}(kT))$,
where $f_c$ is the turret motor friction and $y_{do2}(kT)$ is the sampled value of the turret
motor velocity. The effect of this nonlinear compensation is to convert the low-frequency
disturbance caused by the turret motor friction to a high-frequency impulse disturbance.
The converted high-frequency impulse disturbance has less effect on the system due to the
lowpass nature of the designed closed-loop system. The control law in (10) now becomes

$$u_d(kT) = -K_{d1} x_{d11}(kT) - K_{do} x_{do}(kT) - K_{d2} x_{d12}(kT) + u_{comp}(kT) \qquad (13)$$
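
The compensation term is a one-line computation. The Python sketch below shows it together
with the way it enters (13); the function names, the friction level 0.3 and the choice
sign(0) = 0 are assumptions made for the example, not values from the paper.

```python
def sign(value):
    """Return -1, 0 or +1 according to the sign of value (sign(0) = 0 assumed)."""
    return (value > 0) - (value < 0)


def friction_compensation(f_c, velocity):
    """u_comp(kT) = f_c * sign(y_do2(kT)), with velocity the sampled motor velocity."""
    return f_c * sign(velocity)


def control_with_compensation(u_linear, f_c, velocity):
    """Add the compensation term to the linear state-feedback part of (13)."""
    return u_linear + friction_compensation(f_c, velocity)


# Hypothetical numbers: linear control 1.0, friction level 0.3, positive velocity.
print(control_with_compensation(1.0, 0.3, 5.0))
```

The term pushes in the direction of motion, so it offsets the friction torque that opposes it.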
4. Simulation results

The  complete  digitally  implemented  control  system  of  the  ATB-1000  test  fixture  is 
shown  in  Fig.  3.  Nonlinear  simulations  are  carried  out  using  the  MATRIXx  simulation  tools 
and  the  results  are  shown  in  Figs.  4-5.  Nonlinear  simulation  of  the  closed-loop  system  using 
the  control  law  without  the  friction  compensation  is  also  carried  out  and  the  results  are 
shown  in  the  same  figures  for  comparison. 

5.  Conclusions 

A hybrid technique for optimal discrete-time control of a continuous-time turret-gun
system has been proposed in this paper. A continuous-time linear state-feedback control
law has been designed by using the optimal regional-pole placement technique. This law
is then converted to an equivalent discrete-time control law for digital implementation
by means of the digital redesign technique. A recursive weighted least-squares algorithm has
been utilized to estimate the digital state of the turret-gun system. A preload compensation
is also added to the digital controller for reducing the effect of the turret motor
friction. Finally, all these components have been integrated to yield a practically
implementable controller for the ATB-1000 test fixture. The nonlinear digital
simulation of the designed nonlinear turret-gun system has demonstrated the effectiveness
of the proposed techniques.
Fig. 1 Augmented system

Fig. 2 The three circular rings of interest

Fig. 4 Step responses of inertial wheel yaw (curves: with and without friction compensation)

Fig. 5 Control signals applied to turret motor (curves: with and without friction compensation)
ABSTRACT.

The objective of this writing is to give a brief introduction to the subject of wavelet
analysis, and to compare it with the classical subject of Fourier analysis. Spline examples
are considered, and the integral wavelet transform is considered as a bandpass filter with
variable bandwidth. A comparison of some of the existing wavelets is also given. Finally, a
list of applications of wavelets is included. During the hour-long talk, demonstrations of these
applications were shown by taking advantage of the multimedia facilities.

INTRODUCTION. 

While Fourier analysis is a well-established subject within classical analysis and applied
mathematics, the subject of wavelet analysis was born only during the last decade.
The objective of this writing is to give a comparison of these two subjects. A brief account
of the existing wavelets and a list of applications are included in this writing.

1.  FOURIER  SERIES. 

Every $2\pi$-periodic function can be represented by its Fourier series

$$f(x) = \sum_{n=-\infty}^{\infty} c_n\,\psi(nx),$$


*  Research  partially  supported  by  ARO  Contract  DAAL  03-90-G-0091. 
where a single function

$$\psi(x) := e^{ix} = \cos x + i \sin x$$

is used in the series representation. Not only is this representation very useful, the terms in
the series are also very meaningful: namely, each term $c_n \psi(nx)$ indicates the nth mode
of $f(x)$, when $f(x)$ is considered as a "wave". In this regard, only one "basic" function,
$\psi(x) = e^{ix}$, is needed to represent all $2\pi$-periodic functions $f(x)$. This function may be
considered as the analyzing wave.


2. SERIES REPRESENTATIONS FOR NONPERIODIC FUNCTIONS.

For convenience, we only discuss the class $L^2(-\infty, \infty)$ of functions $f(x)$ with "finite
energy", namely:

$$\|f\|_2 := \left(\int_{-\infty}^{\infty} |f(x)|^2\,dx\right)^{1/2} < \infty.$$

In order to imitate the Fourier series, we again look for a single "basic" function (also
denoted by $\psi(x)$) to represent all $f(x) \in L^2(-\infty, \infty)$ in the form of an infinite series as
before.

Assuming that $\psi(x)$ should be "Lipschitz continuous", say, then for $\psi(x) \in L^2(-\infty, \infty)$,
we have

$$\psi(x) \to 0, \quad \text{as } x \to \pm\infty.$$

So, just "dilation" (i.e. $\psi(nx)$) alone cannot do the job, and we need "translation" also.
In addition, in order to group ranges of frequencies (i.e. "frequency bands" or "octaves"),
we consider:

"dilation by powers of 2"

instead of dilation by integers. Also, for simplicity, we only consider translations by
"integers". That is, we will consider

$$\psi_{j,k}(x) := 2^{j/2}\,\psi(2^j x - k), \quad j, k \in Z,$$

where, as usual, Z denotes the set of all integers. The normalization constant $2^{j/2}$ is used
here so that

$$\|\psi_{j,k}\|_2 = \|\psi\|_2, \quad j, k \in Z.$$
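
The effect of the normalization constant can be checked numerically. The Python sketch
below uses the Haar function as a stand-in for $\psi$ (an assumption made only because it is easy
to write down) and a midpoint rule for the integral; the helper names, the integration
interval and the grid size are likewise choices made for this example.

```python
import math


def haar(x):
    """Haar function: +1 on [0, 1/2), -1 on [1/2, 1), 0 elsewhere."""
    if 0.0 <= x < 0.5:
        return 1.0
    if 0.5 <= x < 1.0:
        return -1.0
    return 0.0


def psi_jk(psi, j, k):
    """Return the function psi_{j,k}(x) = 2^{j/2} * psi(2^j * x - k)."""
    scale = 2.0 ** (j / 2.0)
    return lambda x: scale * psi(2.0 ** j * x - k)


def l2_norm(f, lo=-16.0, hi=16.0, n=2 ** 16):
    """Midpoint-rule approximation of the L2 norm of f on [lo, hi]."""
    dx = (hi - lo) / n
    total = sum(f(lo + (i + 0.5) * dx) ** 2 for i in range(n))
    return math.sqrt(total * dx)


norms = {(j, k): l2_norm(psi_jk(haar, j, k)) for (j, k) in [(0, 0), (3, 5), (-2, 1)]}
print(norms)
```

All three norms agree with the norm of the undilated function, as the normalization intends.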


Hence, we are interested in series representations of the type:

$$f(x) = \sum_{j=-\infty}^{\infty} \sum_{k=-\infty}^{\infty} c_{j,k}\,\psi_{j,k}(x)$$

for all $f(x) \in L^2(-\infty, \infty)$. Let the above representation of $f(x)$ be separated into
components:

$$f(x) = \sum_{j=-\infty}^{\infty} g_j(x), \quad \text{with} \quad g_j(x) = \sum_{k=-\infty}^{\infty} c_{j,k}\,\psi_{j,k}(x).$$

We see that $g_j(x)$ represents the component of $f(x)$ in the "$2^j$th frequency band" (or jth
octave). Suppose that the separation

$$f(x) = \sum_{j=-\infty}^{\infty} g_j(x) = \cdots + g_{-1}(x) + g_0(x) + \cdots$$

is "unique" (that is, we have an infinite direct-sum decomposition of any $f(x) \in L^2(-\infty, \infty)$).
Then by setting

$$W_j := \{g_j : f \in L^2(-\infty, \infty)\},$$

we have a direct-sum decomposition of $L^2(-\infty, \infty)$, namely:

$$L^2(-\infty, \infty) = \bigoplus_{j=-\infty}^{\infty} W_j.$$

Next, let us consider the partial direct sums:

$$V_n := \bigoplus_{j=-\infty}^{n-1} W_j.$$

Then we arrive at a nested sequence of approximating subspaces of $L^2(-\infty, \infty)$, namely:

$$\cdots \subset V_{-1} \subset V_0 \subset V_1 \subset \cdots \subset L^2(-\infty, \infty), \quad \text{and} \quad
\mathrm{clos}_{L^2}\Big(\bigcup_{j \in Z} V_j\Big) = L^2(-\infty, \infty).$$

In particular, for each integer n, we also have

$$V_{n+1} = V_n \oplus W_n.$$

That is, the subspaces $W_n$ are complementary subspaces of the nested sequence $\{V_j\}$.

3. EXAMPLE (CARDINAL CUBIC SPLINES).

Let $V_0$ denote the collection of functions $f(x) \in L^2(-\infty, \infty)$ satisfying:

(a) $f(x) \in C^2(-\infty, \infty)$; that is, $f(x)$, $f'(x)$, and $f''(x)$ are continuous for all x, and

(b) the restrictions of $f(x)$ on each of the intervals $[k, k+1]$, $k = 0, \pm 1, \ldots$,
are cubic polynomials.

Then each $f(x) \in V_0$ is called a cardinal cubic spline with knot sequence Z.

For each $j \in Z$, consider

$$V_j = \{f(2^j x) : f(x) \in V_0\}.$$

Then every $f(x) \in V_j$ is a cardinal cubic spline with knots $2^{-j} Z$. Since removing a knot is
equivalent to imposing a third continuous differentiability condition at the knot, we have
a nested sequence of subspaces of $L^2(-\infty, \infty)$, namely:

$$\cdots \subset V_{-1} \subset V_0 \subset V_1 \subset \cdots \subset L^2(-\infty, \infty).$$


From  this  nested  sequence  of  subspaces  Vn,  there  are  many  choices  of  the  complementary 
subspaces  Wn. 

Let  us  choose  those  complementary  subspaces  as  dictated  by  a  given  projection 
operator. 

(i) $L^2$-projection.

Let $P_j : L^2(-\infty, \infty) \to V_j$ be defined by

$$\|f - P_j f\|_2 = \min_{h \in V_j} \|f - h\|_2$$

for all $f \in L^2(-\infty, \infty)$. So, by setting

$$W_j = \{f - P_j f : f \in V_{j+1}\},$$

we have

$$V_{j+1} = V_j \oplus W_j.$$

In fact, this direct sum is an orthogonal sum: $V_j \perp W_j$. Hence,

$$L^2(-\infty, \infty) = \bigoplus_{j=-\infty}^{\infty}{}^{\perp}\; W_j.$$

Now, considering the "spline approximation" problem:

$$f \approx f_j := P_j f \in V_j,$$

we then have

$$g_j = f_{j+1} - f_j \approx f - f = 0$$

for all sufficiently large j, provided that f is sufficiently "smooth". But $g_j \in W_j$. So, the
jth octave of f is $\approx 0$ in very high frequency ranges.

What if f is smooth on some intervals, but not so smooth elsewhere?

Since the cardinal splines are "locally generated" (by translates of a B-spline), we
expect to have

$g_j(x) \approx 0$ where $f(x)$ is "very smooth";

$g_j(x)$ reveals the details of $f(x)$, elsewhere.

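(ii) Interpolation at the knots.

Let $I_j : C \cap L^2(-\infty, \infty) \to V_j$ be defined by

$$(I_j f)\left(\frac{k}{2^j}\right) = f\left(\frac{k}{2^j}\right), \quad k \in Z,$$

for all $f \in C \cap L^2(-\infty, \infty)$.

Again by setting

$$W_j = \{f - I_j f : f \in V_{j+1}\},$$

we have

$$V_{j+1} = V_j \oplus W_j,$$

and hence,

$$C \cap L^2(-\infty, \infty) = \bigoplus_{j=-\infty}^{\infty} W_j.$$

But this direct-sum decomposition is not an orthogonal decomposition.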

It is well known that every cubic spline $f(x) \in V_0$ has the representation

$$f(x) = \sum_{k=-\infty}^{\infty} c_k N_4(x - k),$$

where $\{c_k\}$ is square-summable,

$$\sum_{k} |c_k|^2 < \infty,$$

and $N_4(x)$ is the cubic B-spline. Consider the orthogonal complementary spaces $W_j$; that
is,

$$V_1 = V_0 \oplus W_0$$

defined by using $L^2$-projection. Then for every $g \in W_0$, we also have the representation

$$g(x) = \sum_{k=-\infty}^{\infty} d_k\,\psi_4(x - k),$$

where $\{d_k\}$ is square-summable, and $\psi_4(x)$ is the cubic B-wavelet (see [1] and Figure 2).

While $N_4 \in V_0$ has minimum support in $V_0$, $\psi_4 \in W_0$ also has minimum support in
$W_0$. From $W_0$, we can go to any $W_j$, $j \in Z$, by defining, as before,

$$\psi_{4;j,k}(x) := 2^{j/2}\,\psi_4(2^j x - k).$$

Then every $g_j \in W_j$ has a (unique) representation

$$g_j(x) = \sum_{k=-\infty}^{\infty} d^j_k\,\psi_{4;j,k}(x).$$

Since every $f(x) \in L^2(-\infty, \infty)$ can be separated as a direct sum of $g_j(x) \in W_j$, $j \in Z$, we
have the following cubic B-wavelet series representation of $f(x)$:

$$f(x) = \sum_{j=-\infty}^{\infty} \sum_{k=-\infty}^{\infty} d^j_k\,\psi_{4;j,k}(x).$$


4.  WAVELET  SERIES. 

Let $\psi \in L^2(-\infty, \infty)$ and set

$$\psi_{j,k}(x) := 2^{j/2}\,\psi(2^j x - k), \quad j, k \in Z.$$

Assume that we have a $\psi$ such that every $f(x) \in L^2(-\infty, \infty)$ has the (unique) series
expansion:

$$f(x) = \sum_{j,k \in Z} d^j_k\,\psi_{j,k}(x).$$


Remarks. 

(i) Riesz Basis. For implementation, we would rather work with the coefficient sequence
$\{d^j_k\}$, $j, k \in Z$, instead of $f(x)$. This requires "stability"; that is, the existence of
positive constants A and B (independent of $f(x)$) such that

$$A \sum_{j,k} |d^j_k|^2 \le \|f\|_2^2 \le B \sum_{j,k} |d^j_k|^2.$$

If this stability condition is satisfied, we call $\{\psi_{j,k}\}$ a Riesz basis of $L^2(-\infty, \infty)$.

(ii) For a Fourier series, the Fourier coefficients can be expressed in terms of the
$2\pi$-periodic function by using the orthogonality property of $\{e^{inx}\}$, namely:

$$f(x) = \sum_{n=-\infty}^{\infty} c_n e^{inx}, \qquad c_n = \frac{1}{2\pi} \int_0^{2\pi} f(x)\,e^{-inx}\,dx.$$
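
The coefficient formula can be evaluated numerically with a uniform Riemann sum over one
period, which is exact (up to rounding) for trigonometric polynomials. The Python sketch
below is only an illustration; the function name, the sample count and the test function
$\cos x$ are choices made here.

```python
import cmath
import math


def fourier_coefficient(f, n, samples=1024):
    """Approximate c_n = (1 / 2 pi) * integral over [0, 2 pi] of f(x) e^{-inx} dx."""
    dx = 2.0 * math.pi / samples
    total = sum(f(i * dx) * cmath.exp(-1j * n * i * dx) for i in range(samples))
    return total * dx / (2.0 * math.pi)


# cos x = (e^{ix} + e^{-ix}) / 2, so only c_1 and c_{-1} are non-zero.
coefficients = {n: fourier_coefficient(math.cos, n) for n in (-2, -1, 0, 1, 2)}
print(coefficients)
```

Only the two modes n = 1 and n = -1 carry weight, each with coefficient 1/2.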


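Similarly, in the $L^2(-\infty, \infty)$ setting, if the Riesz basis $\{\psi_{j,k}(x)\}$ is an orthonormal
(o.n.) basis of $L^2(-\infty, \infty)$; that is,

$$\langle \psi_{j,k}, \psi_{\ell,m} \rangle := \int_{-\infty}^{\infty} \psi_{j,k}(x)\,\overline{\psi_{\ell,m}(x)}\,dx = \delta_{j,\ell} \cdot \delta_{k,m}, \quad j, k, \ell, m \in Z,$$

then we also have

$$d^j_k = \int_{-\infty}^{\infty} f(x)\,\overline{\psi_{j,k}(x)}\,dx.$$

In general, if the Riesz basis $\{\psi_{j,k}(x)\}$ is not o.n., then we need the dual (or
bi-orthogonal) basis $\{\psi^{j,k}(x)\}$, defined by

$$\langle \psi_{j,k}, \psi^{\ell,m} \rangle = \delta_{j,\ell}\,\delta_{k,m}, \quad j, k, \ell, m \in Z.$$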


Remark.

Although $\{\psi_{j,k}(x)\}$ is derived from a single function $\psi(x)$, the family $\{\psi^{\ell,m}(x)\}$ may
not come from one single function $\tilde{\psi}$. If it does, then

$$\psi^{\ell,m}(x) = \tilde{\psi}_{\ell,m}(x) := 2^{\ell/2}\,\tilde{\psi}(2^{\ell} x - m),$$

and we call $\tilde{\psi}$ the dual of $\psi$.

In general, if $\psi$ has a dual $\tilde{\psi}$, then the series

$$f(x) = \sum_{j,k} d^j_k\,\psi_{j,k}(x)$$

is called a wavelet series. Hence, if $\{\psi_{j,k}\}$ is an o.n. basis of $L^2(-\infty, \infty)$, then since
$\tilde{\psi} = \psi$, the above series expansion is a wavelet series.

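The importance of the dual $\tilde{\psi}$ is that in the above wavelet series expansion of $f(x)$,
we have

$$d^j_k = (W_{\tilde{\psi}} f)\left(\frac{k}{2^j}, \frac{1}{2^j}\right),$$

where

$$(W_{\psi} f)(b, a) := \frac{1}{\sqrt{a}} \int_{-\infty}^{\infty} f(x)\,\overline{\psi\left(\frac{x - b}{a}\right)}\,dx$$

is called the integral wavelet transform (IWT) of $f(x)$ with respect to the mother (or
basic) wavelet $\psi$. Hence, the wavelet series expansion of $f(x) \in L^2(-\infty, \infty)$ is given by

$$f(x) = \sum_{j,k \in Z} \left\{ (W_{\tilde{\psi}} f)\left(\frac{k}{2^j}, \frac{1}{2^j}\right) \right\} \psi_{j,k}(x).$$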
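
The link between the series coefficients and the IWT can be verified numerically in the
orthonormal case, where the dual equals the wavelet itself. The Python sketch below again
uses the Haar function as an assumed example and a midpoint rule on a dyadic grid; the
function names and grid parameters are illustrative choices.

```python
import math


def haar(x):
    """Haar function: +1 on [0, 1/2), -1 on [1/2, 1), 0 elsewhere."""
    if 0.0 <= x < 0.5:
        return 1.0
    if 0.5 <= x < 1.0:
        return -1.0
    return 0.0


def iwt(f, psi, b, a, lo=-4.0, hi=4.0, n=2 ** 14):
    """Midpoint-rule value of (W_psi f)(b, a); psi is real, so no conjugate is needed."""
    dx = (hi - lo) / n
    total = 0.0
    for i in range(n):
        x = lo + (i + 0.5) * dx
        total += f(x) * psi((x - b) / a)
    return total * dx / math.sqrt(a)


def wavelet_coefficient(f, psi_dual, j, k):
    """d^j_k = (W_{psi_dual} f)(k / 2^j, 1 / 2^j)."""
    return iwt(f, psi_dual, k / 2.0 ** j, 1.0 / 2.0 ** j)


def psi_1_1(x):
    """The basis function psi_{1,1}(x) = sqrt(2) * haar(2x - 1)."""
    return math.sqrt(2.0) * haar(2.0 * x - 1.0)


d_00 = wavelet_coefficient(haar, haar, 0, 0)     # f = psi_{0,0}
d_10 = wavelet_coefficient(haar, haar, 1, 0)     # orthogonal to psi_{1,0}
d_11 = wavelet_coefficient(psi_1_1, haar, 1, 1)  # f = psi_{1,1}
print(d_00, d_10, d_11)
```

Sampling the IWT at the dyadic points $(k/2^j, 1/2^j)$ returns 1 for the matching basis function
and 0 for an orthogonal one, as the coefficient formula predicts.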

We conclude this discussion with the following observation.

Comparison of Fourier analysis with wavelet analysis.

(a) In Fourier analysis, we have two areas:

(i) Fourier series and

(ii) Integral Fourier transform,

and they are not related.

(b) In wavelet analysis, we also have two areas:

(i) Wavelet series (WS) and

(ii) Integral wavelet transform (IWT),

but these two areas are intimately related: the coefficients $d^j_k$ of the WS of f in
terms of a wavelet $\psi$ are the IWT of f at $(k/2^j, 1/2^j)$, using the dual $\tilde{\psi}$ of $\psi$ as the
mother wavelet.

5.  THE  INTEGRAL  WAVELET  TRANSFORM  AND  TIME-FREQUENCY  ANALYSIS. 

The IWT of f,

$$(W_{\psi} f)(b, a) = \frac{1}{\sqrt{a}} \int_{-\infty}^{\infty} f(t)\,\overline{\psi\left(\frac{t - b}{a}\right)}\,dt,$$

may be considered as a "time-windowing process", where $f(t)$ is treated as a continuous-time
signal. This window slides along the time-axis, while the width of the window is
adjusted by the value of the scale $a > 0$. The smaller the value of a, the narrower the
time-window; and consequently, a more accurate time-location is achieved.

By the Parseval identity, we have

$$(W_{\psi} f)(b, a) = \frac{\sqrt{a}}{2\pi} \int_{-\infty}^{\infty} \hat{f}(\omega)\,e^{ib\omega}\,\overline{\eta\left(a\left(\omega - \frac{\omega^*}{a}\right)\right)}\,d\omega,$$

where

$$\eta(\omega) := \hat{\psi}(\omega + \omega^*)$$

and $\omega^*$ is the "center" of $\hat{\psi}(\omega)$. Hence, the IWT also gives localized information on the
spectral behavior $\hat{f}(\omega)$ of the signal $f(t)$. Again, the scale $a > 0$ adjusts the width of the
frequency window.

If the frequency $\omega$ is set to be $c/a$ for some calibration constant $c > 0$, then the
time-scale plane becomes the time-frequency plane, and the "time-frequency window" narrows
at high frequencies and widens at low frequencies.

6.  WAVELET  DECOMPOSITION  AND  RECONSTRUCTION  ALGORITHMS. 


Recall that

$$L^2(-\infty, \infty) \approx V_N = W_{N-1} \oplus \cdots \oplus W_{N-M} \oplus V_{N-M}.$$

Hence, any $f(x) \in L^2(-\infty, \infty)$ can be first approximated by an $f_N(x) \in V_N$ and then
$f_N(x)$ is decomposed into a sum of its "wavelet components" $g_j(x)$ and its "blurred"
version $f_{N-M}(x)$:

$$f \approx f_N = g_{N-1} + \cdots + g_{N-M} + f_{N-M}.$$

Write

$$f_j(x) = \sum_{k=-\infty}^{\infty} c^j_k\,\phi(2^j x - k), \qquad c^j = \{c^j_k\},$$

and

$$g_j(x) = \sum_{k=-\infty}^{\infty} d^j_k\,\psi(2^j x - k), \qquad d^j = \{d^j_k\},$$

where $\phi$ generates $V_0$ (e.g. $\phi = N_4$).

Let $\{a_k, b_k\}$ be the "decomposition sequences" (relating $\phi(2x - \ell)$ with $\phi(x - k)$ and
$\psi(x - k)$), and let $\{p_k, q_k\}$ be the "reconstruction sequences" (relating $\phi(x)$ with $\phi(2x - \ell)$,
and $\psi(x)$ with $\phi(2x - \ell)$, respectively). Then the algorithms can be described as follows:

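(i) Decomposition algorithm

$$c^{j-1}_k = \sum_{\ell} a_{\ell - 2k}\,c^j_{\ell}, \qquad d^{j-1}_k = \sum_{\ell} b_{\ell - 2k}\,c^j_{\ell}$$

(Moving averaging followed by "downsampling")

            d^{N-1}     d^{N-2}             d^{N-M}
           /           /                   /
    c^N --> c^{N-1} --> c^{N-2} --> ... --> c^{N-M}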
(ii) Reconstruction algorithm

$$c^j_k = \sum_{\ell} \left[ p_{k - 2\ell}\,c^{j-1}_{\ell} + q_{k - 2\ell}\,d^{j-1}_{\ell} \right]$$

("Upsampling" followed by moving averaging)

    d^{N-M}       d^{N-M+1}           d^{N-1}
           \               \                 \
    c^{N-M} --> c^{N-M+1} --> ... --> c^{N-1} --> c^N
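
The two algorithms can be sketched in a few lines of Python. The example below assumes
the Haar pair, $\phi$ the indicator of [0, 1) and $\psi(x) = \phi(2x) - \phi(2x - 1)$, purely because its
four sequences are short; the cubic B-spline sequences of [1] are not reproduced here. It
follows the unnormalized convention $f_j(x) = \sum_k c^j_k \phi(2^j x - k)$ used above, and the
function names are hypothetical.

```python
# Two-scale sequences for the Haar pair (illustrative assumption):
A_SEQ = {0: 0.5, 1: 0.5}    # a_n: phi(2x - l) in terms of phi(x - k)
B_SEQ = {0: 0.5, 1: -0.5}   # b_n: phi(2x - l) in terms of psi(x - k)
P_SEQ = {0: 1.0, 1: 1.0}    # p_n: phi(x) = phi(2x) + phi(2x - 1)
Q_SEQ = {0: 1.0, 1: -1.0}   # q_n: psi(x) = phi(2x) - phi(2x - 1)


def decompose(c):
    """One decomposition step: c^j -> (c^{j-1}, d^{j-1}); len(c) must be even."""
    half = len(c) // 2
    c_low = [sum(A_SEQ.get(l - 2 * k, 0.0) * c[l] for l in range(len(c)))
             for k in range(half)]
    d = [sum(B_SEQ.get(l - 2 * k, 0.0) * c[l] for l in range(len(c)))
         for k in range(half)]
    return c_low, d


def reconstruct(c_low, d):
    """One reconstruction step: (c^{j-1}, d^{j-1}) -> c^j."""
    return [sum(P_SEQ.get(k - 2 * l, 0.0) * c_low[l] + Q_SEQ.get(k - 2 * l, 0.0) * d[l]
                for l in range(len(c_low)))
            for k in range(2 * len(c_low))]


c_fine = [4.0, 2.0, 5.0, 7.0]
c_coarse, details = decompose(c_fine)
print(c_coarse, details, reconstruct(c_coarse, details))
```

The coarse sequence holds pairwise averages, the detail sequence holds half-differences, and
one reconstruction step returns the original coefficients. Repeating `decompose` on the coarse
output M times gives the cascade shown in the diagrams.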


7.  INTEGRAL  WAVELET  TRANSFORM  AS  BANDPASS  FILTERING. 

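Let $\psi(x)$ be a mother wavelet. For each (fixed) scale $a > 0$ (that decides the frequency
range in the passband), set

$$h(t) := \frac{1}{\sqrt{a}}\,\overline{\psi\left(-\frac{t}{a}\right)}.$$

Then the IWT of f becomes

$$(W_{\psi} f)(b, a) = \int_{-\infty}^{\infty} h(b - t)\,f(t)\,dt = (h * f)(b).$$

That is, for fixed $a > 0$, $(W_{\psi} f)(\cdot, a)$ is a time-invariant linear filter with transfer function
given by

$$H_a(\omega) = \hat{h}(\omega).$$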


Remarks. 

(i) The bandpass filter has linear phase if and only if $h(t)$ is symmetric; that is, there is
some $t_0$ such that

$$\psi(t_0 + t) = \psi(t_0 - t).$$

(ii) The bandpass filter has generalized linear phase if and only if $h(t)$ is antisymmetric;
that is, there is some $t_0$ such that

$$\psi(t_0 + t) = -\psi(t_0 - t).$$

(iii) Linear-phase (or at least generalized linear-phase) $\psi(t)$ is essential for distortion-free
filtering.


8.  COMPARISON  OF  WAVELETS. 


The Haar wavelet $\psi_H$ does not give good frequency localization. So, in the following,
we will assume that $\psi \ne \psi_H$.

(i) Compactly supported o.n. wavelet $\psi$.

(Note: $\{\psi_{j,k}\}$ is an o.n. basis of $L^2(-\infty, \infty)$.)

• All the decomposition and reconstruction sequences $\{a_n, b_n\}$ and $\{p_n, q_n\}$ are
finite.

• $\psi$ is not symmetric and not antisymmetric.

• Side lobe/main lobe ratio is fairly large.

(See Figure 1.)

(ii) B-wavelets (of cardinal spline functions) $\psi_m$.

• $W_n \perp W_j$, $n \ne j$.

• $\{a_n, b_n\}$ infinite, $\{p_n, q_n\}$ finite.

• $\psi_m(x)$ symmetric for even m, and antisymmetric for odd m (with respect to the
center $\frac{2m-1}{2}$).

• Side lobe/main lobe ratio is smaller for $m > 2$.

(See Figures 2 and 3.)

(iii) Symmetric or antisymmetric compactly supported $\psi$ with finite decomposition
and reconstruction sequences.

• $W_n \not\perp W_j$.

9.  APPLICATIONS. 

Real-time wavelet decomposition algorithms can be implemented in parallel, separating
the signal, image, etc., into "disjoint" frequency bands, with time (and/or space)
localization (of high and low amplitudes) in each band. Furthermore, "wavelet-packet"
tree algorithms can be implemented, even adaptively, to further decompose the components
in the high-frequency ranges. Since the real-time wavelet reconstruction algorithms
are equally efficient, it is easy to imagine that the list of potential applications of wavelets
is endless. We only list a few examples here.

(i)  Analysis  of  transient  signals 

(ii)  Sonar  applications 

(iii)  Echo  detection  and  cancellation 

(iv)  Noise  removal  (e.g.  acoustic  pop  noise) 
(v) Image compression

Demonstrations  of  all  these  items  were  presented  in  the  lecture. 

Reference 

1.  C.  K.  Chui,  An  Introduction  to  Wavelets,  Academic  Press,  Boston,  1992. 
Abstract

We have developed parallel algorithms and data structures for a variety of numeric and
nonnumeric problems and analyzed their performance and scalability on various parallel
architectures. Our analysis sheds light on what problems can be solved cost effectively on large-scale
parallel computers. It also gives us insights into the best possible parallel architectures for
solving various problems of practical interest. This paper presents an overview of our research.
In particular, it summarizes results on the development and analysis of parallel algorithms such as
load balancing of unstructured tree computations, the finite element method, sparse linear system
solvers, the backpropagation neural network learning algorithm and fast Fourier transforms.


1  Introduction. 

This research addresses the problem of exploiting the massive computation power of large-scale
parallel computers. It is possible to construct parallel processors containing tens or hundreds of
thousands of processors. The cost of these machines is comparable to that of large mainframes ($1 to
$20 million), but they offer 100 to 10,000 fold more raw computing power. There are many important
industrial/military problems that can become solvable with the 100 to 10,000 fold increase in
computing power. It is not possible to exploit this massive power until scalable parallel algorithms
are developed for the problems of interest. We are developing parallel algorithms for a variety
of numeric and non-numeric problems and analyzing their performance and scalability on various
parallel architectures. The scalability analysis is important because the hardware technology is
changing rapidly (in terms of number of processors, computation and communication speeds and
network technology) and experimental results for a problem on a specific architecture may become
obsolete with any of these changes. This research will help answer the following questions: (i)
what problems can be solved cost effectively on parallel computers? (ii) what are the best parallel
algorithms and architectures for solving various problems of interest?

We have used the isoefficiency metric to analyze well-known techniques for partitioning finite
element meshes. Our analysis predicts the relative performance of these schemes on different parallel
architectures and helps in determining more scalable schemes. We have analyzed the performance
and scalability of the Preconditioned Conjugate Gradient Algorithm on parallel architectures such
as mesh, hypercube and CM5™¹ for different types of sparse matrices. We have predicted the
relative performance of different mappings and the scalability of different preconditioners on
large-scale parallel computers. The analytical results have been verified through experimental
implementations of these schemes on the CM5. We have developed a new and highly scalable network
partitioning method for mapping the Backpropagation Algorithm onto parallel computers such as
nCUBE2™² and CM5. Even an unoptimized version of our hybrid parallel formulation on a 256-processor
CM5 (without vector units) performs over 60 million weight changes per second (or over 180 million
connections per second) for irregular networks. We have developed highly scalable methods for
load balancing of unstructured tree computations on both MIMD and SIMD machines. We
have studied the scalability and the cost-performance tradeoffs for the FFT algorithm on different
architectures. This analysis highlights the dramatic impact of certain hardware parameters on the
scalability and performance of the FFT algorithm. We have presented new parallel algorithms
for matrix multiplication, and have analyzed the performance and scalability of these and existing
algorithms for a variety of architectures.

*This work was supported by IST/SDIO through the Army Research Office grant # 28408-MA-SDI to the
University of Minnesota and by the Army High Performance Computing Research Center at the University
of Minnesota.

In  subsequent  sections  we  provide  an  extended  summary  of  our  research  results. 


2  Analyzing the scalability of parallel algorithms and architectures.

At the given state of technology, it is possible to construct parallel computers employing hundreds
of thousands of processors. Availability of such systems has fueled interest in investigating the
performance of parallel computers containing a large number of processors. The scalability of a
parallel algorithm on a parallel architecture is a measure of its capability to effectively utilize an
increasing number of processors. Scalability analysis of a parallel algorithm-architecture
combination can be used for a variety of purposes. It may be used to select the best algorithm-architecture
combination for a problem under different constraints on the growth of the problem size and the
number of processors. It may be used to predict the performance of a parallel algorithm and a
parallel architecture for a large number of processors from the known performance on fewer
processors. For a fixed problem size it may be used to determine the optimal number of processors
to be used and the maximum possible speedup that can be obtained. The scalability analysis can
also predict the impact of changing hardware technology on the performance and thus help design
better parallel architectures for solving various problems.

¹CM5 is a trademark of the Thinking Machines Corporation.

²nCUBE2 is a trademark of the nCUBE Corporation.

A number of metrics for scalability analysis have been developed [21]. In [23], we presented
isoefficiency as a metric for characterizing the scalability of parallel algorithm-architecture
combinations. If a parallel algorithm is used to solve a problem instance of a fixed size, then the
efficiency decreases as the number of processors P increases. The reason is that the total overhead
increases with P. For many parallel algorithms, for a fixed P, if the problem size W is increased,
then the efficiency becomes higher (and approaches 1), because the total overhead grows slower
than W. For these parallel algorithms, the efficiency can be maintained at a desired value (between
0 and 1) with increasing number of processors, provided the problem size is also increased. We call
such algorithms scalable parallel algorithms.

Note  that  for  a  given  parallel  algorithm,  for  different  parallel  architectures,  the  problem  size 
may  have  to  increase  at  different  rates  w.r.t.  P  in  order  to  maintain  a  fixed  efficiency.  The  rate 
at  which  W  is  required  to  grow  w.r.t.  P  to  keep  the  efficiency  fixed  is  essentially  what  determines 
the  degree  of  scalability  of  the  parallel  algorithm  for  a  specific  architecture.  For  example,  if  W 
is  required  to  grow  exponentially  w.r.t.  P,  then  the  algorithm-architecture  combination  is  poorly 
scalable.  The  reason  for  this  is  that  in  this  case  it  would  be  difficult  to  obtain  good  speedups  on  the 
architecture  for  a  large  number  of  processors,  unless  the  problem  size  being  solved  is  enormously 
large.  On  the  other  hand,  if  W  needs  to  grow  only  linearly  w.r.t.  P,  then  the  algorithm-architecture 
combination  is  highly  scalable  and  can  easily  deliver  linearly  increasing  speedups  with  increasing 
number of processors for reasonable problem sizes. If W needs to grow as f(P) to maintain an
efficiency E, then f(P) is defined to be the isoefficiency function for efficiency E and the plot
of f(P) w.r.t. P is defined to be the isoefficiency curve for efficiency E. A lower bound on
any isoefficiency function is that asymptotically, it should be at least linear. This follows from the
fact that all problems have a sequential (i.e., non-decomposable) component. Hence any algorithm
which shows a linear isoefficiency on some architecture is optimally scalable on that architecture.
Algorithms with isoefficiencies of O(P log^c P), for small constant c, are also reasonably optimal for
practical purposes.
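
The definition above can be made concrete with a small sketch. It assumes the standard relation E = W / (W + T_o(W, P)), where T_o is the total overhead summed over all processors and W is measured in unit-time operations, so that holding E fixed requires W = (E / (1 - E)) T_o. The overhead model T_o = P log2 P and all function names are illustrative assumptions, not results taken from [23].

```python
import math

def efficiency(w, p, overhead):
    """E = W / (W + T_o(W, P)), with W measured in unit-time operations."""
    return w / (w + overhead(w, p))

def isoefficiency_size(p, e, overhead, iterations=200):
    """Problem size W that keeps efficiency e on p processors.

    Solves W = K * T_o(W, P) with K = e / (1 - e) by fixed-point iteration,
    which is adequate when the overhead grows slower than W.
    """
    k = e / (1.0 - e)
    w = 1.0
    for _ in range(iterations):
        w = k * overhead(w, p)
    return w

def illustrative_overhead(w, p):
    # Hypothetical overhead model: T_o = P * log2(P), independent of W.
    return p * math.log2(p)

if __name__ == "__main__":
    for p in (4, 16, 64, 256, 1024):
        w = isoefficiency_size(p, 0.8, illustrative_overhead)
        print(p, round(w), round(efficiency(w, p, illustrative_overhead), 3))
```

Under this assumed overhead the required problem size grows as P log2 P, which falls in the "reasonably optimal" class mentioned above; an overhead that forced W to grow exponentially in P would mark a poorly scalable combination.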

In [21], we critically assess the state of the art in the theory of scalability analysis, and
motivate further research on the development of new and more comprehensive analytical tools to
study  the  scalability  of  parallel  algorithms  and  architectures.  We  survey  a  number  of  techniques 
and  formalisms  that  have  been  developed  for  studying  the  scalability  issues,  and  discuss  their 
interrelationships.  We  show  some  interesting  relationships  between  the  technique  of  isoefficiency 
analysis  developed  in  [23]  and  many  other  methods  for  scalability  analysis.  We  point  out  some  of 
the  weaknesses  of  the  existing  schemes,  and  discuss  possible  ways  of  extending  them. 

In  [12],  we  study  the  impact  of  parallel  processing  overheads  and  the  degree  of  concurrency  of  a 
parallel  algorithm  on  the  optimal  number  of  processors  to  be  used  when  the  criterion  for  optimality 
is  minimizing  the  parallel  execution  time.  We  also  study  a  more  general  criterion  of  optimality  and 
show  how  operating  at  the  optimal  point  is  equivalent  to  operating  at  a  unique  value  of  efficiency 
which  is  characteristic  of  the  criterion  of  optimality  and  the  properties  of  the  parallel  system  under 
study. We also show how this work generalizes and/or extends earlier results of many
other researchers.
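
As a rough illustration of the minimum-execution-time criterion, the sketch below searches for the processor count that minimizes T_P = (W + T_o) / P. The overhead model T_o = c P log2 P, the constant c, and the function names are assumptions made for this example only; they are not the models analyzed in [12].

```python
import math

def parallel_time(w, p, c=1.0):
    """T_P = (W + T_o) / P with a hypothetical overhead T_o = c * P * log2(P)."""
    return (w + c * p * math.log2(p)) / p

def best_processor_count(w, max_p, c=1.0):
    """Exhaustively search 1..max_p for the count that minimizes T_P."""
    return min(range(1, max_p + 1), key=lambda p: parallel_time(w, p, c))

if __name__ == "__main__":
    w = 1000.0
    p_opt = best_processor_count(w, 4096)
    # Efficiency at which the system operates at the minimum-time point.
    e_opt = w / (p_opt * parallel_time(w, p_opt))
    print(p_opt, round(parallel_time(w, p_opt), 3), round(e_opt, 3))
```

For this particular model, setting the derivative of T_P with respect to P to zero gives P = W ln 2 / c, so the search result can be checked against the closed form. Using more processors than this only increases the execution time.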

3  Scalability  Analysis  of  Load  Balancing  Algorithms. 

Load balancing is perhaps the central aspect of parallel computing. Before a problem can be
executed on a parallel computer, the work to be done has to be partitioned among different processors.
Due to uneven processor utilization, load imbalance can cause poor efficiency. In [20], we have
analyzed the problem of load balancing in multiprocessors for those parallel algorithms that have the
following characteristics.

• The work available at any processor can be partitioned into independent work pieces as long
as it is more than some non-decomposable unit.

• The cost of splitting and transferring work to another processor is not excessive (i.e., the cost
associated with transferring a piece of work is much less than the computation cost associated
with it).

• A reasonable work splitting mechanism is available; i.e., if work w at one processor is
partitioned in 2 parts ψw and (1 − ψ)w, then 1 − α > ψ > α, where α is an arbitrarily small
constant.

•  It  is  not  possible  (or  is  very  difficult)  to  estimate  the  size  of  total  work  at  a  given  processor. 

Although,  in  such  parallel  algorithms,  it  is  easy  to  partition  the  work  into  arbitrarily  many 
parts,  these  parts  can  be  of  widely  differing  sizes.  Hence  after  an  initial  distribution  of  work  among 
P  processors,  some  processors  may  run  out  of  work  much  sooner  than  others;  therefore  a  dynamic 
balancing  of  load  is  needed  to  transfer  work  from  processors  that  have  work  to  the  ones  that  are 
idle.  Since  none  of  the  processors  (that  have  work)  know  how  much  work  they  have,  load  balancing 
schemes which require this knowledge (e.g., [16, 18]) are not applicable. The performance of a load
balancing  scheme  is  dependent  upon  the  degree  of  load  balance  achieved  and  the  overheads  due  to 
load  balancing. 
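
The setting described above can be sketched as a toy simulation. The policy shown (an idle processor polls one randomly chosen processor per time step) is only one possible receiver-initiated scheme and is an assumption of this example, as are the function names and parameters; it is not presented as any specific scheme analyzed in [20].

```python
import random

def split_work(w, alpha, rng):
    """Split w units into (kept, given); the given fraction psi lies in [alpha, 1 - alpha]."""
    psi = rng.uniform(alpha, 1.0 - alpha)
    given = int(psi * w)
    return w - given, given

def simulate(total_work, p, alpha=0.1, seed=0):
    """Receiver-initiated balancing with random polling (one assumed policy).

    Returns (time_steps, work_requests, units_processed).
    """
    rng = random.Random(seed)
    load = [0] * p
    load[0] = total_work          # all work starts on processor 0
    steps = requests = done = 0
    while sum(load) > 0:
        steps += 1
        # Each idle processor polls one random processor and takes a share
        # if the donor holds more than one (non-decomposable) unit.
        for i in range(p):
            if load[i] == 0:
                donor = rng.randrange(p)
                requests += 1
                if load[donor] > 1:
                    load[donor], load[i] = split_work(load[donor], alpha, rng)
        # Every busy processor then completes one unit of work.
        for i in range(p):
            if load[i] > 0:
                load[i] -= 1
                done += 1
    return steps, requests, done

if __name__ == "__main__":
    steps, requests, done = simulate(10000, 16)
    print(steps, requests, done)
```

The number of work requests is a proxy for the load balancing overhead, and the gap between the number of steps and total_work / p reflects the remaining load imbalance; these are the two quantities on which the performance of a scheme depends.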

Work created in the execution of many tree search algorithms used in artificial intelligence
and operations research [22, 27] and many divide-and-conquer algorithms [15] satisfies all the
requirements stated above. As an example, consider the problem of searching a state-space tree in
depth-first  fashion  to  find  a  solution.  The  state  space  tree  can  be  easily  split  up  into  many  parts 
and  each  part  can  be  assigned  to  a  different  processor.  Although  it  is  usually  possible  to  come  up 
with  a  reasonable  work  splitting  scheme  [26],  different  parts  can  be  of  radically  different  sizes,  and 
in  general  there  is  no  way  of  estimating  the  size  of  a  search  tree. 

A  number  of  dynamic  load  balancing  strategies  that  are  applicable  to  problems  with  these 
characteristics  have  been  developed.  Many  of  these  schemes  have  been  experimentally  tested  on 
some  physical  parallel  architectures.  From  these  experimental  results,  it  is  difficult  to  ascertain 
relative  merits  of  different  schemes. 

In [20, 9], we have been able to determine the most scalable load balancing schemes for
different architectures such as hypercube, mesh and network of workstations. For each architecture,
we have established lower bounds on the scalability of any possible load balancing scheme. We
present the scalability analysis of a number of load balancing schemes that have not been analyzed
before. From this we gain valuable insights into which schemes can be expected to perform better
under what problem and architecture characteristics. For each of these architectures, we are able
to determine near optimal load balancing schemes. In particular, some of the algorithms analyzed
here for hypercubes are more scalable than those presented in [23]. Results obtained from
implementation of these schemes in the context of the Tautology Verification problem on the Ncube/2™
multicomputer are used to validate theoretical results for the hypercube architecture.

In [17], we present new methods for load balancing of unstructured tree computations on
large-scale SIMD machines, and analyze the scalability of these and existing schemes. An efficient
formulation of tree search on a SIMD machine comprises two major components: (i) a triggering
mechanism, which determines when the search space redistribution must occur to balance search
space over processors; and (ii) a scheme to redistribute the search space. We have devised a
new redistribution mechanism and a new triggering mechanism. Either of these can be used in
conjunction with triggering and redistribution mechanisms developed by other researchers. We
analyze  the  scalability  of  these  mechanisms,  and  verify  the  results  experimentally.  The  analysis  and 
experiments  show  that  our  new  load  balancing  methods  are  highly  scalable  on  SIMD  architectures. 
In  particular,  their  scalability  is  no  worse  than  that  of  the  best  load  balancing  schemes  on  MIMD 
architectures. 

4  Scalability  analysis  of  parallel  formulations  of  the  Fast  Fourier 
Transform  algorithm. 

Fast  Fourier  Transform  plays  an  important  role  in  several  scientific  and  technical  applications. 
Some  of  the  applications  of  the  FFT  algorithm  include  Time  Series  and  Wave  Analysis,  solving 
Linear  Partial  Differential  Equations,  Convolution,  Digital  Signal  Processing  and  Image  Filtering, 
etc. Hence, there has been a great interest in implementing FFT on parallel computers. In [11], we
analyze the scalability of a commonly used parallel formulation of FFT [30] on mesh and hypercube
connected multicomputers. We also present experimental performance results on a 1024-processor
Ncube/1 multicomputer to support analytical results.

The  scalability  analysis  of  FFT  on  hypercube  provides  several  important  insights.  On  the 
hypercube  architecture,  the  parallel  FFT  algorithm  can  obtain  linearly  increasing  speedup  with 
respect  to  the  number  of  processors  with  only  a  moderate  increase  in  problem  size.  This  is  not 
surprising  in  the  light  of  the  fact  that  the  FFT  computation  maps  naturally  to  the  hypercube 
architecture  [28].  But  there  is  a  limit  on  the  achievable  efficiency  which  is  determined  by  the  ratio 
of  CPU  speed  and  communication  bandwidth  of  the  hypercube  channels.  This  limit  can  be  raised 
by  increasing  the  bandwidth  of  the  communication  channels.  Efficiencies  higher  than  this  limit  can 
be  obtained  only  if  the  problem  size  is  increased  very  rapidly.  The  technology  dependent  features 
such  as  the  communication  bandwidth  determine  an  upper-bound  on  the  overall  performance  that 
one could obtain from a p-processor system for a given problem size. Thus if the processing speed
of  the  CPU  used  in  a  p-processor  hypercube  is  increased  (and  if  these  other  factors  are  not  changed), 
then  the  overall  performance  does  not  increase  beyond  a  point.  An  interesting  insight  is  that  this 
upper-bound  can  be  improved  by  either  increasing  the  problem  size  exponentially  or  by  improving 
the  communication  related  parameters  linearly. 
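
The efficiency limit discussed above can be illustrated with a simple cost model. The sketch assumes that an N-point FFT on P hypercube processors takes t_c (N/P) log2 N + t_s log2 P + t_w (N/P) log2 P time, where t_c is the time per butterfly computation, t_s the message startup time, and t_w the per-word transfer time. This model and these parameter names are assumptions chosen for illustration and may differ from the exact expressions used in [11].

```python
import math

def fft_hypercube_efficiency(n, p, tc=1.0, ts=10.0, tw=2.0):
    """Efficiency of an n-point FFT on p processors under the assumed cost model."""
    serial = tc * n * math.log2(n)
    parallel = (tc * (n / p) * math.log2(n)   # local butterfly computations
                + ts * math.log2(p)           # message startups, one per hypercube dimension
                + tw * (n / p) * math.log2(p))  # data transferred across each dimension
    return serial / (p * parallel)

if __name__ == "__main__":
    p = 1024
    # Efficiency for problem sizes N = P^k; exponents are illustrative.
    for k in (1, 2, 4, 8):
        print(k, round(fft_hypercube_efficiency(p ** k, p), 3))
    # Threshold efficiency t_c / (t_c + t_w) for the default parameters.
    print(round(1.0 / (1.0 + 2.0), 3))
```

In this model the ratio of the t_w term to the useful work is (t_w log2 P) / (t_c log2 N), so efficiencies above t_c / (t_c + t_w) require N to grow as a power of P whose exponent is proportional to t_w / t_c. That matches the observation that improving the communication parameters linearly is as effective as increasing the problem size exponentially.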

From  the  scalability  analysis,  we  found  that  the  FFT  algorithm  cannot  make  efficient  use  of 
large-scale  mesh  architectures  unless  the  communication  bandwidth  is  increased  as  a  function  of 
the number of processors. If the width of inter-processor links is maintained as O(√p), where
p is the number of processors on the mesh, then the scalability can be improved considerably.
Addition of features such as cut-through-routing (also known as worm-hole routing) [3] to the mesh
architecture improves the scalability of several parallel algorithms; e.g., see [25]. But these features
do  not  improve  the  overall  scalability  characteristics  of  the  FFT  algorithm  on  this  architecture. 
We  also  show  that  if  the  cost  of  a  communication  network  is  proportional  to  the  total  number  of 
communication  links,  then  it  is  more  cost-effective  to  implement  the  FFT  algorithm  on  a  hypercube 
rather  than  a  mesh  despite  the  fact  that  large  scale  meshes  are  cheaper  to  construct  than  large 
hypercubes. 

We have used the single dimensional unordered radix-2 FFT algorithm for a major part of the
analysis and for obtaining experimental results. This is the simplest form of FFT and not the most
efficient one. It is shown through similar analysis for multidimensional, ordered and higher radix
algorithms, some of which are more efficient than the simple algorithm given here, that the nature
of the results does not change [11].

5 Scalability analysis of parallel formulations of the Preconditioned Conjugate Gradient method.

In [13], we study performance and scalability of parallel formulations of the Preconditioned
Conjugate Gradient (PCG) algorithm [7] for solving large sparse linear systems of equations of the form
A  x  =  b,  where  A  is  a  symmetric  positive  definite  matrix.  A  linear  system  of  equations  is  often 
preconditioned  to  accelerate  the  rate  of  convergence  of  the  CG  algorithm.  In  this  paper,  the  use 
of  two  such  preconditioning  methods  is  considered  -  the  diagonal  preconditioner  and  that  resulting 
from  the  Incomplete  Cholesky  (IC)  factorization  of  the  matrix  of  coefficients.  Two  different  kinds 
of  matrices  are  considered.  First  the  scalability  of  the  PCG  algorithm  with  penta-diagonal  matrices 
resulting  from  two  dimensional  square  or  rectangular  finite  difference  grids  with  natural  ordering 
of  grid  points  is  analyzed.  Two  commonly  used  schemes  for  mapping  the  data  on  the  processors 
are  compared  and  one  is  shown  to  be  strictly  better  than  the  other  one.  These  results  are  then 
extended  to  the  matrices  resulting  from  three  dimensional  finite  difference  grids.  The  second  type  of 
matrices  that  are  studied  are  randomly  sparse  symmetric  positive  definite  matrices.  The  analytical 
results are then verified through extensive experiments on the CM5 parallel computer. Apart from
the basic questions answered by isoefficiency analysis, this analysis helps in answering a number of
other questions, such as:

• Which feature of the hardware should be improved for maximum returns in terms of
performance per unit cost?

• How does the Incomplete Cholesky (IC) preconditioner compare with a simple diagonal
preconditioner in terms of parallel performance?

• What kind of improvement in scalability can be achieved by re-ordering the sparse matrix?

• Which parts of the algorithm dominate in terms of communication overheads and hence
determine the overall parallel speedup and efficiency?

Although we specifically deal with the Preconditioned CG algorithm only, the analysis
pertaining to the diagonal preconditioner case applies to the non-preconditioned method also. In fact the
results  of  the  entire  paper  can  be  adapted  for  a  number  of  iterative  methods  that  use  matrix-vector 
multiplication  and  vector  inner  product  calculation  as  the  basic  operations  in  each  iteration. 
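
To show where the two basic operations appear, here is a minimal serial sketch of the CG method with a diagonal preconditioner. It follows the textbook form of the algorithm and uses dense lists for brevity; the function name and tolerances are assumptions, and it is not the parallel formulation studied in [13].

```python
def pcg_diagonal(a, b, tol=1e-10, max_iter=1000):
    """Solve a x = b for a symmetric positive definite matrix a (list of rows)."""
    n = len(b)

    def matvec(v):          # matrix-vector multiplication (neighbor communication in parallel)
        return [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]

    def dot(u, v):          # vector inner product (a global reduction in parallel)
        return sum(x * y for x, y in zip(u, v))

    inv_diag = [1.0 / a[i][i] for i in range(n)]   # diagonal preconditioner
    x = [0.0] * n
    r = list(b)
    z = [d * ri for d, ri in zip(inv_diag, r)]
    p = list(z)
    rz = dot(r, z)
    for iteration in range(max_iter):
        if dot(r, r) ** 0.5 < tol:
            return x, iteration
        ap = matvec(p)
        step = rz / dot(p, ap)
        x = [xi + step * pi for xi, pi in zip(x, p)]
        r = [ri - step * api for ri, api in zip(r, ap)]
        z = [d * ri for d, ri in zip(inv_diag, r)]
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter

if __name__ == "__main__":
    n = 6
    # Tridiagonal test matrix with 2 on the diagonal and -1 beside it.
    a = [[2.0 if i == j else -1.0 if abs(i - j) == 1 else 0.0 for j in range(n)]
         for i in range(n)]
    b = [sum(row) for row in a]        # right-hand side whose solution is all ones
    x, iterations = pcg_diagonal(a, b)
    print([round(v, 6) for v in x], iterations)
```

Each iteration performs one matrix-vector product and two inner products. In a parallel setting the inner products require a global reduction across all processors, which is why they can come to dominate the communication overhead.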

It  is  shown  that  for  such  matrices,  the  computation  of  vector  inner  products  dominates  the 
rest  of  the  computation  in  terms  of  communication  overheads.  However,  with  a  suitable  mapping, 
the  parallel  formulation  of  the  CG  algorithm  is  highly  scalable  for  such  matrices  on  a  machine 
like  the  CM5  whose  fast  control  network  practically  eliminates  the  overheads  due  to  inner  product 
computation.  A  method  of  using  the  Incomplete  Cholesky  (IC)  preconditioner  is  presented  that 
leads to a further improvement in scalability on the CM5 by a constant factor. As a result, if
enough processors are used, then a parallel implementation with the IC preconditioner may execute
faster than that with a simple diagonal preconditioner even if the latter executed faster in a serial
implementation. For hepta-diagonal matrices resulting from three dimensional finite difference
grids, the scalability is quite good on a hypercube or the CM5, but not as good on a 2-D mesh
such as the Intel Touchstone machine. In case of a random sparse matrix with a constant number of
non-zero elements in each row, the parallel formulation of the CG method is unscalable on any
parallel architecture. But the parallel system can be rendered scalable either if, after reordering,
the non-zero elements of the N x N matrix can be confined in a band whose width is O(N^y) for any
y < 1, or if the number of non-zero elements per row increases as N^x for any x > 0. The scalability
increases  as  the  number  of  non-zero  elements  per  row  is  increased  and/or  the  width  of  the  band 
containing  these  elements  is  reduced.  For  random  sparse  matrices,  the  scalability  is  asymptotically 
the  same  for  all  architectures. 

6 Scalability analysis of parallel algorithms for matrix multiplication.

Matrix multiplication is widely used in a variety of applications and is often one of the core
components of many scientific computations. Since the matrix multiplication algorithm is highly
computation intensive, there has been a great deal of interest in developing parallel formulations of this
algorithm and testing its performance on various parallel architectures.

Some of the early parallel formulations of matrix multiplication were developed by Cannon [2],
Dekel, Nassimi and Sahni [4], and Fox et al. [5]. Variants and improvements of these algorithms
have been presented in [1, 14]. In particular, Berntsen [1] presents an algorithm which has a strictly
smaller communication overhead than Cannon's algorithm, but has a smaller degree of concurrency.
Ho and Johnsson [14] present another variant of Cannon's algorithm for a hypercube which permits
communication  on  all  channels  simultaneously.  This  algorithm  too,  while  reducing  communication, 
also  reduces  the  degree  of  concurrency. 
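
For readers unfamiliar with the baseline, the following serial sketch simulates the data movement of Cannon's algorithm on a q x q wrap-around grid of processors, each holding one block of A, B and C. It is a textbook-style illustration with hypothetical function names, not one of the variants or the new algorithm discussed in this section.

```python
def matmul(x, y):
    """Plain dense product of two matrices stored as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]

def matadd(x, y):
    return [[u + v for u, v in zip(r, s)] for r, s in zip(x, y)]

def cannon_multiply(a, b, q):
    """Multiply n x n matrices a and b on a simulated q x q processor grid."""
    n = len(a)
    assert n % q == 0
    bs = n // q

    def block(m, i, j):
        return [row[j * bs:(j + 1) * bs] for row in m[i * bs:(i + 1) * bs]]

    # Initial alignment: row i of A is shifted left by i, column j of B up by j,
    # so processor (i, j) starts with blocks A[i][(i+j) mod q] and B[(i+j) mod q][j].
    pa = [[block(a, i, (i + j) % q) for j in range(q)] for i in range(q)]
    pb = [[block(b, (i + j) % q, j) for j in range(q)] for i in range(q)]
    pc = [[[[0] * bs for _ in range(bs)] for _ in range(q)] for _ in range(q)]
    for _ in range(q):
        # Local block multiply-accumulate on every processor.
        for i in range(q):
            for j in range(q):
                pc[i][j] = matadd(pc[i][j], matmul(pa[i][j], pb[i][j]))
        # Shift A blocks one step left and B blocks one step up (with wrap-around).
        pa = [[pa[i][(j + 1) % q] for j in range(q)] for i in range(q)]
        pb = [[pb[(i + 1) % q][j] for j in range(q)] for i in range(q)]
    # Gather the result blocks into one matrix.
    c = [[0] * n for _ in range(n)]
    for i in range(q):
        for j in range(q):
            for r in range(bs):
                c[i * bs + r][j * bs:(j + 1) * bs] = pc[i][j][r]
    return c

if __name__ == "__main__":
    a = [[i * 4 + j for j in range(4)] for i in range(4)]
    b = [[(i + 2 * j) % 5 for j in range(4)] for i in range(4)]
    print(cannon_multiply(a, b, 2) == matmul(a, b))
```

Each of the q steps involves only nearest-neighbor shifts, which is why the algorithm suits a wrap-around mesh. The degree of concurrency is bounded by the number of blocks, q squared.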

In  [10],  we  use  the  isoefficiency  metric  [23]  to  analyze  the  scalability  of  a  number  of  parallel 
formulations  of  the  matrix  multiplication  algorithm  for  a  wrap-around  mesh,  hypercube  and  related 
architectures. We analyze the performance of various parallel formulations of the matrix
multiplication algorithm for different matrix sizes and number of processors, and predict the conditions
under which each formulation is better than the others. We present a new parallel algorithm for
the hypercube and related architectures that performs better than any of the schemes described in
the literature so far for a wide range of matrix sizes and number of processors. The superior
performance and the analytical scalability expressions for this algorithm are verified through experiments
on the CM5 parallel computer for up to 512 processors. We show that special hardware permitting
simultaneous communication on all the ports of the processors does not improve the overall
scalability of the matrix multiplication algorithms on a hypercube. We also discuss the dependence
of scalability of parallel matrix multiplication algorithms on technology dependent factors such as
communication and computation speeds and show that under certain conditions, it may be better
to have a parallel computer with k-fold as many processors rather than one with the same number
of processors, each k-fold as fast.

7 Scalability analysis of partitioning techniques for finite element graphs.

Parallel  formulations  of  finite  element  techniques  require  a  mapping  of  the  elements  onto  processors, 
and  the  performance  of  the  overall  formulation  is  a  very  sensitive  function  of  this  mapping.  Any 
mapping  of  elements  to  processors  must  try  to  satisfy  the  following  criteria: 

1.  The  ratio  of  computation  to  communication  associated  with  elements  on  a  processor  should 
be  maximized. 

2.  Locality  of  communication  should  be  preserved. 

3.  The  computational  load  should  be  balanced  to  the  extent  possible. 

These conditions represent the classical communication versus load imbalance tradeoffs. Optimizing
one of these criteria leads to a deterioration with respect to one or more of the other criteria.
The mapping problem in its optimal form is known to be NP-complete even for simple models
of  computation  and  communication  costs  [6].  Hence,  a  number  of  heuristic  approaches  have  been 
presented  to  derive  reasonable  suboptimal  partitions  in  a  reasonable  amount  of  time.  All  of  these 
try  to  balance  the  various  tradeoffs  mentioned.  Most  of  these  schemes  have  been  evaluated  only  on 
specific  parallel  computers  for  certain  problems. 

In  [8],  we  perform  scalability  analysis,  using  the  Isoefficiency  metric  [24,  25],  of  three  partitioning 
algorithms,  namely,  striped  partitioning,  binary  decomposition,  and  scattered  decomposition.  This 
helps  us  establish  the  relative  performance  of  these  schemes  over  a  range  of  processors,  and  the 
effect  of  communication  related  parameters  on  the  performance  of  these  schemes.  We  also  relate  the 
performance  of  each  of  these  schemes  to  the  various  problem  characteristics  such  as  mesh  geometry 
and  density.  The  theoretical  results  are  verified  through  simulations. 

8 Scalable Parallel Formulations of the Backpropagation Algorithm.

The Backpropagation algorithm (BP) [29] is one of the most popular neural network learning
algorithms. It has been used in a large number of applications.

BP can be parallelized either by network partitioning or by pattern partitioning. In network
partitioning schemes, nodes and weights of the neural network are partitioned among different
processors and thus the computations of node activations, node errors and weight changes are
parallelized. In pattern partitioning, individual weight changes due to various learning patterns
are computed concurrently. Pattern partitioning and network partitioning can also be combined to
form hybrid schemes. Several machine architectures including linear arrays, meshes and hypercubes
have been explored to implement parallel BP.

In  [19],  we  present  a  new  technique  for  mapping  the  backpropagation  algorithm  on  hypercubes 
and  related  architectures.  A  key  component  of  this  technique  is  a  network  partitioning  scheme 
which  is  called  checkerboarding.  The  major  communication  intensive  operation  in  the  commonly 
used vertical sectioning scheme is the all-to-all broadcast. In this operation, each of the P processors
has to broadcast its local m units of information to all other processors. This takes Θ(mP) time
on linear array, mesh as well as hypercube. Hence, the vertical sectioning scheme performs equally
well (or equally poorly) on all these architectures.

The checkerboarding scheme allows us to replace the all-to-all broadcast operation by concurrent
non-interfering single-source broadcasts, which are much faster on hypercubes. Furthermore, our
method can use a larger number of processors without incurring higher communication costs than
the vertical sectioning scheme, and shows strictly better performance on hypercubes with more
than HS processors. Our scheme also performs better than the pattern partitioning scheme for a large
class of problems. In addition, our scheme can be combined with the pattern partitioning scheme to
form a hybrid scheme which performs better than either one for a wider variety of cases. Although
the scheme is natural for the hypercube architecture (e.g., nCUBE, Intel iPSC™), it is equally
suitable for Fat-tree based architectures such as CM5. Our scheme is applicable only to fully
connected networks; i.e., each node in a layer (except the output layer) is connected to all nodes in
the  next  layer.  The  number  of  nodes  in  each  layer  can  be  the  same  (uniform  network)  or  different 
(non-uniform  network). 
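
The communication pattern described above can be sketched for a single fully connected layer. The details of the scheme in [19] are not reproduced here; the sketch assumes the usual two-dimensional block (checkerboard) partitioning of a layer's weight matrix over a q x q processor grid, and the function name and data layout are illustrative assumptions.

```python
def checkerboard_matvec(weights, x, q):
    """Compute y = W x with W split into q x q blocks, one per processor (i, j)."""
    n = len(x)
    assert n % q == 0 and len(weights) == n
    bs = n // q
    # Step 1: the owner of piece j of x broadcasts it down grid column j.
    # The q broadcasts involve disjoint sets of processors, so they do not interfere.
    x_piece = [x[j * bs:(j + 1) * bs] for j in range(q)]
    # Step 2: every processor multiplies its local weight block by its piece of x.
    partial = [[[sum(weights[i * bs + r][j * bs + c] * x_piece[j][c] for c in range(bs))
                 for r in range(bs)]
                for j in range(q)]
               for i in range(q)]
    # Step 3: partial sums are added along each grid row i to form piece i of y.
    y = []
    for i in range(q):
        for r in range(bs):
            y.append(sum(partial[i][j][r] for j in range(q)))
    return y

if __name__ == "__main__":
    w = [[(3 * i + j) % 7 for j in range(4)] for i in range(4)]
    x = [1, 2, 3, 4]
    direct = [sum(w[i][j] * x[j] for j in range(4)) for i in range(4)]
    print(checkerboard_matvec(w, x, 2) == direct)
```

In contrast to a one-dimensional partitioning, where every processor needs the whole activation vector, each processor here needs only the piece of the vector belonging to its grid column. This is the reason a single-source broadcast per column can take the place of the all-to-all broadcast.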

Experimental results on nCUBE2 and CM5 show that our scheme performs better than the other
schemes for both uniform and non-uniform networks. Furthermore, it provides very high overall
performance on existing commercial parallel computers. For example, an unoptimized version of
our hybrid parallel formulation on a 256 processor CM5 (without vector units) performs over 50
million weight changes per second (or over 160 million connections per second) for non-uniform
networks. With the upgrade of processors on the CM5, this performance is expected to improve
by 1 or 2 orders of magnitude.

9  Concluding  Remarks. 

Development of efficient parallel algorithms and the understanding of their scalability for the
problems mentioned above are very useful for both military and industrial applications. Finite element
and finite difference methods are widely used in modeling fluid flows, fluid dynamics and many
other applications. Unstructured tree search algorithms such as Branch and Bound are used to
solve combinatorial optimization problems such as resource allocation, logistics and transportation
problems, etc. Dynamic load balancing techniques play a very important role in various
problems including discrete event modeling such as battlefield simulations. Neural Network learning
algorithms such as Backpropagation are important components of target recognition systems. The
FFT algorithm is an integral part of many applications that involve signal processing. Our research
has led to a better understanding of the relative merits of various parallel formulations of different
algorithms and development of new and more scalable formulations for many of these problems.

Richard A. Weiss

U.S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT.  The  basic  equations  of  kinematics  and  dynamics  are  derived  for 
particles  moving  in  space  and  time  with  broken  internal  symmetries.  Within  this 
formalism the space and time coordinates, velocities and accelerations of
particles  must  be  described  as  complex  numbers  having  magnitudes  and  phase  angles 
which  can  be  variables.  Incoherent  spacetime  is  associated  with  changes  in  the 
magnitudes  of  the  space  and  time  coordinates  while  coherent  spacetime  is 
associated  with  the  rotation  of  the  coordinates  in  an  internal  space.  Newton's 
law  of  dynamics  is  formulated  for  broken  symmetry  spacetime  in  four  ways  which 
allow  the  description  of  four  limiting  kinematic  states  of  motion:  Incoherent 
space  and  incoherent  time,  coherent  space  and  incoherent  time,  incoherent  space 
and  coherent  time,  and  finally  coherent  space  and  coherent  time.  The 
relationship  between  the  measured  velocity  and  acceleration  of  particles  and  the 
conventionally  calculated  values  of  velocity  and  acceleration  is  determined  for 
spacetime  with  broken  internal  symmetries.  An  application  to  the  internal  space 
and  time  motions  of  the  harmonic  oscillator  is  presented  as  an  elementary  example 
of  the  theoretical  formalism. 

1. INTRODUCTION. It has been suggested that the thermodynamic functions
such as pressure, internal energy and entropy are skewed in an internal space,
and must be represented by complex numbers with internal phase angles.¹ This
conclusion follows from the assumption of the validity of a relativistic trace
equation which introduces the effects of the Gruneisen function and the bulk
modulus on the relativistic state equation for a material system.¹,² The broken
symmetry nature of the pressure combined with Euler's equations of motion for
fluids suggests that the space and time coordinates within matter have internal
phase angles and must be represented as complex numbers in an internal space.¹
It  then  follows  naturally  that  the  kinematic  quantities  such  as  particle 
velocity,  momentum,  acceleration  and  energy  also  have  internal  phase  angles. 
Dynamical  quantities  such  as  force,  Lagrangian,  Hamiltonian  and  action  variables 
must  also  have  broken  internal  symmetries  and  be  represented  by  complex  numbers 
in  an  internal  space.  In  this  way  the  broken  symmetries  of  matter  and  spacetime 
at  a  macroscopic  scale  of  pressure  and  energy  density  require  the  microscopic 
formalism  of  particle  dynamics  to  also  have  internal  phase  angles  for  the 
dynamical  variables.  The  reverse  argument  can  also  be  made  that  the  broken 
symmetry  of  the  microscopic  kinematic  and  dynamical  variables  leads  to  internal 
phase  angles  being  associated  with  pressure  and  internal  energy.  The  internal 
structure  of  space  and  time  affects  the  measured  kinematic  and  dynamical 
quantities  of  mechanics.  In  this  way  mechanics,  the  oldest  discipline  of 
physics,  can  be  related  to  the  internal  structure  of  space  and  time  which  on  a 
macroscopic  scale  is  determined  mainly  by  gravity.  On  a  microscopic  scale  the 
possible  internal  motions  of  space  and  time  can  be  related  to  the  structure  of 
matter. For instance, in high-Tc superconducting materials it is thought that
both space and time are coherent and that the Cooper electron pairs move about
each other by internal spacetime motions. In this way the factor $6/\pi$ enters the
calculation of the normalized superconducting energy gap and gives rise to its
possible high values compared to the corresponding values predicted by standard
BCS superconductivity theory.3

The space and time coordinates must be written as complex numbers in internal
space as follows1

    $\bar{\nu} = \nu \exp(j\theta_\nu)$    (1)

    $\bar{t} = t \exp(j\theta_t^\nu)$    (2)

where $\nu = x, y, z$ for cartesian coordinates; $r, \phi, z$ for cylindrical polar
coordinates; and $\rho, \phi, \psi$ for spherical polar coordinates. Strictly speaking the
internal phase angle of time $\theta_t^\nu$ is associated with each space coordinate so that
for the three elementary coordinate systems the internal phase angles of time
are:

    $\theta_t^x,\ \theta_t^y,\ \theta_t^z$
    $\theta_t^r,\ \theta_t^\phi,\ \theta_t^z$
    $\theta_t^\rho,\ \theta_t^\phi,\ \theta_t^\psi$    (3)

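The following quantities often appear in the calculations involving partially
coherent broken symmetry space and time3

    $\tan\beta_{\nu\nu} = \nu\,\partial\theta_\nu/\partial\nu$    (4)

    $\tan\beta_{tt}^\nu = t\,\partial\theta_t^\nu/\partial t$    (5)

as for example in the following differentials

    $d\bar{\nu} = \sec\beta_{\nu\nu}\,d\nu\,\exp[j(\theta_\nu + \beta_{\nu\nu})]$
    $\quad = \csc\beta_{\nu\nu}\,\nu\,d\theta_\nu\,\exp[j(\theta_\nu + \beta_{\nu\nu})]$    (6)

    $d\bar{t} = \sec\beta_{tt}^\nu\,dt\,\exp[j(\theta_t^\nu + \beta_{tt}^\nu)]$
    $\quad = \csc\beta_{tt}^\nu\,t\,d\theta_t^\nu\,\exp[j(\theta_t^\nu + \beta_{tt}^\nu)]$    (7)

For incoherent spacetime $\beta_{\nu\nu} = 0$ and $\beta_{tt}^\nu = 0$ while for coherent spacetime
$\beta_{\nu\nu} = \pi/2$ and $\beta_{tt}^\nu = \pi/2$. The measured coordinates of space and time are given by1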

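    $\nu_m = \nu\cos\theta_\nu$    $t_m = t\cos\theta_t^\nu$    (8)

The complex number coordinate speed is obtained from equations (6) and (7) as

    $\bar{v}_\nu = v_\nu \exp(j\theta_{v\nu}) = d\bar{\nu}/d\bar{t}$    (9)

where

    $v_\nu = \cos\beta_{tt}^\nu \sec\beta_{\nu\nu}\,d\nu/dt$    (10)
    $\quad = \cos\beta_{tt}^\nu \csc\beta_{\nu\nu}\,\nu\,d\theta_\nu/dt$    (11)
    $\quad = \sin\beta_{tt}^\nu \sec\beta_{\nu\nu}\,t^{-1}\,d\nu/d\theta_t^\nu$    (12)
    $\quad = \sin\beta_{tt}^\nu \csc\beta_{\nu\nu}\,(\nu/t)\,d\theta_\nu/d\theta_t^\nu$    (13)

where

    $\theta_{v\nu} = \theta_\nu + \beta_{\nu\nu} - \theta_t^\nu - \beta_{tt}^\nu$    (14)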

and where $\nu = x, y, z$; $r, \phi, z$; or $\rho, \phi, \psi$. The particle velocity, momentum and
energy are written as complex numbers in internal space as follows1

    $\bar{v}_\nu = v_\nu \exp(j\theta_{v\nu})$
    $\bar{p}_\nu = p_\nu \exp(j\theta_{p\nu})$
    $\bar{E} = E \exp(j\theta_E)$    (15)

and the corresponding measured quantities are

    $v_{\nu m} = v_\nu \cos\theta_{v\nu}$
    $p_{\nu m} = p_\nu \cos\theta_{p\nu}$
    $E_m = E \cos\theta_E$    (16)


For the case of harmonic motion the space coordinates are complex numbers
in both the internal and the external spaces, and the coordinate magnitudes that
appear in equation (1) are written for cartesian coordinates as

    $x = x_\omega \exp(i\omega t)$
    $y = y_\omega \exp(i\omega t)$
    $z = z_\omega \exp(i\omega t)$    (17)

so that the full expressions for the complex number cartesian coordinates
represented by equation (1) are

    $\bar{x} = x_\omega \exp(i\omega t + j\theta_x)$    (18)

    $\bar{y} = y_\omega \exp(i\omega t + j\theta_y)$    (19)

    $\bar{z} = z_\omega \exp(i\omega t + j\theta_z)$    (20)

or in general for harmonic motion

    $\bar{\nu} = \nu \exp(j\theta_\nu)$    $\nu = \nu_\omega \exp(i\omega t)$    (21)

where $\nu = x, y, z$ or $r, \phi, z$ or $\rho, \phi, \psi$. From equations (4) and (21) for
harmonic motion it follows that

    $\tan\beta_{\nu\nu} = -(i/\omega)\,\partial\theta_\nu/\partial t$    (22)


This suggests that for harmonic motion in external space the angle $\beta_{\nu\nu}$ is an
imaginary number in external space given by

    $\beta_{\nu\nu} = -iQ_\nu$    (23)

where $Q_\nu$ = real number. Then

    $\tan\beta_{\nu\nu} = -i\tanh Q_\nu$    (24)

which gives for harmonic motion in external space

    $\tanh Q_\nu = \omega^{-1}\,\partial\theta_\nu/\partial t$    (25)

    $Q_\nu = \tanh^{-1}(\omega^{-1}\,\partial\theta_\nu/\partial t)$    (26)

    $\beta_{\nu\nu} = -i\tanh^{-1}(\omega^{-1}\,\partial\theta_\nu/\partial t)$    (27)

for $\nu = x, y, z$ or $r, \phi, z$ or $\rho, \phi, \psi$.
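Equations (22) through (27) can be confirmed with a short numerical check. This Python sketch is only an illustration: the helper name `harmonic_beta` and the sample values of the angular frequency and of the phase rate are assumptions, and the check requires the ratio of phase rate to frequency to be smaller than one in magnitude.

```python
import cmath
import math

def harmonic_beta(omega, dtheta_dt):
    """Return (Q, beta) from eqs. (26) and (27); needs |dtheta_dt/omega| < 1."""
    q = math.atanh(dtheta_dt / omega)
    beta = -1j * q          # eq. (23): beta is purely imaginary
    return q, beta

omega = 2.0        # hypothetical angular frequency
dtheta_dt = 0.5    # hypothetical rate of change of the internal phase angle
q, beta = harmonic_beta(omega, dtheta_dt)

lhs = cmath.tan(beta)                 # tan(beta) evaluated for imaginary beta
rhs = -1j * dtheta_dt / omega         # right-hand side of eq. (22)
```

The two sides agree because tan(-iQ) = -i tanh(Q), which is the identity used in equation (24).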

The broken symmetry sine and cosine functions are written as1

    $\sin\bar{\psi} = S_\psi \exp(j\theta_{s\psi})$    $\cos\bar{\psi} = C_\psi \exp(-j\theta_{c\psi})$    (28)

    $S_\psi = [\sin^2(\psi\cos\theta_\psi) + \sinh^2(\psi\sin\theta_\psi)]^{1/2}$    (29)

    $C_\psi = [\cos^2(\psi\cos\theta_\psi) + \sinh^2(\psi\sin\theta_\psi)]^{1/2}$    (30)

    $\tan\theta_{s\psi} = \cot(\psi\cos\theta_\psi)\tanh(\psi\sin\theta_\psi)$    (31)

    $\tan\theta_{c\psi} = \tan(\psi\cos\theta_\psi)\tanh(\psi\sin\theta_\psi)$    (32)

For small values of angle magnitude $\psi$

    $S_\psi \approx \psi$    $\theta_{s\psi} \approx \theta_\psi$    (33)

which are useful expressions for studying the motion of a pendulum.
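The decomposition in equations (28) through (33) is the polar form of the sine and cosine of a complex angle, and it can be verified against a standard complex math library. The following Python sketch is an illustration under that reading; the function name and the test values are assumptions.

```python
import cmath
import math

def broken_symmetry_sin_cos(psi, theta):
    """Magnitudes and phase angles of sin and cos of psi*exp(j*theta)."""
    a = psi * math.cos(theta)   # real part of the complex angle
    b = psi * math.sin(theta)   # imaginary part of the complex angle
    s_mag = math.sqrt(math.sin(a) ** 2 + math.sinh(b) ** 2)   # eq. (29)
    c_mag = math.sqrt(math.cos(a) ** 2 + math.sinh(b) ** 2)   # eq. (30)
    # atan2 keeps the quadrant; the ratios match eqs. (31) and (32).
    th_s = math.atan2(math.cos(a) * math.sinh(b), math.sin(a) * math.cosh(b))
    th_c = math.atan2(math.sin(a) * math.sinh(b), math.cos(a) * math.cosh(b))
    return s_mag, th_s, c_mag, th_c

psi, theta = 0.8, 0.4                      # hypothetical sample values
z = psi * cmath.exp(1j * theta)
s_mag, th_s, c_mag, th_c = broken_symmetry_sin_cos(psi, theta)

# Small-angle limit of eq. (33).
small_s, small_th, _, _ = broken_symmetry_sin_cos(1e-3, theta)
```

Note the sign convention in equation (28): the cosine carries the phase factor exp(-j*theta_c), so `th_c` enters with a minus sign when the complex value is rebuilt.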

This simple introduction to broken spacetime symmetry is sufficient to
proceed to the study of dynamical systems in asymmetric spacetime. This paper
examines the effects of broken spacetime symmetries on the basic laws and
formalisms of kinematics and dynamics. Only an elementary development of the
mechanics of a single particle is given. Specifically, Section 2 considers
kinematics in asymmetric spacetime, and Section 3 examines Newton's laws of motion
in spacetime with broken internal symmetries.

2. KINEMATICS IN ASYMMETRIC SPACETIME. This section develops the basic
expressions for particle speed and acceleration in spacetime with broken
internal symmetries. The concept of motion in totally coherent spacetime is
introduced. The connections are made between the measured kinematical quantities of
partially coherent spacetime and the conventionally calculated kinematical
quantities of incoherent spacetime.

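A. Particle Speed.

The speed of a particle in broken symmetry spacetime is obtained from
equations (9) through (14)

    $\bar{v}_x = v_x \exp(j\theta_{vx}) = d\bar{x}/d\bar{t}$    (34)

where

    $v_x = \cos\beta_{tt}^x \sec\beta_{xx}\,dx/dt$    (35)
    $\quad = \cos\beta_{tt}^x \csc\beta_{xx}\,x\,d\theta_x/dt$    (36)
    $\quad = \sin\beta_{tt}^x \sec\beta_{xx}\,t^{-1}\,dx/d\theta_t^x$    (37)
    $\quad = \sin\beta_{tt}^x \csc\beta_{xx}\,(x/t)\,d\theta_x/d\theta_t^x$    (38)

and

    $\theta_{vx} = \theta_x + \beta_{xx} - \theta_t^x - \beta_{tt}^x$    (39)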

where $\beta_{xx}$ and $\beta_{tt}^x$ are given by equations (4) and (5) respectively. The measured
particle speed is given by equations (16) and (35) through (39) as

    $v_{mx} = v_x \cos\theta_{vx}$    (40)

The single particle momentum is then given by

    $\bar{p}_x = m\bar{v}_x$    $p_x = m v_x$    $\theta_{px} = \theta_{vx}$    (41)

Equations similar to (34) through (39) can be developed for the y and z coordinates.
B. Particle Acceleration.

For a broken symmetry spacetime the particle acceleration is given by
equations (34) through (39) as

    $\bar{a}_x = a_x \exp(j\theta_{ax}) = d\bar{v}_x/d\bar{t} = d^2\bar{x}/d\bar{t}^2$    (42)

From Newton's law of motion written as

    $\bar{F}_x = m\bar{a}_x$    (43)

it follows that

    $F_x = m a_x$    $\theta_{Fx} = \theta_{ax}$    (44)

The measured acceleration is given by

    $a_{mx} = a_x \cos\theta_{ax}$    (45)

The measured force is given by equations (43) and (44) as

    $F_{mx} = F_x \cos\theta_{Fx}$    (46)

Combining equations (44) through (46) gives

    $F_{mx} = m a_{mx}$    (47)

and therefore the measured acceleration is determined from the measured force
by Newton's law of motion. The values of the acceleration magnitude $a_x$ and
acceleration internal phase angle $\theta_{ax}$ will now be calculated for several cases of
interest.
Case a. Incoherent Space and Incoherent Time.

A general expression for the acceleration is developed that can be used
to deduce the limiting case of incoherent space and incoherent time which is
described by

    $\theta_x = 0$    $\beta_{xx} = 0$    $\beta_{tt}^x = 0$    (48)

The appropriate expressions for the acceleration magnitude and internal phase
angle are deduced from equation (42) to be

    $a_x = \sec\beta_{vxvx} \cos\beta_{tt}^x\,dv_x/dt$    (49)

    $\theta_{ax} = \theta_{vx} + \beta_{vxvx} - \theta_t^x - \beta_{tt}^x$    (50)

where

    $\tan\beta_{vxvx} = v_x\,\partial\theta_{vx}/\partial v_x$    (51)

Combining equations (35) and (49) gives

    $a_x = \cos\beta_{tt}^x \sec\beta_{vxvx}\,d/dt(\cos\beta_{tt}^x \sec\beta_{xx}\,dx/dt)$    (52)
    $\quad \approx \cos^2\beta_{tt}^x \sec\beta_{vxvx} \sec\beta_{xx}\,d^2x/dt^2$    (53)
    $\quad \approx \cos^2\beta_{tt}^x \sec^2\beta_{xx}\,d^2x/dt^2$    (54)

Combining equations (39) and (50) gives

    $\theta_{ax} = \theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^x + \beta_{tt}^x)$    (55)

When the conditions in equation (48) are valid the expressions in equations (52)
through (54) reduce to the standard case of incoherent space and incoherent time.


Case b. Coherent Space and Incoherent Time.

General expressions for the acceleration are now deduced which can be used
to make the transition to the case of coherent space and incoherent time which
is described by

    $\beta_{xx} = \pi/2$    $\theta_t^x = 0$    $\beta_{tt}^x = 0$    (56)

The appropriate general expression for the acceleration magnitude and internal
phase angle for this case is obtained from equations (42) and (55) to be

    $a_x = \csc\beta_{vxvx} \cos\beta_{tt}^x\,v_x\,d\theta_{vx}/dt$    (57)

    $\theta_{ax} = \theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^x + \beta_{tt}^x)$    (58)

Combining equations (36) and (57) gives

    $a_x = \csc\beta_{vxvx} \cos^2\beta_{tt}^x \csc\beta_{xx}\,x\,(d\theta_x/dt)(d\theta_{vx}/dt)$    (59)

where from equation (39)

    $d\theta_{vx}/dt = d/dt(\theta_x + \beta_{xx} - \theta_t^x - \beta_{tt}^x)$    (60)

For the case at hand it is convenient to write the acceleration in equation (42)
as

    $\bar{a}_x = a_x \exp(j\theta_{ax}) = a_x^+ \exp(j\theta_{ax}^+) = d\bar{v}_x/d\bar{t} = d^2\bar{x}/d\bar{t}^2$    (61)

where

    $a_x^+ = -a_x$    (62)

    $\theta_{ax}^+ = \theta_{ax} - \pi$    (63)

so that

    $a_x^+ = -\csc\beta_{vxvx} \cos\beta_{tt}^x\,v_x\,d\theta_{vx}/dt$    (64)
    $\quad = -\csc\beta_{vxvx} \cos^2\beta_{tt}^x \csc\beta_{xx}\,x\,(d\theta_x/dt)(d\theta_{vx}/dt)$    (65)

    $\theta_{ax}^+ = \theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^x + \beta_{tt}^x) - \pi$    (66)

which is an equivalent description of the acceleration.


For the special case of coherent space and incoherent time, equations (56),
(65) and (66) give

    $a_x^{ci+} = -\csc\beta_{vxvx}^{ci}\,x\,(d\theta_x/dt)^2$    (67)

    $\theta_{ax}^{ci+} = \theta_x + \beta_{vxvx}^{ci} - \pi/2$    (68)

where from equations (36), (39), (51) and (56) it follows that

    $v_x^{ci} = x\,d\theta_x/dt$    (69)

    $\theta_{vx}^{ci} = \theta_x + \pi/2$    (70)

    $\tan\beta_{vxvx}^{ci} = E_{xt}^{ci}/F_{xt}^{ci}$    (71)

where

    $E_{xt}^{ci} = (d\theta_x/dt)^2$    $E_{xt}^{ci} > 0$    (72)

    $F_{xt}^{ci} = d^2\theta_x/dt^2$    $F_{xt}^{ci} < 0$    (73)

Equation (71) then gives

    $\csc\beta_{vxvx}^{ci} = [(E_{xt}^{ci})^2 + (F_{xt}^{ci})^2]^{1/2}/E_{xt}^{ci}$    (74)

Because $F_{xt}^{ci} \lesssim 0$ it follows from equation (71) that

    $\beta_{vxvx}^{ci} = \pi/2 + \kappa_{xt}$    (75)

where $\kappa_{xt} > 0$ is a small number which is also given by

    $\tan\kappa_{xt} = -F_{xt}^{ci}/E_{xt}^{ci}$    (76)

Combining equations (67), (68), (74) and (75) gives

    $a_x^{ci+} = -x[(E_{xt}^{ci})^2 + (F_{xt}^{ci})^2]^{1/2}$    (77)

    $\theta_{ax}^{ci+} = \theta_x + \kappa_{xt}$    (78)

The angle $\theta_{ax}^{ci+}$ is generally a small number. Clearly the acceleration in this
case is directed opposite to the displacement x.
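The chain from equation (71) to equation (78) can be traced numerically. The Python sketch below is a minimal illustration, assuming E > 0 and F < 0 as stated in equations (72) and (73); the function name and the sample values are hypothetical.

```python
import math

def coherent_space_acceleration(x, dtheta_dt, d2theta_dt2, theta_x):
    """Evaluate eqs. (71) to (78) for coherent space and incoherent time."""
    e = dtheta_dt ** 2               # eq. (72), positive
    f = d2theta_dt2                  # eq. (73), assumed negative
    beta = math.atan2(e, f)          # eq. (71), branch in (pi/2, pi) for f < 0
    kappa = beta - math.pi / 2       # eq. (75)
    a_plus = -x * math.hypot(e, f)   # eq. (77)
    theta_a_plus = theta_x + kappa   # eq. (78)
    return e, f, beta, kappa, a_plus, theta_a_plus

x, theta_x = 1.0, 0.05               # hypothetical displacement and phase angle
e, f, beta, kappa, a_plus, theta_a_plus = coherent_space_acceleration(
    x, dtheta_dt=2.0, d2theta_dt2=-0.1, theta_x=theta_x)

# Eq. (67) evaluated directly, for comparison with eq. (77).
a_plus_eq67 = -x * e / math.sin(beta)
```

With a small negative F the angle `kappa` is small and positive, and `a_plus` is negative for positive x, consistent with an acceleration directed opposite to the displacement.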
Case c. Incoherent Space and Coherent Time.

An expression for the acceleration is now derived from which the limiting
case of incoherent space and coherent time can be obtained. This limiting case
is described by

    $\theta_x = 0$    $\beta_{xx} = 0$    $\beta_{tt}^x = \pi/2$    (79)

The required general expression for the acceleration magnitude and acceleration
phase angle is obtained from equations (42) and (55) as

    $a_x = \sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,dv_x/d\theta_t^x$    (80)

    $\theta_{ax} = \theta_x + \beta_{vxvx} + \beta_{xx} - 2(\theta_t^x + \beta_{tt}^x)$    (81)

Combining equations (37) and (80) gives

    $a_x = \sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,d/d\theta_t^x(\sin\beta_{tt}^x \sec\beta_{xx}\,t^{-1}\,dx/d\theta_t^x)$    (82)
    $\quad \approx \sec\beta_{vxvx} \sin^2\beta_{tt}^x \sec\beta_{xx}\,t^{-2}\,d^2x/d(\theta_t^x)^2$    (83)

For this case it is convenient to rewrite the acceleration equation (42) in the
following form

    $\bar{a}_x = a_x \exp(j\theta_{ax}) = a_x' \exp(j\theta_{ax}') = d\bar{v}_x/d\bar{t} = d^2\bar{x}/d\bar{t}^2$    (84)

where

    $a_x' = -a_x$    (85)

    $\theta_{ax}' = \theta_{ax} + \pi$    (86)

so that an equivalent representation of the acceleration is

    $a_x' = -\sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,dv_x/d\theta_t^x$    (87)
    $\quad = -\sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,d/d\theta_t^x(\sin\beta_{tt}^x \sec\beta_{xx}\,t^{-1}\,dx/d\theta_t^x)$    (88)
    $\quad \approx -\sec\beta_{vxvx} \sin^2\beta_{tt}^x \sec\beta_{xx}\,t^{-2}\,d^2x/d(\theta_t^x)^2$    (89)

    $\theta_{ax}' = \theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^x + \beta_{tt}^x) + \pi$    (90)

In the case of incoherent space and coherent time, equations (79), (88)
and (90) give

    $a_x^{ic\prime} = -\sec\beta_{vxvx}^{ic}\,t^{-2}\,d^2x/d(\theta_t^x)^2$    (91)

    $\theta_{ax}^{ic\prime} = \beta_{vxvx}^{ic} - 2\theta_t^x$    (92)

where $\theta_{ax}^{ic\prime}$ is a small number, and where from equations (37), (39), (51) and (79)
it follows that

    $v_x^{ic} = t^{-1}\,dx/d\theta_t^x$    (93)

    $\theta_{vx}^{ic} = -\theta_t^x - \pi/2$    (94)

    $\tan\beta_{vxvx}^{ic} = E_{xt}^{ic}/F_{xt}^{ic}$    (95)

where

    $E_{xt}^{ic} = -dx/d\theta_t^x$    $E_{xt}^{ic} > 0$    (96)

    $F_{xt}^{ic} = d^2x/d(\theta_t^x)^2$    $F_{xt}^{ic} > 0$    (97)

    $\sec\beta_{vxvx}^{ic} = [(E_{xt}^{ic})^2 + (F_{xt}^{ic})^2]^{1/2}/F_{xt}^{ic}$    (98)

Because $E_{xt}^{ic} \gtrsim 0$ and $F_{xt}^{ic} > 0$ it follows that $\beta_{vxvx}^{ic}$ is a small positive number for
this case. Equations (91) and (98) give

    $a_x^{ic\prime} = -t^{-2}[(E_{xt}^{ic})^2 + (F_{xt}^{ic})^2]^{1/2}$    (99)


Case d. Coherent Space and Coherent Time.

In this section an equation for the acceleration of a particle is obtained
which can be used to attain the limit of coherent space and coherent time which
is defined by

    $\beta_{xx} = \pi/2$    $\beta_{tt}^x = \pi/2$    (100)

The general expression for the magnitude and internal phase angle of the
acceleration is obtained from equations (42) and (55) as

    $a_x = \csc\beta_{vxvx} \sin\beta_{tt}^x\,(v_x/t)\,d\theta_{vx}/d\theta_t^x$    (101)

    $\theta_{ax} = \theta_x + \beta_{vxvx} + \beta_{xx} - 2(\theta_t^x + \beta_{tt}^x)$    (102)

Combining equations (38) and (101) gives

    $a_x = \csc\beta_{vxvx} \sin^2\beta_{tt}^x \csc\beta_{xx}\,(x/t^2)\,(d\theta_x/d\theta_t^x)(d\theta_{vx}/d\theta_t^x)$    (103)

where from equation (39) it follows that

    $d\theta_{vx}/d\theta_t^x = d\theta_x/d\theta_t^x - 1 + d/d\theta_t^x(\beta_{xx} - \beta_{tt}^x)$    (104)

Another expression for the acceleration can be obtained from equations (38) and
(80) which gives

    $a_x = \sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,d/d\theta_t^x(\sin\beta_{tt}^x \csc\beta_{xx}\,(x/t)\,d\theta_x/d\theta_t^x)$    (105)

In the present case it is convenient to write the acceleration in equation (42)
as

    $\bar{a}_x = a_x \exp(j\theta_{ax}) = a_x' \exp(j\theta_{ax}') = d\bar{v}_x/d\bar{t} = d^2\bar{x}/d\bar{t}^2$    (106)

where

    $a_x' = -a_x$    (107)

    $\theta_{ax}' = \theta_{ax} + \pi$    (108)

and an alternative representation of the acceleration is

    $a_x' = -\csc\beta_{vxvx} \sin^2\beta_{tt}^x \csc\beta_{xx}\,(x/t^2)\,(d\theta_x/d\theta_t^x)(d\theta_{vx}/d\theta_t^x)$    (109)
    $\quad = -\sec\beta_{vxvx} \sin\beta_{tt}^x\,t^{-1}\,d/d\theta_t^x(\sin\beta_{tt}^x \csc\beta_{xx}\,(x/t)\,d\theta_x/d\theta_t^x)$    (110)

    $\theta_{ax}' = \theta_{vx} + \beta_{vxvx} - \theta_t^x - \beta_{tt}^x + \pi$
    $\quad = \theta_x + \beta_{vxvx} + \beta_{xx} - 2(\theta_t^x + \beta_{tt}^x) + \pi$    (111)

A comparison of equations (109) and (110) gives

    $\tan\beta_{vxvx} = C_{xt}/D_{xt}$    (112)

    $\csc\beta_{vxvx} = (C_{xt}^2 + D_{xt}^2)^{1/2}/C_{xt}$    (113)

    $\sec\beta_{vxvx} = (C_{xt}^2 + D_{xt}^2)^{1/2}/D_{xt}$    (114)

where

    $C_{xt} = \sin\beta_{tt}^x \csc\beta_{xx}\,(x/t)\,(d\theta_x/d\theta_t^x)[d\theta_x/d\theta_t^x - 1 + d/d\theta_t^x(\beta_{xx} - \beta_{tt}^x)]$    (115)

    $D_{xt} = d/d\theta_t^x(\sin\beta_{tt}^x \csc\beta_{xx}\,(x/t)\,d\theta_x/d\theta_t^x)$    (116)

where in general $C_{xt} < 0$. Therefore

    $a_x' = -(C_{xt}/t) \csc\beta_{vxvx} \sin\beta_{tt}^x$    (117)
    $\quad = -(D_{xt}/t) \sec\beta_{vxvx} \sin\beta_{tt}^x$    (118)

Equations (117) and (118) can also be written as

    $a_x' = -[(C_{xt}^2 + D_{xt}^2)^{1/2}/t] \sin\beta_{tt}^x$    (119)

For the case of coherent space and coherent time equations (109) through
(111) become with the help of equation (100)

    $a_x^{c\prime} = -\csc\beta_{vxvx}^c\,(x/t^2)\,(d\theta_x/d\theta_t^x)(d\theta_x/d\theta_t^x - 1)$    (120)
    $\quad = -\sec\beta_{vxvx}^c\,(x/t^2)\,d^2\theta_x/d(\theta_t^x)^2$    (121)

    $\theta_{ax}^{c\prime} = \theta_x + \beta_{vxvx}^c - 2\theta_t^x + \pi/2$    (122)

From equations (38) and (39) it follows that

    $v_x^c = (x/t)\,d\theta_x/d\theta_t^x$    (123)

    $\theta_{vx}^c = \theta_x - \theta_t^x$    (124)

Equations (51), (123) and (124) give

    $\tan\beta_{vxvx}^c = E_{xt}^c/F_{xt}^c$    (125)

where

    $E_{xt}^c = (d\theta_x/d\theta_t^x)(d\theta_x/d\theta_t^x - 1)$    $E_{xt}^c < 0$    (126)

    $F_{xt}^c = d^2\theta_x/d(\theta_t^x)^2$    $F_{xt}^c > 0$    (127)

    $\csc\beta_{vxvx}^c = [(E_{xt}^c)^2 + (F_{xt}^c)^2]^{1/2}/E_{xt}^c$    (128)
    $\sec\beta_{vxvx}^c = [(E_{xt}^c)^2 + (F_{xt}^c)^2]^{1/2}/F_{xt}^c$    (129)

where $d\theta_x/d\theta_t^x < 1$. In a gravitational field, for example, the following
relationship holds1

    $2\theta_t^x \approx 3\theta_x$    $d\theta_x/d\theta_t^x \approx 2/3$    (130)

Therefore in general $E_{xt}^c \lesssim 0$ and $F_{xt}^c > 0$, and it is convenient to introduce a
new angle by writing

which is equivalent to equation (100). The magnitude and phase angle equations
(123) and (124) are equivalent to the following complex number expression for
the particle speed in coherent spacetime

    $\bar{v}_x^c = (d\bar{x}/d\bar{t})^c = (\bar{x}/\bar{t})\,d\theta_x/d\theta_t^x$    (137)

The magnitude and phase angle of the acceleration that are given in equations
(133) and (134) correspond to a complex number acceleration that is obtained
from equation (106) and is given by

    $\bar{a}_x^c = (d^2\bar{x}/d\bar{t}^2)^c = (\bar{x}/\bar{t}^2)[(d\theta_x/d\theta_t^x)(d\theta_x/d\theta_t^x - 1) - j\,d^2\theta_x/d(\theta_t^x)^2]$    (138)

as it must be because of the definition of the new angle given in equations (131) and
(132). For a linear solution of the form

    $\theta_x = a_x\theta_t^x + b_x$    $a_x < 1$    (139)

where $a_x$ and $b_x$ in equation (139) are constant coefficients (not the acceleration),
it follows from equation (133) that

    $a_x^{c\prime} = -(x/t^2)\,|a_x(a_x - 1)| = -(x/t^2)\,a_x(1 - a_x)$    (140)


which  corresponds  to  Fxt  -  0  and  *  -  it/2  by  equations  (125)  through  (127). 
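The complex acceleration of equation (138) can be checked by differentiating the complex speed of equation (137) numerically. The Python sketch below is illustrative only: the magnitudes, the quadratic form chosen for the space phase angle as a function of the time phase angle, and all function names are assumptions made for the check.

```python
import cmath

X, T = 2.0, 3.0            # hypothetical fixed magnitudes of x and t
A, B, C = 0.6, 0.1, 0.05   # hypothetical coefficients of theta_x(tau)

def theta_x(tau):
    """Space phase angle as a function of the time phase angle tau."""
    return A * tau + B + C * tau ** 2

def dtheta_x(tau):
    return A + 2 * C * tau

def d2theta_x(tau):
    return 2 * C

def x_bar(tau):
    return X * cmath.exp(1j * theta_x(tau))

def t_bar(tau):
    return T * cmath.exp(1j * tau)

def v_bar(tau):
    """Complex speed in coherent spacetime, eq. (137)."""
    return x_bar(tau) / t_bar(tau) * dtheta_x(tau)

def a_bar_eq138(tau):
    """Complex acceleration in coherent spacetime, eq. (138)."""
    u = dtheta_x(tau)
    return x_bar(tau) / t_bar(tau) ** 2 * (u * (u - 1) - 1j * d2theta_x(tau))

def a_bar_numeric(tau, h=1e-5):
    """Central difference of v_bar divided by d(t_bar)/d(tau) = j*t_bar."""
    dv = (v_bar(tau + h) - v_bar(tau - h)) / (2 * h)
    return dv / (1j * t_bar(tau))

tau0 = 0.3
numeric = a_bar_numeric(tau0)
closed_form = a_bar_eq138(tau0)
```

Setting C to zero reproduces the linear solution of equation (139), for which the bracket in equation (138) is real and negative when the slope is between zero and one.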

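C. Harmonic Motion.

For harmonic motion

    $v_x = v_{x\omega} \exp(i\omega t)$    (141)

and it follows from equation (23) that for harmonic motion in external space
with x and t varying $\beta_{xx}$ is an imaginary number given by

    $\beta_{xx} = -iQ_x$    $\tanh Q_x = \omega^{-1}\,\partial\theta_x/\partial t$    (142)

    $\sec\beta_{xx} = \operatorname{sech} Q_x$    $\csc\beta_{xx} = i\operatorname{csch} Q_x$    (143)

so that equations (35) through (38) become

    $v_x = i\omega x \cos\beta_{tt}^x \operatorname{sech} Q_x$    (144)
    $\quad = ix \cos\beta_{tt}^x \operatorname{csch} Q_x\,d\theta_x/dt$    (145)
    $\quad = (i\omega x/t) \sin\beta_{tt}^x \operatorname{sech} Q_x\,dt/d\theta_t^x$    (146)
    $\quad = (ix/t) \sin\beta_{tt}^x \operatorname{csch} Q_x\,d\theta_x/d\theta_t^x$    (147)

where for harmonic motion the coordinates and velocities are complex numbers in
external space so that

    $x = x_\omega \exp(i\omega t)$    $v_x = v_{x\omega} \exp(i\omega t)$    $v_{x\omega} = i\omega x_\omega$    (148)

    $\bar{x} = x_\omega \exp(i\omega t + j\theta_x)$    $\bar{v}_x = v_{x\omega} \exp(i\omega t + j\theta_{vx})$    (149)

Comparing equations (144) and (147) gives

    $\omega t = \tan\beta_{tt}^x \coth Q_x\,d\theta_x/d\theta_t^x$    (150)

while combining equations (144) and (145) gives

    $\omega = \coth Q_x\,d\theta_x/dt$    (151)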

For the case of harmonic motion in external space, where x and t are
variables, $\beta_{vxvx}$ is an imaginary number given by

    $\tan\beta_{vxvx} = v_x\,\partial\theta_{vx}/\partial v_x = x\,\partial\theta_{vx}/\partial x = -(i/\omega)\,d\theta_{vx}/dt$    (152)

Introducing a new quantity by

    $\beta_{vxvx} = -iQ_{vx}$    (153)

allows equation (152) to be written as

    $\tan\beta_{vxvx} = -i\tanh Q_{vx} = -(i/\omega)\,d\theta_{vx}/dt$    (154)

    $\tanh Q_{vx} = \omega^{-1}\,d\theta_{vx}/dt$    (155)

    $\sec\beta_{vxvx} = \operatorname{sech} Q_{vx}$    $\csc\beta_{vxvx} = i\operatorname{csch} Q_{vx}$    (156)

which are valid for harmonic motion. For harmonic motion equation (53) can be
rewritten using equation (143) as follows

    $a_x \approx -\omega^2 x \cos^2\beta_{tt}^x \sec\beta_{vxvx} \operatorname{sech} Q_x$    (157)

or since $\beta_{vxvx}$ is an imaginary number given by equation (153) for the case
where x and t are variables it follows from equation (156) that equation (157)
can be written as

    $a_x \approx -\omega^2 x \cos^2\beta_{tt}^x \operatorname{sech} Q_{vx} \operatorname{sech} Q_x$    (158)


D. Measured and Conventionally Calculated Kinematic Quantities.

The average speed of a particle in spacetime with broken internal
symmetries is defined as

    $\bar{v}_x^{av} = v_x^{av} \exp(j\theta_{vx}^{av}) = \bar{x}/\bar{t} = (x/t) \exp[j(\theta_x - \theta_t^x)]$    (159)

    $v_x^{av} = x/t$    $\theta_{vx}^{av} = \theta_x - \theta_t^x$    (160)

The conventional definition of the average speed is given by

    $v_{conx}^{av} = x_m/t_m = (x\cos\theta_x)/(t\cos\theta_t^x)$    (161)

while the measured value of the average speed is obtained from equations (159)
and (160)

    $v_{mx}^{av} = v_x^{av} \cos\theta_{vx}^{av} = (x/t)\cos(\theta_x - \theta_t^x)$    (162)

A comparison of equations (161) and (162) gives

    $(v_{mx}^{av} - v_{conx}^{av})/v_x^{av} = \cos(\theta_x - \theta_t^x) - (\cos\theta_x)/(\cos\theta_t^x)$    (163)
    $\quad \approx \theta_t^x(\theta_x - \theta_t^x)$    (164)

where equation (164) is valid for small angles.
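The small-angle result in equation (164) follows from expanding equation (163) to second order in the phase angles. A quick numerical comparison is sketched below in Python; the function name and the sample angles are assumptions used only for illustration.

```python
import math

def relative_speed_difference(theta_x, theta_t):
    """Return the exact form, eq. (163), and the small-angle form, eq. (164)."""
    exact = math.cos(theta_x - theta_t) - math.cos(theta_x) / math.cos(theta_t)
    approx = theta_t * (theta_x - theta_t)
    return exact, approx

# Hypothetical small internal phase angles in radians.
exact, approx = relative_speed_difference(theta_x=0.02, theta_t=0.01)
```

For these angles the two values differ only at fourth order in the angles, so the measured and conventional average speeds differ by roughly one part in ten thousand of the speed magnitude.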


The value of the instantaneous velocity is defined for broken symmetry
spacetime by equations (34) through (39). The conventionally defined
instantaneous velocity is given by

    $v_{conx} = dx_m/dt_m = d(x\cos\theta_x)/d(t\cos\theta_t^x)$    (165)
    $\quad = f_x\,(dx/dt) \cos\theta_x \sec\theta_t^x$    (166)

where $t_m$ is defined by equation (8), and where

    $f_x = (1 - \tan\theta_x \tan\beta_{xx})/(1 - \tan\theta_t^x \tan\beta_{tt}^x)$    (167)
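The factor in equation (167) arises from differentiating the measured coordinates of equation (8). The Python sketch below compares the direct derivative ratio of equation (165) with the factored form of equation (166); the function names and the sample values are hypothetical, and all derivatives are taken with respect to a common parameter.

```python
import math

def conventional_velocity_direct(x, dx, th_x, dth_x, t, dt, th_t, dth_t):
    """d(x cos th_x)/d(t cos th_t), eq. (165), by the product rule."""
    dxm = dx * math.cos(th_x) - x * math.sin(th_x) * dth_x
    dtm = dt * math.cos(th_t) - t * math.sin(th_t) * dth_t
    return dxm / dtm

def conventional_velocity_factored(x, dx, th_x, dth_x, t, dt, th_t, dth_t):
    """f_x (dx/dt) cos th_x sec th_t, eqs. (166) and (167)."""
    tan_bxx = x * dth_x / dx   # eq. (4) for the x coordinate
    tan_btt = t * dth_t / dt   # eq. (5)
    f_x = (1 - math.tan(th_x) * tan_bxx) / (1 - math.tan(th_t) * tan_btt)
    return f_x * (dx / dt) * math.cos(th_x) / math.cos(th_t)

sample = dict(x=2.0, dx=1.5, th_x=0.3, dth_x=0.2,
              t=1.0, dt=1.0, th_t=0.1, dth_t=0.05)
direct = conventional_velocity_direct(**sample)
factored = conventional_velocity_factored(**sample)
```

The two evaluations agree to rounding error for any sample point with nonzero dx and dt.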


The measured value of the instantaneous speed of a particle in approximately
incoherent spacetime is obtained from equations (34), (35), (39) and (40) to be

    $v_{mx} = v_x \cos\theta_{vx}$
    $\quad = (dx/dt) \cos\beta_{tt}^x \sec\beta_{xx} \cos\theta_{vx}$
    $\quad = (v_{conx}/f_x) \cos\beta_{tt}^x \sec\beta_{xx} \sec\theta_x \cos\theta_t^x \cos\theta_{vx}$    (168)

where equation (35) was selected for $v_x$, and where $f_x \neq 0$. Combining equations
(166) and (168) gives

    $(v_{mx} - v_{conx})/v_x = \cos(\theta_x + \beta_{xx} - \theta_t^x - \beta_{tt}^x) - f_x \cos\theta_x \sec\theta_t^x \cos\beta_{xx} \sec\beta_{tt}^x$
    $\quad \approx (\theta_t^x + \beta_{tt}^x)(\theta_x + \beta_{xx} - \theta_t^x - \beta_{tt}^x)$    (169)

For the case where $\beta_{tt}^x = 0$ and $\beta_{xx} = 0$ this equation becomes the same as
equation (164) for the average speeds. Equations (165) and (166) show that a
conventionally defined stationary particle is defined by

    $v_{conx} = dx_m/dt_m = 0$    (170)


Generally this is associated with $dx/dt = 0$, but if it is associated with
$f_x = 0$ then

    $\cot\theta_x = \tan\beta_{xx}$    $dx/x = \tan\theta_x\,d\theta_x$    $\beta_{xx} = \pi/2 - \theta_x$    (171)

which integrates to

    $x = k \sec\theta_x$    $x_m = x\cos\theta_x = k$    (172)

where k = constant. For this case $dx/dt \neq 0$ and in fact from equation (172)

    $dx/dt = k \sec\theta_x \tan\theta_x\,d\theta_x/dt$    (173)

For the case $f_x = 0$, which corresponds to $v_{conx} = 0$ from equation (166) and
which yields the relation in equation (173), it follows from equation (35) that
the particle velocity magnitude and the measured particle velocity are given
respectively by

    $v_x = k \cos\beta_{tt}^x \sec\beta_{xx} \sec\theta_x \tan\theta_x\,d\theta_x/dt$    (174)

    $v_{mx} = k \cos\beta_{tt}^x \sec\beta_{xx} \sec\theta_x \tan\theta_x \cos\theta_{vx}\,d\theta_x/dt$    (175)

where $\theta_{vx}$ is given by equation (39).

The conventionally defined instantaneous velocity given by equation (165)
can also be written as

    $v_{conx} = g_x\,(x/t)\,(d\theta_x/d\theta_t^x) \sin\theta_x \csc\theta_t^x$    (176)

where

    $g_x = (1 - \cot\theta_x\,x^{-1}\,dx/d\theta_x)/(1 - \cot\theta_t^x\,t^{-1}\,dt/d\theta_t^x)$    (177)

In this case the measured instantaneous speed is obtained from equations (38)
and (40) to be

    $v_{mx} = v_x \cos\theta_{vx}$
    $\quad = (x/t)\,(d\theta_x/d\theta_t^x) \csc\beta_{xx} \sin\beta_{tt}^x \cos\theta_{vx}$
    $\quad = (v_{conx}/g_x) \csc\beta_{xx} \sin\beta_{tt}^x \csc\theta_x \sin\theta_t^x \cos\theta_{vx}$    (178)

where $g_x \neq 0$.


The particle momentum is written as

    $\bar{p}_x = p_x \exp(j\theta_{px}) = m\bar{v}_x = m\,d\bar{x}/d\bar{t}$    (179)

    $p_x = m v_x = m \sec\beta_{xx} \cos\beta_{tt}^x\,dx/dt$    (180)

    $\theta_{px} = \theta_{vx} = \theta_x + \beta_{xx} - \theta_t^x - \beta_{tt}^x$    (181)

The measured momentum is

    $p_{mx} = p_x \cos\theta_{px}$    (182)

The conventional momentum is

    $p_{conx} = m v_{conx} = m\,dx_m/dt_m$    (183)

Then

    $p_{mx} = m_{px} v_{conx}$    (184)

where

    $m_{px} = (m/f_x) \cos\beta_{tt}^x \sec\beta_{xx} \sec\theta_x \cos\theta_t^x \cos\theta_{px}$    (185)

where $f_x \neq 0$, and where $m_{px}$ = effective mass which relates the measured
momentum to the conventional definition of particle speed in the x direction.

The average acceleration of a particle in broken symmetry spacetime is
given by

    $\bar{a}_x^{av} = a_x^{av} \exp(j\theta_{ax}^{av}) = \bar{v}_x/\bar{t} = (v_x/t) \exp[j(\theta_{vx} - \theta_t^x)]$    (186)

    $a_x^{av} = v_x/t$    $\theta_{ax}^{av} = \theta_{vx} - \theta_t^x$    (187)

where $v_x$ = speed of particle after time t for a particle starting from a rest
position. The conventional definition of average acceleration for a particle
initially at rest is given by

    $a_{conx}^{av} = v_{mx}/t_m = (v_x\cos\theta_{vx})/(t\cos\theta_t^x)$    (188)

The measured value of the average acceleration is obtained from equation (186)
to be

    $a_{mx}^{av} = a_x^{av} \cos\theta_{ax}^{av} = (v_x/t)\cos(\theta_{vx} - \theta_t^x)$    (189)

From equations (186) through (189) it follows that

    $(a_{mx}^{av} - a_{conx}^{av})/a_x^{av} = \cos(\theta_{vx} - \theta_t^x) - (\cos\theta_{vx})/(\cos\theta_t^x)$    (190)
    $\quad \approx \theta_t^x(\theta_{vx} - \theta_t^x)$    (191)

where equation (191) is valid for small angles. Therefore in an external field
which breaks the spacetime symmetry the measured average acceleration does not
have the same value as the conventionally calculated average acceleration.


The instantaneous acceleration is given by equations (42) through (158).
The conventionally defined acceleration is expressed in terms of measured space
and time coordinates as follows

    $a_{conx} = dv_{mx}/dt_m = d(v_x\cos\theta_{vx})/d(t\cos\theta_t^x)$
    $\quad = h_x\,(dv_x/dt) \cos\theta_{vx} \sec\theta_t^x$    (192)

where

    $h_x = (1 - \tan\theta_{vx} \tan\beta_{vxvx})/(1 - \tan\theta_t^x \tan\beta_{tt}^x)$    (193)

where from equation (35)

    $dv_x/dt = d/dt(\cos\beta_{tt}^x \sec\beta_{xx}\,dx/dt)$    (194)
    $\quad \approx \cos\beta_{tt}^x \sec\beta_{xx}\,d^2x/dt^2$    (195)

Combining equations (192) and (195) gives

    $a_{conx} \approx h_x \cos\beta_{tt}^x \sec\beta_{xx} \cos\theta_{vx} \sec\theta_t^x\,d^2x/dt^2$    (196)

On the other hand the measured value of the acceleration of a particle in
broken symmetry spacetime is obtained from equations (49), (50), (53) and (55) to be

    $a_{mx} = a_x \cos\theta_{ax}$    (197)
    $\quad = \cos\beta_{tt}^x \sec\beta_{vxvx} \cos\theta_{ax}\,dv_x/dt$    (198)
    $\quad \approx \cos^2\beta_{tt}^x \sec\beta_{vxvx} \sec\beta_{xx} \cos\theta_{ax}\,d^2x/dt^2$    (199)

where the internal phase angle of the acceleration is taken from equations (50)
or (55). Clearly $a_{conx}$ and $a_{mx}$ are not equivalent. Combining equations (49),
(50), (55), (192) and (198) gives

    $(a_{mx} - a_{conx})/a_x = \cos\theta_{ax} - h_x \cos\theta_{vx} \sec\theta_t^x \sec\beta_{tt}^x \cos\beta_{vxvx}$    (200)
    $\quad \approx (\theta_t^x + \beta_{tt}^x)(\theta_{vx} + \beta_{vxvx} - \theta_t^x - \beta_{tt}^x)$    (201)
    $\quad \approx (\theta_t^x + \beta_{tt}^x)[\theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^x + \beta_{tt}^x)]$    (202)

where equations (201) and (202) are valid for small internal phase angles.
From equations (192) and (197) it follows that

    $a_{mx} = (a_{conx}/h_x) \cos\beta_{tt}^x \sec\beta_{vxvx} \sec\theta_{vx} \cos\theta_t^x \cos\theta_{ax}$    (203)
    $\quad = (dv_x/dt) \cos\beta_{tt}^x \sec\beta_{vxvx} \cos\theta_{ax}$    (204)

where $\theta_{ax}$ is given by equations (50) or (55), and $h_x \neq 0$.

For the case when $h_x = 0$ and the conventional acceleration given by
equation (192) is zero, $a_{conx} = 0$, then it follows from equation (193) that

    $\tan\beta_{vxvx} = v_x\,\partial\theta_{vx}/\partial v_x = \cot\theta_{vx}$    $\beta_{vxvx} = \pi/2 - \theta_{vx}$    (205)

    $v_x = k \sec\theta_{vx}$    $v_{xm} = v_x \cos\theta_{vx} = k$    (206)

Combining equations (49) and (206) gives

    $a_x = k \cos\beta_{tt}^x \sec\beta_{vxvx} \sec\theta_{vx} \tan\theta_{vx}\,d\theta_{vx}/dt$    (207)

The measured acceleration is then given by

    $a_{mx} = a_x \cos\theta_{ax}$    (208)

where $\theta_{ax}$ is given by equations (50) and (55). This is an example of a case
where a force exerted on a body is associated with a zero value of acceleration
in the conventional sense because $a_{conx} = 0$, but does have a measured
acceleration value given by equation (208). The acceleration sensed by dynamical
measurements is given by equations (208) and (47).

E. Measured and Conventional Angular Velocity.

The conventional angular speed is given by

    $\omega_{con} = d\phi_m/dt_m = f_\phi \cos\theta_\phi \sec\theta_t^\phi\,d\phi/dt$    (209)

where

    $\phi_m = \phi\cos\theta_\phi$    $t_m = t\cos\theta_t^\phi$    (210)

    $f_\phi = (1 - \tan\beta_{\phi\phi} \tan\theta_\phi)/(1 - \tan\beta_{tt}^\phi \tan\theta_t^\phi)$    (211)

where $\theta_t^\phi$ and $\beta_{tt}^\phi$ are the time internal phase angles associated with rotational
motion. If $f_\phi = 0$ then $\omega_{con} = 0$ and

    $\tan\beta_{\phi\phi} = \cot\theta_\phi$    $\phi\,\partial\theta_\phi/\partial\phi = \cot\theta_\phi$    $\beta_{\phi\phi} = \pi/2 - \theta_\phi$    (212)

    $\phi = k \sec\theta_\phi$    (213)

Therefore a rotating system may be conventionally interpreted to be at rest,
but rotates internally so that the magnitude of the angle changes in the
following manner from equation (213)

    $d\phi/dt = k \sec\theta_\phi \tan\theta_\phi\,d\theta_\phi/dt$    (214)

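The complex number angular speed is written as

    $\bar{\omega} = \omega \exp(j\theta_\omega) = d\bar{\phi}/d\bar{t}$    (215)

and the measured angular speed is given by equations (10) and (16) as

    $\omega_m = \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi \cos\theta_\omega\,d\phi/dt$
    $\quad = k \cos\beta_{tt}^\phi \sec\beta_{\phi\phi} \cos\theta_\omega \sec\theta_\phi \tan\theta_\phi\,d\theta_\phi/dt$    (216)

where from equation (14)

    $\theta_\omega = \theta_\phi + \beta_{\phi\phi} - \theta_t^\phi - \beta_{tt}^\phi$    (217)

where $\beta_{\phi\phi}$ and $\beta_{tt}^\phi$ are given by equations (4) and (5) respectively.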
The angular momentum is written as

    $\bar{L} = L \exp(j\theta_L) = \bar{I}\,d\bar{\phi}/d\bar{t}$    (218)

where the moment of inertia is written as

    $\bar{I} = I \exp(j\theta_I)$    $I = mr^2$    $\theta_I = 2\theta_r$    (219)

The measured value of the moment of inertia is given by

    $I_m = I \cos\theta_I$    (220)

From equations (10), (14) and (218) it follows that the magnitude and internal
phase angle of the angular momentum are given by

    $L = I \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi\,d\phi/dt$    (221)

    $\theta_L = \theta_I + \theta_\phi + \beta_{\phi\phi} - \theta_t^\phi - \beta_{tt}^\phi = \theta_I + \theta_\omega$    (222)

where from equation (4)

    $\tan\beta_{\phi\phi} = \phi\,\partial\theta_\phi/\partial\phi$    (223)

The measured angular momentum is given by

    $L_m = L \cos\theta_L$    (224)
    $\quad = I \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi \cos\theta_L\,d\phi/dt$    (225)

Combining equations (209) through (225) gives


    $L_m = I_{eff}\,d\phi_m/dt_m = I_{eff}\,\omega_{con}$    (226)

where the effective moment of inertia is given by

    $I_{eff} = (I_m/f_\phi) \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi \sec\theta_I \cos\theta_t^\phi \sec\theta_\phi \cos\theta_L$    (227)

for $f_\phi \neq 0$. For the case $f_\phi = 0$ the conventional angular speed in equation (209)
has a zero value, while equations (214), (221), (224) and (225) give

    $L = kI \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi \sec\theta_\phi \tan\theta_\phi\,d\theta_\phi/dt$    (228)

    $L_m = kI \sec\beta_{\phi\phi} \cos\beta_{tt}^\phi \sec\theta_\phi \tan\theta_\phi \cos\theta_L\,d\theta_\phi/dt$    (229)

Equations (226) and (227) are not valid for the case $f_\phi = 0$.


3. NEWTON'S LAW OF MOTION IN BROKEN SYMMETRY SPACETIME. Newton's law of motion for a particle in a potential field can be written for spacetime with broken internal symmetries as follows [4-6]

    m \bar{a}_x = m \, d^2\bar{x}/d\bar{t}^2 = -\partial\bar{W}/\partial\bar{x} = \bar{F}_x        (230)

where m = particle mass and \bar{W} = complex number potential, which can be written as

    \bar{W} = W \exp(j\theta_W)        (231)

The derivatives of the potential function can be written in four ways. Representing the complex number force in the following way

    \bar{F}_x = F_x \exp(j\theta_{Fx}) = -\partial\bar{W}/\partial\bar{x}        (232)

gives

    F_x = -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (233)
        = -\cos\beta_{xx} \csc\beta_{WW} \, W \, \partial\theta_W/\partial x        (234)
        = -\sin\beta_{xx} \sec\beta_{WW} \, x^{-1} \, \partial W/\partial\theta_x        (235)
        = -\sin\beta_{xx} \csc\beta_{WW} \, (W/x) \, \partial\theta_W/\partial\theta_x        (236)

and

    \theta_{Fx} = \theta_W + \beta_{WW} - \theta_x - \beta_{xx}        (237)

where \theta_{Fx} is a small angle, and where

    \tan\beta_{WW} = W \, \partial\theta_W/\partial W        (238)


Then the phase angle condition for Newton's law is obtained for nearly incoherent space and nearly incoherent time from equations (55) and (237) as

    \theta_{ax} = \theta_{Fx}        (239)

For nearly coherent space and nearly incoherent time, equations (63) and (237) give

    \theta_{ax} = \theta_{Fx}        (240)

For nearly incoherent space and nearly coherent time, equations (86) and (237) give

    \theta_{ax} = \theta_{Fx}        (241)

For the case of nearly coherent space and nearly coherent time it follows from equations (108) and (237) that

    \theta_{ax} = \theta_{Fx}        (242)


Newton's  law  of  motion  given  by  equation  (230)  will  now  be  considered  for  the 
four  kinematic  spacetime  conditions  that  were  considered  in  Section  2. 

A.  Incoherent  Space  and  Incoherent  Time. 

This section develops the Newtonian law of dynamics for broken symmetry spacetime in a form that is suitable to make the transition to the case of incoherent space and incoherent time, which is described by

    \theta_x = 0    \beta_{xx} = 0    \theta_t^{x} = 0    \beta_{tt}^{x} = 0        (243)

where x and t are variables. Combining equations (43), (49), (50), (230), (233) and (237) gives

    m \cos\beta_{tt}^{x} \sec\beta_{vxvx} \, dv_x/dt = -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (244)

    \theta_{ax} = \theta_{vx} + \beta_{vxvx} - \theta_t^{x} - \beta_{tt}^{x} = \theta_W + \beta_{WW} - \theta_x - \beta_{xx}        (245)

where \beta_{vxvx} and \beta_{WW} are given by equations (51) and (238) respectively. Combining equations (35) and (244) gives

    m \cos\beta_{tt}^{x} \sec\beta_{vxvx} \, d/dt(\cos\beta_{tt}^{x} \sec\beta_{xx} \, dx/dt) = -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (246)

which can be written approximately as

    m \cos^2\beta_{tt}^{x} \sec\beta_{vxvx} \sec\beta_{xx} \, d^2x/dt^2 \approx -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (247)


Combining equations (55) and (245) gives

    \theta_{ax} = \theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^{x} + \beta_{tt}^{x})
                = \theta_W + \beta_{WW} - \theta_x - \beta_{xx}        (248)

If the potential function is given by a power law W \sim x^n then equation (238) gives

    \beta_{WW} = \beta_{xx}        (249)
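The power-law statement in equation (249) can be checked numerically. The sketch below is illustrative only: it assumes a complex potential of the form \bar{W} = \bar{k} \bar{x}^n, so that W = k x^n and \theta_W = \theta_k + n\theta_x, and it uses a made-up path x(s), \theta_x(s) and made-up constants. It evaluates \tan\beta_{xx} = x \, d\theta_x/dx and \tan\beta_{WW} = W \, d\theta_W/dW by central differences along that path.

```python
def tan_beta(mag, phase, s, h=1e-6):
    """Return mag * d(phase)/d(mag) along a path parametrised by s."""
    dmag = (mag(s + h) - mag(s - h)) / (2 * h)
    dphase = (phase(s + h) - phase(s - h)) / (2 * h)
    return mag(s) * dphase / dmag

# Hypothetical path: magnitude and internal phase angle of x as functions of s.
x = lambda s: 1.0 + 0.5 * s
theta_x = lambda s: 0.3 * s ** 2

# Illustrative constants for the assumed power law W = k x^n.
k, theta_k, n = 2.0, 0.1, 3
W = lambda s: k * x(s) ** n
theta_W = lambda s: theta_k + n * theta_x(s)

s0 = 0.7
tan_beta_xx = tan_beta(x, theta_x, s0)
tan_beta_WW = tan_beta(W, theta_W, s0)
print(tan_beta_xx, tan_beta_WW)  # the two values agree to rounding error
```

The agreement does not depend on the exponent n or on the constant phase \theta_k, since both cancel in the ratio W \, d\theta_W/dW.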


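Therefore a reasonable approximation to equation (248) for many potential functions is given by

    2\theta_x + \beta_{xx} + \beta_{vxvx} - 2(\theta_t^{x} + \beta_{tt}^{x}) \approx \theta_W        (250)

With the further approximation that \beta_{vxvx} \approx \beta_{xx} equation (250) becomes

    2(\theta_x + \beta_{xx} - \theta_t^{x} - \beta_{tt}^{x}) \approx \theta_W        (251)

Within the approximations \beta_{vxvx} \approx \beta_{xx} and \beta_{WW} \approx \beta_{xx} equations (55) and (248) become

    \theta_{ax} \approx \theta_x + 2\beta_{xx} - 2(\theta_t^{x} + \beta_{tt}^{x}) \approx \theta_W - \theta_x        (252)

while equation (247) with \beta_{WW} \approx \beta_{xx} becomes

    m \cos^2\beta_{tt}^{x} \sec\beta_{vxvx} \sec\beta_{xx} \, d^2x/dt^2 \approx -\partial W/\partial x        (253)

and with the further approximation that \beta_{vxvx} \approx \beta_{xx} equation (253) becomes

    m \cos^2\beta_{tt}^{x} \sec^2\beta_{xx} \, d^2x/dt^2 \approx -\partial W/\partial x        (254)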


If  on  the  other  hand  it  is  assumed  that  0^^,^  ^  0  ,  or  equivalently  6xt  -  ir/2  , 
then  equations  (250) ,  (55)  and  (253)  become 


    2\theta_x + \beta_{xx} - 2(\theta_t^{x} + \beta_{tt}^{x}) \approx \theta_W        (255)

    \theta_{ax} \approx \theta_x + \beta_{xx} - 2(\theta_t^{x} + \beta_{tt}^{x})        (256)

    m \cos^2\beta_{tt}^{x} \sec\beta_{xx} \, d^2x/dt^2 \approx -\partial W/\partial x        (257)

The approximations in equations (255) through (257) are not nearly as good as the approximations in equations (251), (252) and (254) respectively. The limiting case of incoherent space and incoherent time is obtained by setting all phase angles equal to zero and the exact equation (246) becomes

    m \, d^2x/dt^2 = -\partial W/\partial x        (258)

which  is  the  standard  form  of  Newton's  law  of  motion. 

B.  Coherent  Space  and  Incoherent  Time. 

The case of coherent space and incoherent time is described by

    \beta_{xx} = \pi/2    \theta_t^{x} = 0    \beta_{tt}^{x} = 0        (259)

where \theta_x and t are variables. This section develops a form of Newton's law of motion that is suitable for making the transition to the case of coherent space and incoherent time given by equation (259). Combining equations (64) and (236) gives

    -m \csc\beta_{vxvx} \cos\beta_{tt}^{x} \, v_x \, d\theta_{vx}/dt = -\sin\beta_{xx} \csc\beta_{WW} \, (W/x) \, \partial\theta_W/\partial\theta_x        (260)

Equivalently, combining equations (65) and (236) gives

    -m \csc\beta_{vxvx} \cos^2\beta_{tt}^{x} \csc\beta_{xx} \, x \, (d\theta_x/dt)(d\theta_{vx}/dt) = -\sin\beta_{xx} \csc\beta_{WW} \, (W/x) \, \partial\theta_W/\partial\theta_x        (261)

where d\theta_{vx}/dt is given by equation (60).

For the limiting case of coherent space and incoherent time given in equation (259), it follows that equation (261) becomes

    -m \csc\beta_{vxvx}^{ci} \, x \, (d\theta_x/dt)^2 = -\csc\beta_{WW} \, (W/x) \, \partial\theta_W/\partial\theta_x        (262)

Using equations (71) through (74) with equation (262) gives

    -m x [(E_{xt}^{ci})^2 + (F_{xt}^{ci})^2]^{1/2} = -\csc\beta_{WW} \, (W/x) \, \partial\theta_W/\partial\theta_x        (263)

where E_{xt}^{ci} and F_{xt}^{ci} are given by equations (72) and (73). If the potential function changes coherently when the space coordinate changes coherently it follows that \beta_{WW} = \pi/2 and equation (263) becomes

    -m x [(E_{xt}^{ci})^2 + (F_{xt}^{ci})^2]^{1/2} = -(W/x) \, \partial\theta_W/\partial\theta_x        (264)

The internal phase angle equation for Newton's law of motion in coherent space and incoherent time is obtained from equations (78), (237) and (240) to be

    \theta_{ax}^{ci} = \theta_x + \kappa_{xt} = \theta_W - \theta_x        (265)

where \kappa_{xt} is given by equation (76).

C.  Incoherent  Space  and  Coherent  Time. 

In this section a form of Newton's dynamical law is developed for broken symmetry spacetime that can be used to attain the case of incoherent space and coherent time, which is described by

    \theta_x = 0    \beta_{xx} = 0    \beta_{tt}^{x} = \pi/2        (266)

where x and \theta_t^{x} are variables. Combining equations (87) and (233) gives

    -m \sec\beta_{vxvx} \sin\beta_{tt}^{x} \, t^{-1} \, dv_x/d\theta_t^{x} = -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (267)

From equations (88) and (233) it follows that equation (267) can be written as

    -m \sec\beta_{vxvx} \sin\beta_{tt}^{x} \, t^{-1} \, d/d\theta_t^{x}(\sin\beta_{tt}^{x} \sec\beta_{xx} \, t^{-1} \, dx/d\theta_t^{x}) = -\cos\beta_{xx} \sec\beta_{WW} \, \partial W/\partial x        (268)

Equations (267) and (268) are completely general equations.

For the limiting case of incoherent space and coherent time, equation (268) becomes

    -m \sec\beta_{vxvx}^{ic} \, t^{-2} \, d^2x/d(\theta_t^{x})^2 = -\sec\beta_{WW} \, \partial W/\partial x        (269)


Using equations (96) through (98) with equation (269) gives, with \beta_{WW} = 0,

    -m t^{-2} [(E_{xt}^{ic})^2 + (F_{xt}^{ic})^2]^{1/2} = -\partial W/\partial x        (270)

where E_{xt}^{ic} and F_{xt}^{ic} are given by equations (96) and (97). The corresponding phase angle condition for Newton's law of motion is obtained from equations (92), (237) and (241) to be


    \theta_{ax}^{ic} = \beta_{vxvx}^{ic} - 2\theta_t^{x} = \theta_W        (271)


where \theta_W = constant and \beta_{WW} = 0, and where \beta_{vxvx}^{ic} is given by equation (95). Equations (270) and (271) represent Newton's dynamical law of motion for incoherent space and coherent time.

D.  Coherent  Space  and  Coherent  Time. 

This section considers a formulation of Newton's law of dynamics in broken symmetry spacetime that can be reduced to the limiting case of totally coherent spacetime, which is described by

    \beta_{xx} = \pi/2    \beta_{tt}^{x} = \pi/2        (272)

where \theta_x and \theta_t^{x} are variables. The proper form of the law of motion for this case is obtained from equations (101), (106), (107), (230) and (236) which give

    -m \csc\beta_{vxvx} \sin\beta_{tt}^{x} \, (v_x/t) \, d\theta_{vx}/d\theta_t^{x} = -\csc\beta_{WW} \sin\beta_{xx} \, (W/x) \, \partial\theta_W/\partial\theta_x        (273)

Equation (273) can be rewritten using equation (109) as follows

    -(m x/t^2) \csc\beta_{vxvx} \sin^2\beta_{tt}^{x} \csc\beta_{xx} \, (d\theta_x/d\theta_t^{x})(d\theta_{vx}/d\theta_t^{x}) = -\csc\beta_{WW} \sin\beta_{xx} \, (W/x) \, \partial\theta_W/\partial\theta_x        (274)

Equation (274) can be rewritten using equation (39) to give the following approximation for slowly varying \beta_{xx} and \beta_{tt}^{x}

    -(m x/t^2) \csc\beta_{vxvx} \sin^2\beta_{tt}^{x} \csc\beta_{xx} \, (d\theta_x/d\theta_t^{x})(d\theta_x/d\theta_t^{x} - 1) \approx -\csc\beta_{WW} \sin\beta_{xx} \, (W/x) \, \partial\theta_W/\partial\theta_x        (275)

Combining equations (249) and (275) gives the further approximation

    -(m x/t^2) \csc\beta_{vxvx} \sin^2\beta_{tt}^{x} \csc\beta_{xx} \, (d\theta_x/d\theta_t^{x})(d\theta_x/d\theta_t^{x} - 1) \approx -(W/x) \, \partial\theta_W/\partial\theta_x        (276)

Equations (110) and (236) give an equivalent form of Newton's law of motion

    -m t^{-1} \sec\beta_{vxvx} \sin\beta_{tt}^{x} \, d/d\theta_t^{x}(\sin\beta_{tt}^{x} \csc\beta_{xx} \, (x/t) \, d\theta_x/d\theta_t^{x}) = -\csc\beta_{WW} \sin\beta_{xx} \, (W/x) \, \partial\theta_W/\partial\theta_x        (277)

Equations (274) and (277) are equivalent to

    -m t^{-1} (C_{xt}^2 + D_{xt}^2)^{1/2} \sin\beta_{tt}^{x} = -\csc\beta_{WW} \sin\beta_{xx} \, (W/x) \, \partial\theta_W/\partial\theta_x \approx -(W/x) \, \partial\theta_W/\partial\theta_x        (278)

where C_{xt} and D_{xt} are given by equations (112) through (116). The corresponding phase angle equations (111), (237) and (242) give

    \theta_x + \beta_{vxvx} + \beta_{xx} - 2(\theta_t^{x} + \beta_{tt}^{x}) + \pi = \theta_W + \beta_{WW} - \theta_x - \beta_{xx}        (279)

which is valid for nearly coherent space and nearly coherent time.

The case of coherent spacetime corresponds to equation (272), and equation (276) becomes

    -m x t^{-2} \csc\beta_{vxvx}^{c} \, (d\theta_x/d\theta_t^{x})(d\theta_x/d\theta_t^{x} - 1) = -(W/x) \, \partial\theta_W/\partial\theta_x        (280)

which can be rewritten using equations (125) through (128) as

    -m x t^{-2} [(E_{xt}^{c})^2 + (F_{xt}^{c})^2]^{1/2} = -(W/x) \, \partial\theta_W/\partial\theta_x        (281)

where E_{xt}^{c} and F_{xt}^{c} are given by equations (126) and (127). Equation (281) is the coherent spacetime limit of equation (278). The phase angle condition for Newton's law of motion in coherent spacetime is obtained from equations (134), (237) and (242) or directly from equation (279) to be

    \theta_{ax}^{c} = \theta_x - 2\theta_t^{x} + \delta_{xt} = \theta_W - \theta_x        (282)

where \delta_{xt} is given by equations (131) or (132). Equivalently, equation (282) can be rewritten as

    2(\theta_x - \theta_t^{x}) + \delta_{xt} = \theta_W        (283)

Equations (281) through (283) are equivalent to the complex number form of Newton's dynamical law of motion given in equation (230), which for coherent spacetime is written as

    (m x/t^2) [(d\theta_x/d\theta_t^{x})(d\theta_x/d\theta_t^{x} - 1) - j \, d^2\theta_x/d(\theta_t^{x})^2] = -(W/x) \, \partial\theta_W/\partial\theta_x        (284)

For a free particle moving in coherent spacetime

    \partial\theta_W/\partial\theta_x = 0        (285)

and two possible solutions to equation (284) can be found

    \theta_x = c_1        (286)

    \theta_x = \theta_t^{x} + c_2        (287)

where c_1 and c_2 are constants. These solutions can also be deduced from equation (281). Equation (286) represents a state of rest for internal motion, and equation (287) represents a state of uniform motion in internal spacetime.
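The two free-particle solutions can be checked against the bracket in equation (284) with a short numerical sketch. It assumes fixed magnitudes x and t, so that \bar{x} = x \exp(j\theta_x) and \bar{t} = t \exp(j\theta_t), and it differentiates with respect to \theta_t by nested central differences. The numerical values, the function names, and the quadratic "generic" path are illustrative choices, not taken from the paper.

```python
import cmath

X, T = 2.0, 0.5  # fixed coordinate magnitudes (illustrative values)

def xbar(theta_x, th_t):
    """Complex space coordinate for a given internal path theta_x(theta_t)."""
    return X * cmath.exp(1j * theta_x(th_t))

def dtbar(th_t):
    """d(tbar)/d(theta_t) for tbar = T exp(j theta_t) with T fixed."""
    return 1j * T * cmath.exp(1j * th_t)

def accel(theta_x, th_t, h=1e-4):
    """Complex d^2 xbar / d tbar^2 by nested central differences in theta_t."""
    def vel(u):
        return (xbar(theta_x, u + h) - xbar(theta_x, u - h)) / (2 * h) / dtbar(u)
    return (vel(th_t + h) - vel(th_t - h)) / (2 * h) / dtbar(th_t)

def bracket(d1, d2):
    """Bracket of equation (284) from d(theta_x)/d(theta_t) and its derivative."""
    return d1 * (d1 - 1.0) - 1j * d2

rest = lambda u: 0.4                # equation (286): theta_x = c1
uniform = lambda u: u + 0.4         # equation (287): theta_x = theta_t + c2
generic = lambda u: 0.5 * u ** 2    # a non-solution, for contrast

a_rest = accel(rest, 0.3)
a_uniform = accel(uniform, 0.3)
a_generic = accel(generic, 0.3)

# Closed form for the generic path at theta_t = 0.3 (first derivative 0.3,
# second derivative 1.0), including the phase factor exp[j(theta_x - 2 theta_t)].
closed = (X / T ** 2) * cmath.exp(1j * (0.045 - 0.6)) * bracket(0.3, 1.0)
print(abs(a_rest), abs(a_uniform), abs(a_generic - closed))
```

Both solution paths give a vanishing complex acceleration, while the generic path reproduces the bracket of equation (284) multiplied by (x/t^2) and a phase factor.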

E.  Simple  Harmonic  Oscillator. 

Perhaps the most elementary system in mechanics is the simple harmonic oscillator [4-6]. For broken symmetry spacetime the complex number potential energy for the simple harmonic oscillator is given by

    \bar{W} = (1/2) \bar{k} \bar{x}^2    W = (1/2) k x^2    \theta_W = \theta_k + 2\theta_x        (288)
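As a quick illustration of equation (288), the following sketch builds the complex potential from polar components and reads back its magnitude and internal phase angle. The function name and the numerical values are hypothetical and serve only to show that the magnitude is (1/2) k x^2 and the phase is \theta_k + 2\theta_x.

```python
import cmath

def complex_potential(k, theta_k, x, theta_x):
    """Return (1/2) kbar xbar^2 for kbar = k exp(j theta_k), xbar = x exp(j theta_x)."""
    kbar = cmath.rect(k, theta_k)
    xbar = cmath.rect(x, theta_x)
    return 0.5 * kbar * xbar ** 2

# Illustrative values; the phases are small enough that no 2*pi wrap occurs.
Wbar = complex_potential(3.0, 0.2, 1.5, 0.4)
W, theta_W = cmath.polar(Wbar)
print(W, theta_W)  # magnitude 0.5*3*1.5**2 and phase 0.2 + 2*0.4
```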


The  equations  of  motion  for  the  simple  harmonic  oscillator  are  now  considered 
for  four  limiting  spacetime  conditions. 

Case  a.  Incoherent  Space  and  Incoherent  Time. 


Equation (258) gives

    m \, d^2x/dt^2 + k x = 0        (289)


which  is  the  standard  equation  for  the  simple  harmonic  oscillator. 

Case  b.  Coherent  Space  and  Incoherent  Time. 

Equations (264) and (265) give

    -m [(E_{xt}^{ci})^2 + (F_{xt}^{ci})^2]^{1/2} + k = 0        (290)

    \kappa_{xt} = \theta_k        (291)

where E_{xt}^{ci} and F_{xt}^{ci} are given by equations (72) and (73).

Case c. Incoherent Space and Coherent Time.

Equations (270) and (271) give

    -m t^{-2} [(E_{xt}^{ic})^2 + (F_{xt}^{ic})^2]^{1/2} + k x = 0        (292)

    \beta_{vxvx}^{ic} - 2\theta_t^{x} = \theta_k        (293)

where E_{xt}^{ic} and F_{xt}^{ic} are given by equations (96) and (97).

Case d. Coherent Space and Coherent Time.

Equations (281) and (282) give

    -m t^{-2} [(E_{xt}^{c})^2 + (F_{xt}^{c})^2]^{1/2} + k = 0        (294)

    \delta_{xt} - 2\theta_t^{x} = \theta_k        (295)

where E_{xt}^{c} and F_{xt}^{c} are given by equations (126) and (127). It is clear from these cases that the simple harmonic oscillator can undergo internal spacetime motion.


F.  Measured  Force. 

The measured force is given by equations (43) through (47), (244) and (273) to be

    F_{mx} = F_x \cos\theta_{Fx} = F_x \cos\theta_{ax} = m a_{mx}        (296)
           = m \cos\beta_{tt}^{x} \sec\beta_{vxvx} \cos\theta_{ax} \, dv_x/dt        (297)
           = m \sin\beta_{tt}^{x} \csc\beta_{vxvx} \cos\theta_{ax} \, (v_x/t) \, d\theta_{vx}/d\theta_t^{x}        (298)

Combining equations (203), (296) and (297) gives the measured force as

    F_{mx} = m_{Fx} a_{conx} = m_{Fx} \, dv_{mx}/dt_m^{x}        (299)

with the effective mass m_{Fx} given by

    m_{Fx} = (m/h_x) \cos\beta_{tt}^{x} \sec\beta_{vxvx} \sec\theta_{vx} \cos\theta_t^{x} \cos\theta_{ax}        (300)

where h_x is defined in equation (193). For broken symmetry spacetime the effective mass m_{Fx} relates the measured force to the conventionally defined acceleration in the x direction, and m_{Fx} \neq m_{px} where m_{px} is given by equation (185). The measured time in the x direction is given by equation (8).

G. Conservation of Energy.

The obvious generalization of the law of conservation of energy to the case of broken symmetry spacetime is [4-6]

    \bar{p}^2/(2m) + \bar{W} = \bar{E}        (301)

which can be rewritten as two scalar equations

    p^2/(2m) \cos(2\theta_p) + W \cos\theta_W = E \cos\theta_E        (302)

    p^2/(2m) \sin(2\theta_p) + W \sin\theta_W = E \sin\theta_E        (303)

or equivalently as

    [p^2/(2m)]^2 + (p^2/m) W \cos(2\theta_p - \theta_W) + W^2 = E^2        (304)

    \tan(2\theta_p) = (E \sin\theta_E - W \sin\theta_W)/(E \cos\theta_E - W \cos\theta_W)        (305)

The value of the momentum magnitude obtained from equation (304) is

    p^2 = -2mW \cos(2\theta_p - \theta_W) + 2m [E^2 - W^2 \sin^2(2\theta_p - \theta_W)]^{1/2}        (306)

In this way p and \theta_p are obtained in terms of E, \theta_E, W and \theta_W. An approximate solution of equation (301) gives

    p^2/(2m) + W \approx E    2\theta_p \approx \theta_W \approx \theta_E        (307)

which assumes that each term in equation (301) has the same value of internal phase angle.
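The relations (304) through (306) can be verified numerically from the complex form (301). The sketch below is illustrative: the function name and the input values are hypothetical, and the values are chosen so that the positive root in equation (306) applies.

```python
import cmath
import math

def momentum(m, E, theta_E, W, theta_W):
    """Solve pbar^2/(2m) + Wbar = Ebar for the magnitude p and phase theta_p."""
    p2bar = 2.0 * m * (cmath.rect(E, theta_E) - cmath.rect(W, theta_W))
    p2, two_theta_p = cmath.polar(p2bar)
    return math.sqrt(p2), 0.5 * two_theta_p

# Illustrative inputs (arbitrary units).
m, E, theta_E, W, theta_W = 1.0, 5.0, 0.10, 2.0, 0.25
p, theta_p = momentum(m, E, theta_E, W, theta_W)

phi = 2.0 * theta_p - theta_W
kinetic = p ** 2 / (2.0 * m)

# Left side of equation (304); it should equal E^2.
lhs_304 = kinetic ** 2 + (p ** 2 / m) * W * math.cos(phi) + W ** 2

# Right side of equation (306); it should equal p^2.
p2_306 = -2.0 * m * W * math.cos(phi) + 2.0 * m * math.sqrt(E ** 2 - (W * math.sin(phi)) ** 2)

# Right side of equation (305); it should equal tan(2 theta_p).
ratio_305 = (E * math.sin(theta_E) - W * math.sin(theta_W)) / (
    E * math.cos(theta_E) - W * math.cos(theta_W))

print(lhs_304 - E ** 2, p2_306 - p ** 2, math.tan(2.0 * theta_p) - ratio_305)
```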

4.  CONCLUSION.  Broken  spacetime  symmetries  are  described  by  internal 
phase  angles  of  the  space  and  time  coordinates,  and  affect  the  basic  kinematic 
and  dynamical  equations  of  motion  of  Newtonian  mechanics.  The  kinematic  and 
dynamic  variables  such  as  particle  speed,  momentum,  acceleration,  force  and 
energy  must  also  have  internal  phase  angles  and  be  represented  as  complex 
numbers  in  an  internal  space.  The  internal  phase  angles  of  the  space  and  time 
coordinates  represent  additional  degrees  of  freedom  for  a  particle  and  allow 
an  internal  motion  to  occur  in  matter  for  fixed  magnitudes  of  the  space  and 
time  coordinates.  Elementary  mechanical  systems,  such  as  the  simple  harmonic 
oscillator,  can  experience  internal  motions  in  space  and  time. 
SLOW  AND  ULTRAFAST  WAVE  PROPAGATION  PROCESSES 


Richard  A.  Weiss 
ABSTRACT.  A  theory  of  wave  propagation  in  space  and  time  with  broken 
internal  symmetries  is  developed  from  which  the  special  cases  of  slow  and 
ultrafast  processes  associated  with  the  wave  propagation  can  be  obtained  as 
limiting  cases.  Physical  quantities  and  space  and  time  coordinates  are 
represented  as  complex  numbers  in  an  internal  space.  Ultrafast  processes  are 
associated  with  the  coherent  rotation  of  complex  number  physical  quantities  in 
an  internal  space  while  the  magnitudes  are  held  fixed.  Slow  processes  are 
associated  with  changes  in  the  magnitudes  of  the  physical  quantities .  Spacetime 
can  be  coherent  in  which  case  the  complex  number  space  and  time  coordinates 
change  by  rotations  in  an  internal  space  while  the  coordinate  magnitudes  remain 
fixed,  or  spacetime  can  be  incoherent  in  which  case  the  changes  in  spacetime 
coordinates  occur  as  variations  of  the  magnitudes  of  the  coordinates.  Eight 
possible  special  cases  of  wave  propagation  processes  in  asymmetric  spacetime  are 
delineated  according  to  whether  a  process  is  slow  or  ultrafast,  space  is 
incoherent  or  coherent,  and  time  is  incoherent  or  coherent.  The  ultrafast 
processes  described  in  this  paper  can  be  studied  by  femtosecond  laser  light 
pulses.  The  coherent  spacetime  condition  is  associated  with  the  superconducting 
state  of  a  high-Tc  compound  while  the  partially  coherent  spacetime  condition  is 
associated  with  the  normal  state  of  a  high-Tc  material. 

1. INTRODUCTION. The intense interest in ultrafast processes has been stimulated by the development of very short (femtosecond) laser pulses as a diagnostic tool for studying processes that occur on short time scales [1-12]. These processes include chemical reactions, optical dynamics of molecules, laser induced plasmas and their radiations, chemical explosions and many others [1-13]. Ultrafast laser sources have been developed that operate in the infrared, visible and ultraviolet regions of the electromagnetic spectrum. Incoherent femtosecond x ray emissions have been observed from laser induced plasmas, and can be used for studying ultrafast atomic and molecular processes in gases, liquids and solids [1-8]. The dynamical behavior of matter at femtosecond time scales gives a picture of molecular dynamics such as occurs in relaxation and transport phenomena [1-12]. These studies can be made at surfaces and interfaces as well as within bulk matter. Specific phenomena that can be studied include: fluid transport, wave propagation and phonon interactions, diffusion and heat flow, electromagnetic emissions, vibrations, adsorption and desorption, phase transitions and chemical reactions such as detonations [1-13]. Processes that occur in supernova explosions such as the rapid neutron capture process, neutrino emissions and shock wave transmission also occur on short time scales [14-18]. All of the above processes can occur in matter that is located in incoherent spacetime, which appears in ordinary matter, or in coherent spacetime, which occurs in the high-Tc superconducting state of matter or in matter located in very strong electromagnetic or gravitational fields. This paper studies the propagation of waves that are associated with short time scale energy transfers, and develops the forms taken by the wave equation for slow and ultrafast wave propagation processes that occur in space and time that have broken internal symmetries due to a special structure of matter or to the presence of an external field such as gravitation or electromagnetism.

In the presence of external fields or in the vicinity of a peculiar atomic or molecular structure, the space and time coordinates must be represented as complex numbers as follows [17]

    \bar{a} = a \exp(j\theta_a)        (1)

where a = x, y, z, and

    \bar{t} = t \exp(j\theta_t)        (2)

All physical quantities, with the exception of the light speed in the vacuum, have broken internal symmetries and must be represented as complex numbers in an internal space [17]. This includes, for example, pressure, entropy, energy and magnetic and electric field strengths. Therefore for the case of wave propagation the amplitude of the waves must be represented as a complex number in internal space as follows

    \bar{\Psi} = \Psi \exp(j\theta_\Psi)        (3)

Strictly speaking, the value of the internal phase angle of the time is associated with the particular physical quantity which is varying with time, so that for the case at hand

    \bar{t} = t \exp(j\theta_t^{\Psi})        (4)


where \theta_t^{\Psi} = internal phase angle of time that is associated with the time variation of the wave function \Psi. Space is taken to be homogeneous so that the internal phase angle of time \theta_t^{\Psi} is independent of the internal phase angles \theta_a of the space coordinates. In other cases, such as particle dynamics and kinematics, the space and time coordinates are not independent and the internal phase angles of the space coordinates \theta_a are each associated with a corresponding internal phase angle of time \theta_t^{a} for a = x, y, z with \partial\theta_a/\partial\theta_t^{a} \neq 0. But for the case of wave propagation the time and space coordinates are taken to be independent parameters so that \theta_t^{\Psi} and \theta_a are unrelated quantities with \partial\theta_a/\partial\theta_t^{\Psi} = 0. In general

    \Psi = \Psi(a, \theta_a, t, \theta_t^{\Psi})        (5)

    \theta_\Psi = \theta_\Psi(a, \theta_a, t, \theta_t^{\Psi})        (6)

where a = x, y, z. From equations (1), (2) and (3) it follows that

    d\bar{a} = \sec\beta_{aa} \, da \, \exp[j(\theta_a + \beta_{aa})]        (7)
             = \csc\beta_{aa} \, a \, d\theta_a \, \exp[j(\theta_a + \beta_{aa})]        (8)

    d\bar{t} = \sec\beta_{tt}^{\Psi} \, dt \, \exp[j(\theta_t^{\Psi} + \beta_{tt}^{\Psi})]        (9)
             = \csc\beta_{tt}^{\Psi} \, t \, d\theta_t^{\Psi} \, \exp[j(\theta_t^{\Psi} + \beta_{tt}^{\Psi})]        (10)

    d\bar{\Psi} = \sec\beta_{\Psi\Psi} \, d\Psi \, \exp[j(\theta_\Psi + \beta_{\Psi\Psi})]        (11)
                = \csc\beta_{\Psi\Psi} \, \Psi \, d\theta_\Psi \, \exp[j(\theta_\Psi + \beta_{\Psi\Psi})]        (12)

where

    \tan\beta_{aa} = a \, \partial\theta_a/\partial a        (13)

    \tan\beta_{tt}^{\Psi} = t \, \partial\theta_t^{\Psi}/\partial t        (14)

    \tan\beta_{\Psi\Psi} = \Psi \, \partial\theta_\Psi/\partial\Psi        (15)

These expressions will be used in Sections 2 and 3 to obtain the first and second derivatives of the wave function with respect to the space and time coordinates.
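The differential forms (7) and (8) follow from differentiating \bar{a} = a \exp(j\theta_a) and using equation (13), and they can be checked numerically. In the sketch below the path a(s), \theta_a(s) and the evaluation point are arbitrary illustrative choices; derivatives along the path are taken by central differences.

```python
import cmath
import math

# Hypothetical path for the magnitude and internal phase angle of a coordinate.
a = lambda s: 1.0 + 0.5 * s
theta_a = lambda s: 0.3 * s ** 2

def abar(s):
    """Complex coordinate abar = a exp(j theta_a)."""
    return cmath.rect(a(s), theta_a(s))

s0, h = 0.7, 1e-6
da = (a(s0 + h) - a(s0 - h)) / (2 * h)                    # d(a)/ds
dtheta = (theta_a(s0 + h) - theta_a(s0 - h)) / (2 * h)    # d(theta_a)/ds

# Equation (13): tan(beta_aa) = a d(theta_a)/d(a).
beta = math.atan2(a(s0) * dtheta, da)

# Direct derivative of the complex coordinate along the path.
lhs = (abar(s0 + h) - abar(s0 - h)) / (2 * h)

# Equations (7) and (8), written per unit path parameter.
phase = cmath.exp(1j * (theta_a(s0) + beta))
rhs_7 = da / math.cos(beta) * phase
rhs_8 = a(s0) * dtheta / math.sin(beta) * phase

print(abs(lhs - rhs_7), abs(lhs - rhs_8))
```

The same check applies to equations (9) through (12) after replacing the coordinate by the time or by the wave function.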

This paper develops a general theory of wave propagation in space and time that have broken internal symmetries. The wave function must also be represented as a complex number in internal space, and therefore the possibility exists in nature of having slow wave propagation processes in which the wave function magnitude changes in space and time, and ultrafast wave propagation processes in which the wave function rotates in internal space with a constant magnitude. The space and time coordinates themselves can also change in an incoherent and a coherent manner, so that in fact there are eight possible limiting cases of wave propagation in asymmetric spacetime. The paper is organized as follows: Section 2 evaluates the first and second derivatives of the wave function with respect to space and time coordinates; Section 3 develops the general forms of the wave equations that can be specialized to the eight limiting cases of slow and ultrafast processes, coherent and incoherent space, and coherent and incoherent time; and Section 4 gives the solutions to the eight limiting types of wave equations.

2. SPACE AND TIME DERIVATIVES. This section evaluates the space and time derivatives that enter the wave equation for space and time with broken internal symmetries. The wave equation for space and time with broken internal symmetries is written as the following generalization of the standard scalar wave equation [18-21]

    \sum_a \partial^2\bar{\Psi}/\partial\bar{a}^2 = \bar{c}^{-2} \, \partial^2\bar{\Psi}/\partial\bar{t}^2        (16)

where the sum is over a = x, y, z and where the complex number space and time coordinates \bar{a} and \bar{t} are given by equations (1) and (2) respectively. The complex number wave function is written as

    \bar{\Psi} = \Psi \exp(j\theta_\Psi)        (17)

and the complex number wave speed is written as

    \bar{c} = c \exp(j\theta_c)        (18)

The complex number wave speed for sound waves can be written as [17]

    \bar{c}^2 = \bar{K}/\bar{\rho} = d\bar{P}/d\bar{\rho}        (19)

where \bar{K} = complex number bulk modulus, and \bar{\rho} = mass density. For electromagnetic waves in matter the wave speed is written as [25]

    \bar{c}^2 = (\bar{\epsilon}\bar{\mu})^{-1}    c^2 = (\epsilon\mu)^{-1}    2\theta_c = -\theta_\epsilon - \theta_\mu        (20)

where \bar{\epsilon} and \bar{\mu} = complex number electric permittivity and magnetic permeability respectively. For electromagnetic waves in the vacuum with no external fields present \theta_c = 0 [17,25]. In general for wave propagation in matter \theta_c is a small number. In order to delineate the various wave equations derived from equation (16) for the cases of slow and ultrafast processes and for various types of space and time variations, it is necessary to evaluate the second derivatives of the wave function with respect to space and time as are required by equation (16).
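The magnitude and phase relations in equation (20) can be illustrated with a few lines of complex arithmetic. The function name and the input values below are hypothetical (arbitrary units, small internal phase angles) and are only meant to show that c = (\epsilon\mu)^{-1/2} and 2\theta_c = -\theta_\epsilon - \theta_\mu.

```python
import cmath
import math

def wave_speed(eps, theta_eps, mu, theta_mu):
    """Return (c, theta_c) from cbar^2 = 1/(epsbar * mubar)."""
    c2bar = 1.0 / (cmath.rect(eps, theta_eps) * cmath.rect(mu, theta_mu))
    c2, two_theta_c = cmath.polar(c2bar)
    return math.sqrt(c2), 0.5 * two_theta_c

# Illustrative magnitudes and internal phase angles.
c, theta_c = wave_speed(4.0, 0.02, 1.0, 0.01)
print(c, theta_c)  # magnitude (4*1)**-0.5 and phase -(0.02 + 0.01)/2
```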

A.  First  Derivatives  with  Respect  to  Space  and  Time. 

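The first derivatives of the wave function with respect to space and time are written as

    \bar{v}_a = v_a \exp(j\theta_{va}) = \partial\bar{\Psi}/\partial\bar{a}        (21)

    \bar{u} = u \exp(j\theta_u) = \partial\bar{\Psi}/\partial\bar{t}        (22)

where a = x, y, z. For the space derivatives in equation (21) the magnitudes v_a can be written as

    v_a = \sec\beta_{\Psi\Psi} \cos\beta_{aa} \, \partial\Psi/\partial a        (23)
        = \csc\beta_{\Psi\Psi} \cos\beta_{aa} \, \Psi \, \partial\theta_\Psi/\partial a        (24)
        = \sec\beta_{\Psi\Psi} \sin\beta_{aa} \, a^{-1} \, \partial\Psi/\partial\theta_a        (25)
        = \csc\beta_{\Psi\Psi} \sin\beta_{aa} \, (\Psi/a) \, \partial\theta_\Psi/\partial\theta_a        (26)

and the internal phase angles as

    \theta_{va} = \theta_\Psi + \beta_{\Psi\Psi} - \theta_a - \beta_{aa}        (27)

where \beta_{aa} and \beta_{\Psi\Psi} are given by equations (13) and (15) respectively. The magnitude of the time derivative that appears in equation (22) is written as

    u = \sec\beta_{\Psi\Psi} \cos\beta_{tt}^{\Psi} \, \partial\Psi/\partial t        (28)
      = \csc\beta_{\Psi\Psi} \cos\beta_{tt}^{\Psi} \, \Psi \, \partial\theta_\Psi/\partial t        (29)
      = \sec\beta_{\Psi\Psi} \sin\beta_{tt}^{\Psi} \, t^{-1} \, \partial\Psi/\partial\theta_t^{\Psi}        (30)
      = \csc\beta_{\Psi\Psi} \sin\beta_{tt}^{\Psi} \, (\Psi/t) \, \partial\theta_\Psi/\partial\theta_t^{\Psi}        (31)

while the internal phase angle is given by

    \theta_u = \theta_\Psi + \beta_{\Psi\Psi} - \theta_t^{\Psi} - \beta_{tt}^{\Psi}        (32)

where \beta_{tt}^{\Psi} is given by equation (14) and \theta_t^{\Psi} = internal phase angle of time that is associated with the time variation of the wave amplitude \Psi.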


Four limiting forms of the first derivative of the wave function with respect to the spatial coordinates will now be considered.

Case 1. Slow Process and Incoherent Space.

This is described by

    \theta_\Psi = 0    \beta_{\Psi\Psi} = 0    \theta_a = 0    \beta_{aa} = 0        (33)

Then equations (23) and (27) give

    v_a^{si} = \partial\Psi/\partial a    \theta_{va}^{si} = 0        (34)

Case 2. Ultrafast Process and Incoherent Space.

This case is given by

    \beta_{\Psi\Psi} = \pi/2    \theta_a = 0    \beta_{aa} = 0        (35)

Equations (24) and (27) become for this case

    v_a^{ui} = \Psi \, \partial\theta_\Psi/\partial a    \theta_{va}^{ui} = \theta_\Psi + \pi/2        (36)

or equivalently

    \bar{v}_a^{ui} = j \bar{\Psi} \, \partial\theta_\Psi/\partial a        (37)

Case 3. Slow Process and Coherent Space.

The following conditions hold for this case

    \theta_\Psi = 0    \beta_{\Psi\Psi} = 0    \beta_{aa} = \pi/2        (38)

and equations (25) and (27) become

    v_a^{sc} = a^{-1} \, \partial\Psi/\partial\theta_a    \theta_{va}^{sc} = -\theta_a - \pi/2        (39)

or equivalently

    \bar{v}_a^{sc} = -(j/\bar{a}) \, \partial\Psi/\partial\theta_a        (40)

Case 4. Ultrafast Process and Coherent Space.

This case is described by

    \beta_{\Psi\Psi} = \pi/2    \beta_{aa} = \pi/2        (41)

and equations (26) and (27) give

    v_a^{uc} = (\Psi/a) \, \partial\theta_\Psi/\partial\theta_a    \theta_{va}^{uc} = \theta_\Psi - \theta_a        (42)

or equivalently

    \bar{v}_a^{uc} = (\bar{\Psi}/\bar{a}) \, \partial\theta_\Psi/\partial\theta_a        (43)

These are the four limiting cases associated with the first derivative with respect to the spatial coordinates.

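Now the four limiting conditions will be given for the first derivative of the wave function with respect to time.

Case 1. Slow Process and Incoherent Time.

This case is given by

    \theta_\Psi = 0    \beta_{\Psi\Psi} = 0    \theta_t^{\Psi} = 0    \beta_{tt}^{\Psi} = 0        (44)

and equations (28) and (32) become

    u^{si} = \partial\Psi/\partial t    \theta_u^{si} = 0        (45)

Case 2. Ultrafast Process and Incoherent Time.

This case is described by

    \beta_{\Psi\Psi} = \pi/2    \theta_t^{\Psi} = 0    \beta_{tt}^{\Psi} = 0        (46)

Equations (29) and (32) then give

    u^{ui} = \Psi \, \partial\theta_\Psi/\partial t    \theta_u^{ui} = \theta_\Psi + \pi/2        (47)

or

    \bar{u}^{ui} = j \bar{\Psi} \, \partial\theta_\Psi/\partial t        (48)

Case 3. Slow Process in Coherent Time.

The following conditions are valid for this case

    \theta_\Psi = 0    \beta_{\Psi\Psi} = 0    \beta_{tt}^{\Psi} = \pi/2        (49)

while equations (30) and (32) give

    u^{sc} = t^{-1} \, \partial\Psi/\partial\theta_t^{\Psi}    \theta_u^{sc} = -\theta_t^{\Psi} - \pi/2        (50)

or

    \bar{u}^{sc} = -(j/\bar{t}) \, \partial\Psi/\partial\theta_t^{\Psi}        (51)

Case 4. Ultrafast Process in Coherent Time.

This case is described by the following conditions

    \beta_{\Psi\Psi} = \pi/2    \beta_{tt}^{\Psi} = \pi/2        (52)

and equations (31) and (32) give

    u^{uc} = (\Psi/t) \, \partial\theta_\Psi/\partial\theta_t^{\Psi}    \theta_u^{uc} = \theta_\Psi - \theta_t^{\Psi}        (53)

which can be rewritten as

    \bar{u}^{uc} = (\bar{\Psi}/\bar{t}) \, \partial\theta_\Psi/\partial\theta_t^{\Psi}        (54)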

B.  Second  Derivatives  with  Respect  to  Space. 

The second derivatives of the wave function with respect to the spatial coordinates will now be represented in four general forms which can be specialized to four limiting cases of physical interest corresponding to slow and fast wave propagation processes in incoherent and coherent space. The second derivative of the wave function with respect to space coordinates is written as

    \bar{\xi}_a = \xi_a \exp(j\theta_{\xi a}) = \partial^2\bar{\Psi}/\partial\bar{a}^2 = \partial\bar{v}_a/\partial\bar{a}        (55)

where a = x, y, z, and where \bar{v}_a is defined in equation (21).

Case 1. Slow Process in Incoherent Space.

A general expression for the second spatial derivative of the wave function will be derived which can be used to pass to the limit of a slow process in incoherent space, which is described by

    \theta_\Psi = 0    \beta_{\Psi\Psi} = 0    \theta_a = 0    \beta_{aa} = 0        (56)

for a = x, y and z. Equations (23) and (55) give
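    \xi_a = \sec\beta_{vava} \cos\beta_{aa} \, \partial v_a/\partial a        (57)
          = \sec\beta_{vava} \cos\beta_{aa} \, \partial/\partial a(\sec\beta_{\Psi\Psi} \cos\beta_{aa} \, \partial\Psi/\partial a)        (58)
          \approx \sec\beta_{vava} \cos^2\beta_{aa} \sec\beta_{\Psi\Psi} \, \partial^2\Psi/\partial a^2        (59)

while equations (27) and (55) give

    \theta_{\xi a} = \theta_{va} + \beta_{vava} - \theta_a - \beta_{aa}        (60)
                   = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{vava} - 2(\theta_a + \beta_{aa})        (61)

where \beta_{vava} is given by

    \tan\beta_{vava} = v_a \, \partial\theta_{va}/\partial v_a        (62)

where v_a is given by equation (23) and \theta_{va} by equation (27). For this case \theta_{\xi a} is a small number. In the limiting case of a slow process in incoherent space equation (56) is valid and equations (58) and (61) become

    \xi_a^{si} = \partial^2\Psi/\partial a^2    \theta_{\xi a}^{si} = 0        (63)

which is the conventional result.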

Case  2.  Ultrafast  Process  in  Incoherent  Space. 

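This section derives a general equation for the second derivatives of the wave function with respect to the space coordinates, which can be utilized to obtain the limiting case of an ultrafast process in incoherent space, which is defined by

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \theta_\alpha = 0 \qquad \beta_{\alpha\alpha} = 0 $$

where $\theta_\Psi$ is now a variable. From equations (24) and (55) it follows that

$$ \xi_\alpha = \csc\beta_{v\alpha v\alpha}\cos\beta_{\alpha\alpha}\,v_\alpha\,\partial\theta_{v\alpha}/\partial\alpha $$
$$ \quad = \csc\beta_{v\alpha v\alpha}\cos^2\beta_{\alpha\alpha}\csc\beta_{\Psi\Psi}\,\Psi\,\partial\theta_\Psi/\partial\alpha\;\partial\theta_{v\alpha}/\partial\alpha \tag{66} $$

where $\beta_{v\alpha v\alpha}$ is given by equation (62) with $v_\alpha$ given by equation (24) and where equation (27) gives

$$ \partial\theta_{v\alpha}/\partial\alpha = \partial/\partial\alpha\,(\theta_\Psi + \beta_{\Psi\Psi} - \theta_\alpha - \beta_{\alpha\alpha}) \tag{67} $$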


The  corresponding  internal  phase  angle  for  the  second  spatial  derivative  is 
given  by 


(68) 


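$$ \csc\beta^{ui}_{v\alpha v\alpha} = [(E^{ui}_{\Psi\alpha})^2 + (F^{ui}_{\Psi\alpha})^2]^{1/2}/E^{ui}_{\Psi\alpha} \tag{81} $$

Equations (76) and (81) give

$$ \xi^{ui}_\alpha = \Psi[(E^{ui}_{\Psi\alpha})^2 + (F^{ui}_{\Psi\alpha})^2]^{1/2} \tag{82} $$

For the signs chosen in equations (79) and (80) it follows from equation (78) that

$$ \beta^{ui}_{v\alpha v\alpha} = \pi/2 + \kappa_{\Psi\alpha} \tag{83} $$

where $\kappa_{\Psi\alpha}$ is a small positive number defined by

$$ \tan\kappa_{\Psi\alpha} = |F^{ui}_{\Psi\alpha}|/E^{ui}_{\Psi\alpha} \tag{84} $$

Equations (77) and (83) give

$$ \theta^{ui}_{\xi\alpha} = \theta_\Psi + \kappa_{\Psi\alpha} + \pi \tag{85} $$

Finally equations (71) and (72) give

$$ \xi^{ui+}_\alpha = -\Psi[(E^{ui}_{\Psi\alpha})^2 + (F^{ui}_{\Psi\alpha})^2]^{1/2} \tag{86} $$

$$ \theta^{ui+}_{\xi\alpha} = \theta_\Psi + \kappa_{\Psi\alpha} \tag{87} $$

so that $\theta^{ui+}_{\xi\alpha}$ is a small number.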

Case  3.  Slow  Process  in  Coherent  Space. 


An  expression  is  derived  for  the  second  derivative  of  the  wave  function 
with  respect  to  the  spatial  coordinates,  which  can  be  used  to  pass  to  the  limit 
of  a  slow  process  in  coherent  space  whose  characteristics  are 


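$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \beta_{\alpha\alpha} = \pi/2 \tag{88} $$

where $\theta_\alpha$ is variable. From equations (55), (25) and (61) it follows that

$$ \xi_\alpha = \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial v_\alpha/\partial\theta_\alpha \tag{89} $$
$$ \quad = \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial/\partial\theta_\alpha\,(\sec\beta_{\Psi\Psi}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial\Psi/\partial\theta_\alpha) \tag{90} $$

$$ \theta_{\xi\alpha} = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) \tag{91} $$

where $\beta_{v\alpha v\alpha}$ is given by (25), (27) and (62). In this case it is convenient to introduce another representation of the second spatial derivative, namely

$$ \bar{\xi}_\alpha = \xi_\alpha\exp(j\theta_{\xi\alpha}) = \xi'_\alpha\exp(j\theta'_{\xi\alpha}) = \partial^2\bar{\Psi}/\partial\bar{\alpha}^2 = \partial\bar{v}_\alpha/\partial\bar{\alpha} \tag{92} $$

where

$$ \xi'_\alpha = -\xi_\alpha \tag{93} $$

$$ \theta'_{\xi\alpha} = \theta_{\xi\alpha} + \pi \tag{94} $$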


The  limiting  case  of  a  slow  process  in  coherent  space  is  obtained  from 
equation  (88)  which  combined  with  equations  (90)  and  (91)  gives 


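$$ \xi^{sc}_\alpha = \sec\beta^{sc}_{v\alpha v\alpha}\,\alpha^{-2}\,\partial^2\Psi/\partial\theta_\alpha^2 \tag{95} $$

$$ \theta^{sc}_{\xi\alpha} = \beta^{sc}_{v\alpha v\alpha} - 2\theta_\alpha - \pi \tag{96} $$

Equations (39) and (62) give

$$ \tan\beta^{sc}_{v\alpha v\alpha} = E^{sc}_{\Psi\alpha}/F^{sc}_{\Psi\alpha} \tag{97} $$

where

$$ E^{sc}_{\Psi\alpha} = -\,\partial\Psi/\partial\theta_\alpha > 0 \tag{98} $$

$$ F^{sc}_{\Psi\alpha} = \partial^2\Psi/\partial\theta_\alpha^2 > 0 \tag{99} $$

$$ \sec\beta^{sc}_{v\alpha v\alpha} = [(E^{sc}_{\Psi\alpha})^2 + (F^{sc}_{\Psi\alpha})^2]^{1/2}/F^{sc}_{\Psi\alpha} \tag{100} $$

and therefore $\beta^{sc}_{v\alpha v\alpha}$ is a small positive angle. Equations (95), (98) and (100) give

$$ \xi^{sc}_\alpha = \alpha^{-2}[(E^{sc}_{\Psi\alpha})^2 + (F^{sc}_{\Psi\alpha})^2]^{1/2} \tag{101} $$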


The  alternative  description  of  the  second  derivative  with  respect  to  space  is 
obtained  from  equations  (93),  (94),  (96)  and  (101)  as 


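$$ \xi'^{\,sc}_\alpha = -\alpha^{-2}[(E^{sc}_{\Psi\alpha})^2 + (F^{sc}_{\Psi\alpha})^2]^{1/2} \tag{102} $$

$$ \theta'^{\,sc}_{\xi\alpha} = \beta^{sc}_{v\alpha v\alpha} - 2\theta_\alpha \tag{103} $$

where $\theta'^{\,sc}_{\xi\alpha}$ is seen to be a small angle.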


Case  4.  Ultrafast  Process  and  Coherent  Space. 


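This section develops a representation for the second derivative of the wave function with respect to the space coordinates which can be utilized to attain the limit of an ultrafast process in coherent space, whose description is

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \beta_{\alpha\alpha} = \pi/2 \tag{104} $$

The magnitude and internal phase angle of the second derivative are obtained from equations (26), (55) and (61) to be

$$ \xi_\alpha = \csc\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,(v_\alpha/\alpha)\,\partial\theta_{v\alpha}/\partial\theta_\alpha \tag{105} $$
$$ \quad = \csc\beta_{v\alpha v\alpha}\sin^2\beta_{\alpha\alpha}\csc\beta_{\Psi\Psi}\,(\Psi/\alpha^2)\,\partial\theta_\Psi/\partial\theta_\alpha\;\partial\theta_{v\alpha}/\partial\theta_\alpha \tag{106} $$

$$ \theta_{\xi\alpha} = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) \tag{107} $$

where $\beta_{v\alpha v\alpha}$ is given by equations (26) and (62), and where from equation (27) it follows that

$$ \partial\theta_{v\alpha}/\partial\theta_\alpha = \partial\theta_\Psi/\partial\theta_\alpha - 1 + \partial/\partial\theta_\alpha\,(\beta_{\Psi\Psi} - \beta_{\alpha\alpha}) \tag{108} $$

A different expression for the second derivative can be obtained from equations (89) and (26) and is

$$ \xi_\alpha = \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial v_\alpha/\partial\theta_\alpha \tag{109} $$
$$ \quad = \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial/\partial\theta_\alpha\,(\csc\beta_{\Psi\Psi}\sin\beta_{\alpha\alpha}\,(\Psi/\alpha)\,\partial\theta_\Psi/\partial\theta_\alpha) \tag{110} $$

A comparison of equations (106) and (110) gives

$$ \tan\beta_{v\alpha v\alpha} = C_{\Psi\alpha}/D_{\Psi\alpha} \tag{111} $$

$$ \csc\beta_{v\alpha v\alpha} = (C_{\Psi\alpha}^2 + D_{\Psi\alpha}^2)^{1/2}/C_{\Psi\alpha} \tag{112} $$

$$ \sec\beta_{v\alpha v\alpha} = (C_{\Psi\alpha}^2 + D_{\Psi\alpha}^2)^{1/2}/D_{\Psi\alpha} \tag{113} $$

where

$$ C_{\Psi\alpha} = \sin\beta_{\alpha\alpha}\csc\beta_{\Psi\Psi}\,(\Psi/\alpha)\,\partial\theta_\Psi/\partial\theta_\alpha\;\partial\theta_{v\alpha}/\partial\theta_\alpha \tag{114} $$

$$ D_{\Psi\alpha} = \partial/\partial\theta_\alpha\,(\csc\beta_{\Psi\Psi}\sin\beta_{\alpha\alpha}\,(\Psi/\alpha)\,\partial\theta_\Psi/\partial\theta_\alpha) \tag{115} $$

where $\partial\theta_{v\alpha}/\partial\theta_\alpha$ is given by equation (108) and where $C_{\Psi\alpha} < 0$. Therefore,

$$ \xi_\alpha = C_{\Psi\alpha}\,\alpha^{-1}\csc\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha} \tag{116} $$
$$ \quad = D_{\Psi\alpha}\,\alpha^{-1}\sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha} \tag{117} $$

which can be rewritten as

$$ \xi_\alpha = (C_{\Psi\alpha}^2 + D_{\Psi\alpha}^2)^{1/2}\,\alpha^{-1}\sin\beta_{\alpha\alpha} \tag{118} $$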


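It is convenient for the case at hand to write the second derivative of the wave function with respect to space coordinates in equation (55) as

$$ \bar{\xi}_\alpha = \xi_\alpha\exp(j\theta_{\xi\alpha}) = \xi'_\alpha\exp(j\theta'_{\xi\alpha}) = \partial^2\bar{\Psi}/\partial\bar{\alpha}^2 = \partial\bar{v}_\alpha/\partial\bar{\alpha} \tag{119} $$

where

$$ \xi'_\alpha = -\xi_\alpha \tag{120} $$

$$ \theta'_{\xi\alpha} = \theta_{\xi\alpha} + \pi \tag{121} $$

which gives a useful alternative description of the second derivative with respect to space.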


For  an  ultrafast  process  in  coherent  space  described  by  equation  (104)  it 
follows  from  equations  (106),  (110),  (118)  and  (120)  that 


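$$ \xi'^{\,uc}_\alpha = -\csc\beta^{uc}_{v\alpha v\alpha}\,(\Psi/\alpha^2)\,E^{uc}_{\Psi\alpha} \tag{122} $$
$$ \quad = -\sec\beta^{uc}_{v\alpha v\alpha}\,(\Psi/\alpha^2)\,F^{uc}_{\Psi\alpha} \tag{123} $$
$$ \quad = -\alpha^{-1}[(C^{uc}_{\Psi\alpha})^2 + (D^{uc}_{\Psi\alpha})^2]^{1/2} \tag{124} $$
$$ \quad = -(\Psi/\alpha^2)[(E^{uc}_{\Psi\alpha})^2 + (F^{uc}_{\Psi\alpha})^2]^{1/2} \tag{125} $$

where

$$ E^{uc}_{\Psi\alpha} = \partial\theta_\Psi/\partial\theta_\alpha\,(\partial\theta_\Psi/\partial\theta_\alpha - 1) < 0 \tag{126} $$

$$ F^{uc}_{\Psi\alpha} = \partial^2\theta_\Psi/\partial\theta_\alpha^2 > 0 \tag{127} $$

$$ C^{uc}_{\Psi\alpha} = (\Psi/\alpha)\,E^{uc}_{\Psi\alpha} \tag{128} $$

$$ D^{uc}_{\Psi\alpha} = (\Psi/\alpha)\,F^{uc}_{\Psi\alpha} \tag{129} $$

From equations (42) and (62), or directly from equation (111), it follows that

$$ \tan\beta^{uc}_{v\alpha v\alpha} = E^{uc}_{\Psi\alpha}/F^{uc}_{\Psi\alpha} \tag{130} $$

$$ \csc\beta^{uc}_{v\alpha v\alpha} = [(E^{uc}_{\Psi\alpha})^2 + (F^{uc}_{\Psi\alpha})^2]^{1/2}/E^{uc}_{\Psi\alpha} \tag{131} $$

$$ \sec\beta^{uc}_{v\alpha v\alpha} = [(E^{uc}_{\Psi\alpha})^2 + (F^{uc}_{\Psi\alpha})^2]^{1/2}/F^{uc}_{\Psi\alpha} \tag{132} $$

Because of the choice of signs in equations (126) and (127) it follows that

$$ \beta^{uc}_{v\alpha v\alpha} = -\pi/2 + \delta_{\Psi\alpha} \tag{133} $$

where $\delta_{\Psi\alpha}$ is a positive angle given by

$$ \tan\delta_{\Psi\alpha} = F^{uc}_{\Psi\alpha}/|E^{uc}_{\Psi\alpha}| \tag{134} $$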


The  internal  phase  angle  of  the  second  derivative  then  follows  from  equations 
(104),  (107),  (121)  and  (133)  as 


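$$ \theta'^{\,uc}_{\xi\alpha} = \theta_\Psi + \beta^{uc}_{v\alpha v\alpha} - 2\theta_\alpha + \pi/2 \tag{135} $$
$$ \quad = \theta_\Psi - 2\theta_\alpha + \delta_{\Psi\alpha} \tag{136} $$

so that $\theta'^{\,uc}_{\xi\alpha}$ is a small number. The complex number second derivative with respect to space coordinates that corresponds to equations (125) and (136) is given by

$$ \bar{\xi}^{uc}_\alpha = (\partial^2\bar{\Psi}/\partial\bar{\alpha}^2)^{uc} = (\bar{\Psi}/\bar{\alpha}^2)\,[\partial\theta_\Psi/\partial\theta_\alpha\,(\partial\theta_\Psi/\partial\theta_\alpha - 1) - j\,\partial^2\theta_\Psi/\partial\theta_\alpha^2] \tag{137} $$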


for  an  ultrafast  process  in  coherent  spacetime. 
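Equation (137) lends itself to a direct numerical check. The following sketch is an illustration only: it assumes the ultrafast, coherent-space picture in which the magnitudes of the wave function and of the coordinate are constant while their internal phases vary, and the phase law `theta_psi` and the constants `PSI0`, `ALPHA0` are hypothetical test choices rather than values from the paper.

```python
import cmath

PSI0, ALPHA0 = 1.5, 2.0            # constant magnitudes (assumed test values)

def theta_psi(th):
    """Hypothetical internal phase of the wave function versus theta_alpha."""
    return 0.5 * th + 0.3 * th * th

def dtheta_psi(th):
    return 0.5 + 0.6 * th

D2THETA_PSI = 0.6                  # second derivative of theta_psi

def alpha_bar(th):
    return ALPHA0 * cmath.exp(1j * th)

def psi_bar(th):
    return PSI0 * cmath.exp(1j * theta_psi(th))

def d_dalpha(f, th, h=1e-4):
    """Central-difference derivative of f with respect to alpha_bar,
    taken along the circle of constant magnitude ALPHA0."""
    return (f(th + h) - f(th - h)) / (alpha_bar(th + h) - alpha_bar(th - h))

def second_derivative_numeric(th):
    first = lambda s: d_dalpha(psi_bar, s)
    return d_dalpha(first, th)

def second_derivative_formula(th):
    g = dtheta_psi(th)
    # Right-hand side of equation (137)
    return psi_bar(th) / alpha_bar(th) ** 2 * (g * (g - 1.0) - 1j * D2THETA_PSI)

print(abs(second_derivative_numeric(0.7) - second_derivative_formula(0.7)))
```

The printed difference should be small compared with the magnitude of either value, which supports the sign of the imaginary term in (137).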

C.  Second  Derivative  with  Respect  to  Time. 

This section evaluates the second derivative of the wave function with respect to time for four cases of physical interest: slow and ultrafast processes in coherent and incoherent time. The complex number second derivative of the wave function with respect to time is written as

$$ \bar{\xi}_t = \xi_t\exp(j\theta_{\xi t}) = \partial^2\bar{\Psi}/\partial\bar{t}^2 = \partial\bar{u}/\partial\bar{t} \tag{138} $$

where $\bar{u}$ is defined in equation (22).

Case  1.  Slow  Process  in  Incoherent  Time. 

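For this case a general expression for the second derivative of the wave function with respect to time is derived which allows a transition to the limiting case of a slow process in incoherent time, which is described by

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{139} $$

Equations (28), (32) and (138) give

$$ \xi_t = \sec\beta_{uu}\cos\beta_{tt}^{\Psi}\,\partial u/\partial t \tag{140} $$
$$ \quad = \sec\beta_{uu}\cos\beta_{tt}^{\Psi}\,\partial/\partial t\,(\sec\beta_{\Psi\Psi}\cos\beta_{tt}^{\Psi}\,\partial\Psi/\partial t) \tag{141} $$
$$ \quad \approx \sec\beta_{uu}\cos^2\beta_{tt}^{\Psi}\sec\beta_{\Psi\Psi}\,\partial^2\Psi/\partial t^2 \tag{142} $$

$$ \theta_{\xi t} = \theta_u + \beta_{uu} - \theta_t^{\Psi} - \beta_{tt}^{\Psi} \tag{143} $$
$$ \quad = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) \tag{144} $$

where

$$ \tan\beta_{uu} = u\,\partial\theta_u/\partial u \tag{145} $$

where $u$ and $\theta_u$ are given by equations (28) and (32) respectively.

The limiting case of a slow process in incoherent time is obtained by imposing the conditions in equation (139) on equations (141) and (144) with the result that

$$ \xi^{si}_t = \partial^2\Psi/\partial t^2 \qquad \theta^{si}_{\xi t} = 0 \tag{146} $$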

which  is  the  standard  result. 

Case  2.  Ultrafast  Process  and  Incoherent  Time. 

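The general expression for the second derivative of the wave function with respect to time will now be given which produces the correct limit for the case of an ultrafast process and incoherent time, which is described by

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{147} $$

From equations (29), (32) and (138) it follows that

$$ \xi_t = \csc\beta_{uu}\cos\beta_{tt}^{\Psi}\,u\,\partial\theta_u/\partial t \tag{148} $$
$$ \quad = \csc\beta_{uu}\cos^2\beta_{tt}^{\Psi}\csc\beta_{\Psi\Psi}\,\Psi\,\partial\theta_\Psi/\partial t\;\partial\theta_u/\partial t \tag{149} $$

$$ \theta_{\xi t} = \theta_u + \beta_{uu} - \theta_t^{\Psi} - \beta_{tt}^{\Psi} \tag{150} $$
$$ \quad = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) \tag{151} $$

where $\beta_{uu}$ is given by equation (145) with $u$ and $\theta_u$ given by equations (29) and (32) respectively, and where equation (32) gives

$$ \partial\theta_u/\partial t = \partial/\partial t\,(\theta_\Psi + \beta_{\Psi\Psi} - \theta_t^{\Psi} - \beta_{tt}^{\Psi}) \tag{152} $$

Define the following alternative representation of the second time derivative of the wave function:

$$ \bar{\xi}_t = \xi_t\exp(j\theta_{\xi t}) = \xi^{+}_t\exp(j\theta^{+}_{\xi t}) = \partial^2\bar{\Psi}/\partial\bar{t}^2 = \partial\bar{u}/\partial\bar{t} \tag{153} $$

where

$$ \xi^{+}_t = -\xi_t \tag{154} $$

$$ \theta^{+}_{\xi t} = \theta_{\xi t} - \pi \tag{155} $$

so that

$$ \xi^{+}_t = -\csc\beta_{uu}\cos\beta_{tt}^{\Psi}\,u\,\partial\theta_u/\partial t \tag{156} $$
$$ \quad = -\csc\beta_{uu}\cos^2\beta_{tt}^{\Psi}\csc\beta_{\Psi\Psi}\,\Psi\,\partial\theta_\Psi/\partial t\;\partial\theta_u/\partial t \tag{157} $$

$$ \theta^{+}_{\xi t} = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - \pi \tag{158} $$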


(158) 


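For the limiting case of an ultrafast process and incoherent time, equations (147), (149), (151) and (152) become

$$ \xi^{ui}_t = \csc\beta^{ui}_{uu}\,\Psi\,(\partial\theta_\Psi/\partial t)^2 \tag{159} $$

$$ \theta^{ui}_{\xi t} = \theta_\Psi + \beta^{ui}_{uu} + \pi/2 \tag{160} $$

where from equations (47) and (145) it follows that

$$ \tan\beta^{ui}_{uu} = E^{ui}_{\Psi t}/F^{ui}_{\Psi t} \tag{161} $$

where

$$ E^{ui}_{\Psi t} = (\partial\theta_\Psi/\partial t)^2 > 0 \tag{162} $$

$$ F^{ui}_{\Psi t} = \partial^2\theta_\Psi/\partial t^2 < 0 \tag{163} $$

Equation (161) gives

$$ \csc\beta^{ui}_{uu} = [(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2}/E^{ui}_{\Psi t} \tag{164} $$

which combined with equations (159) and (162) gives

$$ \xi^{ui}_t = \Psi[(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{165} $$

Equation (161) and the choice of signs in equations (162) and (163) give

$$ \beta^{ui}_{uu} = \pi/2 + \kappa_{\Psi t} \tag{166} $$

where the positive angle $\kappa_{\Psi t}$ can also be defined by

$$ \tan\kappa_{\Psi t} = |F^{ui}_{\Psi t}|/E^{ui}_{\Psi t} \tag{167} $$

and equations (160) and (166) give

$$ \theta^{ui}_{\xi t} = \theta_\Psi + \kappa_{\Psi t} + \pi \tag{168} $$

The alternate representation in equations (154) and (155) gives

$$ \xi^{ui+}_t = -\Psi[(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{169} $$

$$ \theta^{ui+}_{\xi t} = \theta_\Psi + \kappa_{\Psi t} \tag{170} $$

so that $\theta^{ui+}_{\xi t}$ is a small angle.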
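The polar form in equations (165) and (168) can be compared with a brute-force second time derivative. This is a minimal sketch under stated assumptions: the magnitude `PSI0` is held constant (ultrafast process), time is real (incoherent time), and the phase law `theta_psi` is a hypothetical test function chosen so that the sign conditions in (162) and (163) hold.

```python
import cmath
import math

PSI0 = 1.5                          # constant magnitude (assumed test value)

def theta_psi(t):
    """Hypothetical internal phase with positive slope and negative curvature."""
    return 3.0 * t - 0.2 * t * t

def psi_bar(t):
    return PSI0 * cmath.exp(1j * theta_psi(t))

def second_time_derivative(t, h=1e-4):
    """Three-point finite-difference estimate of the second time derivative."""
    return (psi_bar(t + h) - 2.0 * psi_bar(t) + psi_bar(t - h)) / (h * h)

def polar_prediction(t):
    e_ui = (3.0 - 0.4 * t) ** 2              # equation (162), positive
    f_ui = -0.4                              # equation (163), negative
    magnitude = PSI0 * math.hypot(e_ui, f_ui)        # equation (165)
    kappa = math.atan(abs(f_ui) / e_ui)              # equation (167)
    phase = theta_psi(t) + kappa + math.pi           # equation (168)
    return magnitude * cmath.exp(1j * phase)

print(abs(second_time_derivative(1.0) - polar_prediction(1.0)))
```

Replacing the magnitude by its negative and dropping the added pi in the phase, as in (169) and (170), describes the same complex number.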

Case  3.  Slow  Process  and  Coherent  Time. 

For this case the following conditions are valid

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \beta_{tt}^{\Psi} = \pi/2 \tag{171} $$


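and a general expression for the second derivative of the wave function with respect to time is required which will accept equation (171) as a limiting case. Equations (30), (138) and (144) give

$$ \xi_t = \sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial u/\partial\theta_t^{\Psi} \tag{172} $$
$$ \quad = \sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial/\partial\theta_t^{\Psi}\,(\sec\beta_{\Psi\Psi}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial\Psi/\partial\theta_t^{\Psi}) \tag{173} $$

$$ \theta_{\xi t} = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) \tag{174} $$

where $\beta_{uu}$ is given by equations (30), (32) and (145). The alternative description introduced in this case is

$$ \bar{\xi}_t = \xi_t\exp(j\theta_{\xi t}) = \xi'_t\exp(j\theta'_{\xi t}) = \partial^2\bar{\Psi}/\partial\bar{t}^2 = \partial\bar{u}/\partial\bar{t} \tag{175} $$

where

$$ \xi'_t = -\xi_t \tag{176} $$

$$ \theta'_{\xi t} = \theta_{\xi t} + \pi \tag{177} $$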


The  limiting  case  of  a  slow  process  in  coherent  time  is  obtained  from 
equation  (171)  which  combined  with  equations  (173)  and  (174)  gives 


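$$ \xi^{sc}_t = \sec\beta^{sc}_{uu}\,t^{-2}\,\partial^2\Psi/\partial(\theta_t^{\Psi})^2 \tag{178} $$

$$ \theta^{sc}_{\xi t} = \beta^{sc}_{uu} - 2\theta_t^{\Psi} - \pi \tag{179} $$

Equations (50) and (145) give

$$ \tan\beta^{sc}_{uu} = E^{sc}_{\Psi t}/F^{sc}_{\Psi t} \tag{180} $$

where

$$ E^{sc}_{\Psi t} = -\,\partial\Psi/\partial\theta_t^{\Psi} > 0 \tag{181} $$

$$ F^{sc}_{\Psi t} = \partial^2\Psi/\partial(\theta_t^{\Psi})^2 > 0 \tag{182} $$

$$ \sec\beta^{sc}_{uu} = [(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2}/F^{sc}_{\Psi t} \tag{183} $$

so that $\beta^{sc}_{uu}$ is a small positive angle. Equations (178), (182) and (183) give

$$ \xi^{sc}_t = t^{-2}[(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{184} $$

Finally, the alternative description of the second derivative of the wave function with respect to time is obtained from equations (176), (177), (179) and (184) to be

$$ \xi'^{\,sc}_t = -t^{-2}[(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{185} $$

$$ \theta'^{\,sc}_{\xi t} = \beta^{sc}_{uu} - 2\theta_t^{\Psi} \tag{186} $$

where $\theta'^{\,sc}_{\xi t}$ is a small angle.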

Case  4.  Ultrafast  Process  and  Coherent  Time. 

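In this section a representation of the second derivative of the wave function with respect to time is determined which has the proper form to be used to obtain the limiting case of an ultrafast process with a coherent time variation that is described by

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \beta_{tt}^{\Psi} = \pi/2 \tag{187} $$

For this case the proper magnitude and internal phase angle of the second derivative are obtained from equations (31), (138) and (144) to be

$$ \xi_t = \csc\beta_{uu}\sin\beta_{tt}^{\Psi}\,(u/t)\,\partial\theta_u/\partial\theta_t^{\Psi} \tag{188} $$
$$ \quad = \csc\beta_{uu}\sin^2\beta_{tt}^{\Psi}\csc\beta_{\Psi\Psi}\,(\Psi/t^2)\,\partial\theta_\Psi/\partial\theta_t^{\Psi}\;\partial\theta_u/\partial\theta_t^{\Psi} \tag{189} $$

$$ \theta_{\xi t} = \theta_\Psi + \beta_{\Psi\Psi} + \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) \tag{190} $$

where $\beta_{uu}$ is given by equations (31), (32) and (145), and where equation (32) gives

$$ \partial\theta_u/\partial\theta_t^{\Psi} = \partial\theta_\Psi/\partial\theta_t^{\Psi} - 1 + \partial/\partial\theta_t^{\Psi}\,(\beta_{\Psi\Psi} - \beta_{tt}^{\Psi}) \tag{191} $$

From equations (31) and (172) a second form of the second derivative of the wave function with respect to time can be written as

$$ \xi_t = \sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial u/\partial\theta_t^{\Psi} \tag{192} $$
$$ \quad = \sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial/\partial\theta_t^{\Psi}\,(\csc\beta_{\Psi\Psi}\sin\beta_{tt}^{\Psi}\,(\Psi/t)\,\partial\theta_\Psi/\partial\theta_t^{\Psi}) \tag{193} $$

Combining equations (189) and (193) gives

$$ \tan\beta_{uu} = C_{\Psi t}/D_{\Psi t} \tag{194} $$

$$ \csc\beta_{uu} = (C_{\Psi t}^2 + D_{\Psi t}^2)^{1/2}/C_{\Psi t} \tag{195} $$

$$ \sec\beta_{uu} = (C_{\Psi t}^2 + D_{\Psi t}^2)^{1/2}/D_{\Psi t} \tag{196} $$

where

$$ C_{\Psi t} = \sin\beta_{tt}^{\Psi}\csc\beta_{\Psi\Psi}\,(\Psi/t)\,\partial\theta_\Psi/\partial\theta_t^{\Psi}\;\partial\theta_u/\partial\theta_t^{\Psi} \tag{197} $$

$$ D_{\Psi t} = \partial/\partial\theta_t^{\Psi}\,(\csc\beta_{\Psi\Psi}\sin\beta_{tt}^{\Psi}\,(\Psi/t)\,\partial\theta_\Psi/\partial\theta_t^{\Psi}) \tag{198} $$

where $\partial\theta_u/\partial\theta_t^{\Psi}$ is given by equation (191). The magnitude of the second derivative of the wave function with respect to time can then be written as

$$ \xi_t = C_{\Psi t}\,t^{-1}\csc\beta_{uu}\sin\beta_{tt}^{\Psi} \tag{199} $$
$$ \quad = D_{\Psi t}\,t^{-1}\sec\beta_{uu}\sin\beta_{tt}^{\Psi} \tag{200} $$
$$ \quad = (C_{\Psi t}^2 + D_{\Psi t}^2)^{1/2}\,t^{-1}\sin\beta_{tt}^{\Psi} \tag{201} $$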
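It is convenient for the case of an ultrafast process and coherent time to introduce the following alternative representation of the second derivative with respect to time

$$ \bar{\xi}_t = \xi_t\exp(j\theta_{\xi t}) = \xi'_t\exp(j\theta'_{\xi t}) = \partial^2\bar{\Psi}/\partial\bar{t}^2 = \partial\bar{u}/\partial\bar{t} \tag{202} $$

where

$$ \xi'_t = -\xi_t \tag{203} $$

$$ \theta'_{\xi t} = \theta_{\xi t} + \pi \tag{204} $$

In the limit of an ultrafast process and a coherent time variation, described by equation (187), it follows that equations (189), (193), (201) and (203) become

$$ \xi'^{\,uc}_t = -\csc\beta^{uc}_{uu}\,(\Psi/t^2)\,E^{uc}_{\Psi t} \tag{205} $$
$$ \quad = -\sec\beta^{uc}_{uu}\,(\Psi/t^2)\,F^{uc}_{\Psi t} \tag{206} $$
$$ \quad = -t^{-1}[(C^{uc}_{\Psi t})^2 + (D^{uc}_{\Psi t})^2]^{1/2} \tag{207} $$
$$ \quad = -(\Psi/t^2)[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2]^{1/2} \tag{208} $$

where

$$ E^{uc}_{\Psi t} = \partial\theta_\Psi/\partial\theta_t^{\Psi}\,(\partial\theta_\Psi/\partial\theta_t^{\Psi} - 1) < 0 \tag{209} $$

$$ F^{uc}_{\Psi t} = \partial^2\theta_\Psi/\partial(\theta_t^{\Psi})^2 > 0 \tag{210} $$

$$ C^{uc}_{\Psi t} = \Psi\,t^{-1}E^{uc}_{\Psi t} \tag{211} $$

$$ D^{uc}_{\Psi t} = \Psi\,t^{-1}F^{uc}_{\Psi t} \tag{212} $$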

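From equations (47) and (145) or from equation (194) it follows that

$$ \tan\beta^{uc}_{uu} = E^{uc}_{\Psi t}/F^{uc}_{\Psi t} \tag{213} $$

$$ \csc\beta^{uc}_{uu} = (E^{uc}_{\Psi t})^{-1}[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2]^{1/2} \tag{214} $$

$$ \sec\beta^{uc}_{uu} = (F^{uc}_{\Psi t})^{-1}[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2]^{1/2} \tag{215} $$

Combining equation (213) with the choice of signs in equations (209) and (210) gives

$$ \beta^{uc}_{uu} = -\pi/2 + \delta_{\Psi t} \tag{216} $$

where $\delta_{\Psi t}$ is a positive angle given by

$$ \tan\delta_{\Psi t} = F^{uc}_{\Psi t}/|E^{uc}_{\Psi t}| \tag{217} $$

Equations (187), (190), (204) and (216) give

$$ \theta'^{\,uc}_{\xi t} = \theta_\Psi + \beta^{uc}_{uu} - 2\theta_t^{\Psi} + \pi/2 \tag{218} $$
$$ \quad = \theta_\Psi - 2\theta_t^{\Psi} + \delta_{\Psi t} \tag{219} $$

and therefore $\theta'^{\,uc}_{\xi t}$ is a small angle. Equations (208) and (219) are equivalent to the following complex number representation of the second derivative of the wave function with respect to time

$$ \bar{\xi}^{uc}_t = (\partial^2\bar{\Psi}/\partial\bar{t}^2)^{uc} = (\bar{\Psi}/\bar{t}^2)\,[\partial\theta_\Psi/\partial\theta_t^{\Psi}\,(\partial\theta_\Psi/\partial\theta_t^{\Psi} - 1) - j\,\partial^2\theta_\Psi/\partial(\theta_t^{\Psi})^2] \tag{220} $$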


which  corresponds  to  an  ultrafast  process  with  a  coherent  time  variation. 

3. WAVE EQUATIONS IN BROKEN SYMMETRY SPACETIME. This section develops the various forms taken by the wave equation for slow and fast processes that can occur in coherent and incoherent space and coherent and incoherent time. In three dimensions the complex number wave equation is written as the following generalization of the standard scalar form of the wave equation in cartesian coordinates [18-24]

$$ \sum_\alpha \partial^2\bar{\Psi}/\partial\bar{\alpha}^2 = (1/\bar{c}^2)\,\partial^2\bar{\Psi}/\partial\bar{t}^2 \tag{221} $$

where $\alpha = x, y, z$, and where the complex number space and time coordinates $\bar{\alpha}$ and $\bar{t}$ are given by equations (1) and (2) respectively, and where

$$ \bar{\Psi} = \Psi\exp(j\theta_\Psi) \qquad \bar{c} = c\exp(j\theta_c) \tag{222} $$

For wave propagation in a single space dimension equation (221) becomes

$$ \partial^2\bar{\Psi}/\partial\bar{x}^2 = (1/\bar{c}^2)\,\partial^2\bar{\Psi}/\partial\bar{t}^2 \tag{223} $$

From equations (55) and (138) it follows that the wave equation (221) can be written as

$$ \sum_\alpha \bar{\xi}_\alpha = \bar{\xi}_t/\bar{c}^2 \tag{224} $$

or

$$ \sum_\alpha \xi_\alpha\exp(j\theta_{\xi\alpha}) = c^{-2}\xi_t\exp[j(\theta_{\xi t} - 2\theta_c)] \tag{225} $$

For a one dimensional case equation (225) becomes

$$ \xi_x\exp(j\theta_{\xi x}) = c^{-2}\xi_t\exp[j(\theta_{\xi t} - 2\theta_c)] \tag{226} $$

where $\theta_c = 0$ for the light speed in the vacuum.

An exact solution of the complex number wave equation (225) requires that the real and imaginary parts be taken, but this leads to very complicated equations. A simpler, but approximate, way of solving equation (225) is to make the assumption that the internal phase angles of each of the component terms of equation (225) are equal. However, because the phase angles on either side of equation (225) or (226) can have a zero or a $\pm\pi$ term included, as shown in Section 2 in equations (61), (72), (94), (121), (144), (155), (177) and (204), the solutions of equations (225) or (226) require that the eight possible process and spacetime states be considered individually, so it follows that

$$ \sum_\alpha \xi_\alpha = c^{-2}\xi_t \tag{227} $$

$$ \theta_{\xi\alpha} \pm (\pi,0) = \theta_{\xi t} - 2\theta_c \pm (\pi,0) \tag{228} $$

where $\alpha = x, y, z$ for the three dimensional case, and $\alpha = x$ for the one dimensional case. From the general equations (61) and (144) it is clear that equation (228) can be written as

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) \pm (\pi,0) = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c \pm (\pi,0) \tag{229} $$

which gives the possible phase angle conditions.

The wave equation components given in equations (227) and (228) will now be investigated for eight possible states associated with the slow or ultrafast process rates, coherent or incoherent space coordinate variation, and the coherent or incoherent time coordinate variation.

Case  a.  Slow  Process,  Incoherent  Space  and  Incoherent  Time. 

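This section develops a general form of the wave equation that can be used to make the transition to the limiting case of a slow process occurring in incoherent space and incoherent time, which is described by

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \theta_\alpha = 0 \qquad \beta_{\alpha\alpha} = 0 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{230} $$

where $\alpha = x, y, z$. For the case considered all internal phase angles are small numbers so that the approximate solution to equation (225) can be obtained from equations (227) and (228) to be

$$ \sum_\alpha \xi_\alpha = c^{-2}\xi_t \tag{231} $$

$$ \theta_{\xi\alpha} = \theta_{\xi t} - 2\theta_c \tag{232} $$

where equation (232) is valid for $\alpha = x, y, z$. Equations (58), (141) and (231) give

$$ \sum_\alpha \sec\beta_{v\alpha v\alpha}\cos\beta_{\alpha\alpha}\,\partial/\partial\alpha\,(\sec\beta_{\Psi\Psi}\cos\beta_{\alpha\alpha}\,\partial\Psi/\partial\alpha) = c^{-2}\sec\beta_{uu}\cos\beta_{tt}^{\Psi}\,\partial/\partial t\,(\sec\beta_{\Psi\Psi}\cos\beta_{tt}^{\Psi}\,\partial\Psi/\partial t) \tag{233} $$

For slowly changing values of $\beta_{\alpha\alpha}$, $\beta_{\Psi\Psi}$ and $\beta_{tt}^{\Psi}$ equation (233) becomes

$$ \sum_\alpha \sec\beta_{v\alpha v\alpha}\cos^2\beta_{\alpha\alpha}\,\partial^2\Psi/\partial\alpha^2 \approx c^{-2}\sec\beta_{uu}\cos^2\beta_{tt}^{\Psi}\,\partial^2\Psi/\partial t^2 \tag{234} $$

where $\beta_{v\alpha v\alpha}$ and $\beta_{uu}$ are given by equations (62) and (145) respectively. From equation (229) it is clear that equation (232) can be written as

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c \tag{235} $$

for $\alpha = x, y, z$, where $\beta_{\alpha\alpha}$ and $\beta_{tt}^{\Psi}$ are defined in equations (13) and (14) respectively, and $\theta_\alpha$ and $\theta_t^{\Psi}$ are given in equations (1) and (2).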


For the special case of a slow process in incoherent space and incoherent time described by equation (230), it follows that equations (233) or (234) reduce to the standard form of the wave equation

$$ \sum_\alpha \partial^2\Psi/\partial\alpha^2 = c^{-2}\,\partial^2\Psi/\partial t^2 \tag{236} $$

For the one dimensional case equation (236) becomes

$$ \partial^2\Psi/\partial x^2 = c^{-2}\,\partial^2\Psi/\partial t^2 \tag{237} $$

The phase angle equation (235) yields a trivial null result, assuming $\theta_c = 0$, for a slow process in incoherent space and time.
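As a sanity check on the conventional limit, equation (237), one can confirm numerically that a travelling wave satisfies it. The sketch below is illustrative only: the wave speed `C`, wavenumber `K` and the sinusoidal profile are assumed test values, not quantities from the paper.

```python
import math

C, K = 3.0, 2.0                     # wave speed and wavenumber (assumed values)

def psi(x, t):
    """Right-travelling wave Psi(x, t) = sin(K (x - C t))."""
    return math.sin(K * (x - C * t))

def second_difference(f, z, h=1e-4):
    """Three-point finite-difference estimate of the second derivative of f at z."""
    return (f(z + h) - 2.0 * f(z) + f(z - h)) / (h * h)

def wave_residual(x, t):
    """Left side minus right side of equation (237)."""
    d2_dx2 = second_difference(lambda s: psi(s, t), x)
    d2_dt2 = second_difference(lambda s: psi(x, s), t)
    return d2_dx2 - d2_dt2 / C ** 2

print(wave_residual(0.3, 0.2))
```

The residual is limited only by finite-difference error; the remaining cases of this section change both sides of the equation through the internal phase angles.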

Case  b.  Slow  Process,  Coherent  Space  and  Incoherent  Time. 


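In this section a general form of the wave equation is developed that can be used to pass to the limit of a slow process that occurs in coherent space and incoherent time, whose description is given by

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \beta_{\alpha\alpha} = \pi/2 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{238} $$

for $\alpha = x, y, z$. Equations (90), (92), (93), (141) and (227) become

$$ -\sum_\alpha \xi_\alpha = \sum_\alpha \xi'_\alpha = c^{-2}\xi_t \tag{239} $$

or equivalently

$$ -\sum_\alpha \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial/\partial\theta_\alpha\,(\sec\beta_{\Psi\Psi}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial\Psi/\partial\theta_\alpha) = c^{-2}\sec\beta_{uu}\cos\beta_{tt}^{\Psi}\,\partial/\partial t\,(\sec\beta_{\Psi\Psi}\cos\beta_{tt}^{\Psi}\,\partial\Psi/\partial t) \tag{240} $$

while equations (61), (92), (94) and (228) give

$$ \theta'_{\xi\alpha} = \theta_{\xi t} - 2\theta_c \tag{241} $$

or

$$ \theta_{\xi\alpha} + \pi = \theta_{\xi t} - 2\theta_c \tag{242} $$

which is equivalent to equation (229) written as

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) + \pi = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c \tag{243} $$

For the special case of a slow process in coherent space and incoherent time as described by equation (238) it follows that equation (240) becomes

$$ -\sum_\alpha \sec\beta^{sc}_{v\alpha v\alpha}\,\alpha^{-2}\,\partial^2\Psi/\partial\theta_\alpha^2 = c^{-2}\,\partial^2\Psi/\partial t^2 \tag{244} $$

Combining equations (100) and (244) gives

$$ -\sum_\alpha \alpha^{-2}[(E^{sc}_{\Psi\alpha})^2 + (F^{sc}_{\Psi\alpha})^2]^{1/2} = c^{-2}\,\partial^2\Psi/\partial t^2 \tag{245} $$

where $E^{sc}_{\Psi\alpha}$ and $F^{sc}_{\Psi\alpha}$ are given by equations (98) and (99) respectively. For one space dimension equation (245) becomes

$$ -c^2x^{-2}[(E^{sc}_{\Psi x})^2 + (F^{sc}_{\Psi x})^2]^{1/2} = \partial^2\Psi/\partial t^2 \tag{246} $$

Combining equations (238) and (243) gives

$$ \beta^{sc}_{v\alpha v\alpha} - 2\theta_\alpha = -2\theta_c \tag{247} $$

Combining equations (97) through (99) with equation (247) gives

$$ \tan[2(\theta_\alpha - \theta_c)] = E^{sc}_{\Psi\alpha}/F^{sc}_{\Psi\alpha} \tag{248} $$

or

$$ \tan[2(\theta_\alpha - \theta_c)] = -(\partial\Psi/\partial\theta_\alpha)/(\partial^2\Psi/\partial\theta_\alpha^2) \tag{249} $$

which is valid for a slow process in coherent space and incoherent time.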
Case c. Slow Process, Incoherent Space and Coherent Time.

This section derives a general form of the wave equation which can be used to attain the limiting case of a slow process occurring in incoherent space and coherent time, which is defined by the following characteristics

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \theta_\alpha = 0 \qquad \beta_{\alpha\alpha} = 0 \qquad \beta_{tt}^{\Psi} = \pi/2 \tag{250} $$

Equations (58), (173), (176) and (227) give

$$ \sum_\alpha \xi_\alpha = -c^{-2}\xi_t \tag{251} $$

or equivalently

$$ \sum_\alpha \sec\beta_{v\alpha v\alpha}\cos\beta_{\alpha\alpha}\,\partial/\partial\alpha\,(\sec\beta_{\Psi\Psi}\cos\beta_{\alpha\alpha}\,\partial\Psi/\partial\alpha) = -c^{-2}\sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial/\partial\theta_t^{\Psi}\,(\sec\beta_{\Psi\Psi}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial\Psi/\partial\theta_t^{\Psi}) \tag{252} $$

while equations (60), (174) and (177) give

$$ \theta_{\xi\alpha} = \theta'_{\xi t} - 2\theta_c \tag{253} $$

$$ \theta_{\xi\alpha} = \theta_{\xi t} - 2\theta_c + \pi \tag{254} $$

or equivalently from equations (61), (174) and (229)

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c + \pi \tag{255} $$

For the limiting case of a slow wave propagation process in incoherent space and coherent time, equations (250) and (252) give

$$ \sum_\alpha \partial^2\Psi/\partial\alpha^2 = -c^{-2}\sec\beta^{sc}_{uu}\,t^{-2}\,\partial^2\Psi/\partial(\theta_t^{\Psi})^2 \tag{256} $$

Combining equations (182), (183) and (256) gives

$$ \sum_\alpha \partial^2\Psi/\partial\alpha^2 = -c^{-2}t^{-2}[(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{257} $$

where $E^{sc}_{\Psi t}$ and $F^{sc}_{\Psi t}$ are given by equations (181) and (182). For one space dimension equation (257) becomes

$$ c^2t^2\,\partial^2\Psi/\partial x^2 = -[(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{258} $$

Combining equations (250) and (255) gives the phase angle condition for this limiting case of the wave equation as

$$ 0 = \beta^{sc}_{uu} - 2\theta_t^{\Psi} - 2\theta_c \tag{259} $$

Combining equations (180) through (182) and (259) gives the phase angle condition as

$$ \tan[2(\theta_t^{\Psi} + \theta_c)] = E^{sc}_{\Psi t}/F^{sc}_{\Psi t} \tag{260} $$

or equivalently

$$ \tan[2(\theta_t^{\Psi} + \theta_c)] = -(\partial\Psi/\partial\theta_t^{\Psi})/(\partial^2\Psi/\partial(\theta_t^{\Psi})^2) \tag{261} $$

Case  d.  Slow  Process,  Coherent  Space  and  Coherent  Time. 

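A general form of the wave equation is derived in this section that can be used to obtain the limiting case of a slow process occurring in coherent space and coherent time, which is described by

$$ \theta_\Psi = 0 \qquad \beta_{\Psi\Psi} = 0 \qquad \beta_{\alpha\alpha} = \pi/2 \qquad \beta_{tt}^{\Psi} = \pi/2 \tag{262} $$

The combination of equations (90), (93), (173) and (227) gives

$$ -\sum_\alpha \xi_\alpha = -c^{-2}\xi_t \tag{263} $$

or equivalently

$$ \sum_\alpha \sec\beta_{v\alpha v\alpha}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial/\partial\theta_\alpha\,(\sec\beta_{\Psi\Psi}\sin\beta_{\alpha\alpha}\,\alpha^{-1}\,\partial\Psi/\partial\theta_\alpha) = c^{-2}\sec\beta_{uu}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial/\partial\theta_t^{\Psi}\,(\sec\beta_{\Psi\Psi}\sin\beta_{tt}^{\Psi}\,t^{-1}\,\partial\Psi/\partial\theta_t^{\Psi}) \tag{264} $$

while equations (60), (94), (174), (177) and (229) give

$$ \theta'_{\xi\alpha} = \theta'_{\xi t} - 2\theta_c \tag{265} $$

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c \tag{266} $$

where $+\pi$ has been cancelled from both sides of equation (266).

For the special case of a slow wave propagation process in coherent space and coherent time, equations (262) and (264) give

$$ \sum_\alpha \sec\beta^{sc}_{v\alpha v\alpha}\,\alpha^{-2}\,\partial^2\Psi/\partial\theta_\alpha^2 = c^{-2}\sec\beta^{sc}_{uu}\,t^{-2}\,\partial^2\Psi/\partial(\theta_t^{\Psi})^2 \tag{267} $$

Combining equations (99), (100), (182), (183) and (267) gives

$$ \sum_\alpha \alpha^{-2}[(E^{sc}_{\Psi\alpha})^2 + (F^{sc}_{\Psi\alpha})^2]^{1/2} = c^{-2}t^{-2}[(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{268} $$

where $E^{sc}_{\Psi\alpha}$, $F^{sc}_{\Psi\alpha}$, $E^{sc}_{\Psi t}$ and $F^{sc}_{\Psi t}$ are given by equations (98), (99), (181) and (182) respectively. For one space dimension, equation (268) becomes

$$ (ct/x)^2[(E^{sc}_{\Psi x})^2 + (F^{sc}_{\Psi x})^2]^{1/2} = [(E^{sc}_{\Psi t})^2 + (F^{sc}_{\Psi t})^2]^{1/2} \tag{269} $$

Combining equations (262) and (266) gives

$$ \beta^{sc}_{v\alpha v\alpha} - 2\theta_\alpha = \beta^{sc}_{uu} - 2\theta_t^{\Psi} - 2\theta_c \tag{270} $$

Equation (270) can be rewritten using equations (97) and (180) as

$$ \tan^{-1}(E^{sc}_{\Psi\alpha}/F^{sc}_{\Psi\alpha}) - \tan^{-1}(E^{sc}_{\Psi t}/F^{sc}_{\Psi t}) = 2(\theta_\alpha - \theta_t^{\Psi} - \theta_c) \tag{271} $$

which is valid for a slow process in coherent space and coherent time.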

Case  e.  Ultrafast  Process,  Incoherent  Space  and  Incoherent  Time. 


This section develops the general form of the wave equation that can be utilized to attain the limiting case of an ultrafast process in incoherent space and incoherent time, which is represented by

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \theta_\alpha = 0 \qquad \beta_{\alpha\alpha} = 0 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{272} $$

Combining equations (66), (71), (149), (154) and (227) gives

$$ -\sum_\alpha \xi_\alpha = -c^{-2}\xi_t \tag{273} $$

or equivalently

$$ \sum_\alpha \csc\beta_{v\alpha v\alpha}\cos^2\beta_{\alpha\alpha}\,\partial\theta_\Psi/\partial\alpha\;\partial\theta_{v\alpha}/\partial\alpha = c^{-2}\csc\beta_{uu}\cos^2\beta_{tt}^{\Psi}\,\partial\theta_\Psi/\partial t\;\partial\theta_u/\partial t \tag{274} $$

where a factor $\Psi\csc\beta_{\Psi\Psi}$ has been divided out of both sides of equation (274). The phase angle condition is obtained from equations (69), (72), (151), (155) and (229) as

$$ \theta^{+}_{\xi\alpha} = \theta^{+}_{\xi t} - 2\theta_c \tag{275} $$

or equivalently

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c \tag{276} $$

where $-\pi$ has been cancelled from both sides of equation (276).

For the special case of an ultrafast wave propagation process in incoherent space and incoherent time, equations (36), (47), (272) and (274) give

$$ \sum_\alpha \csc\beta^{ui}_{v\alpha v\alpha}\,(\partial\theta_\Psi/\partial\alpha)^2 = c^{-2}\csc\beta^{ui}_{uu}\,(\partial\theta_\Psi/\partial t)^2 \tag{277} $$

From equations (79), (81), (162), (164) and (277) it follows that

$$ \sum_\alpha [(E^{ui}_{\Psi\alpha})^2 + (F^{ui}_{\Psi\alpha})^2]^{1/2} = c^{-2}[(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{278} $$

where $E^{ui}_{\Psi\alpha}$, $F^{ui}_{\Psi\alpha}$, $E^{ui}_{\Psi t}$ and $F^{ui}_{\Psi t}$ are given by equations (79), (80), (162) and (163) respectively. For one space dimension equation (278) becomes

$$ c^2[(E^{ui}_{\Psi x})^2 + (F^{ui}_{\Psi x})^2]^{1/2} = [(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{279} $$

Combining equations (272) and (276) gives the phase angle equation for the wave equation in this special case as

$$ \beta^{ui}_{v\alpha v\alpha} = \beta^{ui}_{uu} - 2\theta_c \tag{280} $$

and finally using equations (83), (166) and (280) gives

$$ \kappa_{\Psi\alpha} = \kappa_{\Psi t} - 2\theta_c \tag{281} $$

where $\kappa_{\Psi\alpha}$ and $\kappa_{\Psi t}$ are given by equations (84) and (167) respectively.

Case  f.  Ultrafast  Process,  Coherent  Space  and  Incoherent  Time. 

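Consider now the development of a general form of the wave equation that can be used to obtain the limiting case of an ultrafast wave propagation process in coherent space and incoherent time, which is described by

$$ \beta_{\Psi\Psi} = \pi/2 \qquad \beta_{\alpha\alpha} = \pi/2 \qquad \theta_t^{\Psi} = 0 \qquad \beta_{tt}^{\Psi} = 0 \tag{282} $$

Combining equations (106), (120), (149), (154) and (227) gives

$$ -\sum_\alpha \xi_\alpha = -c^{-2}\xi_t \tag{283} $$

or equivalently

$$ \sum_\alpha \csc\beta_{v\alpha v\alpha}\sin^2\beta_{\alpha\alpha}\,\alpha^{-2}\,\partial\theta_\Psi/\partial\theta_\alpha\;\partial\theta_{v\alpha}/\partial\theta_\alpha = c^{-2}\csc\beta_{uu}\cos^2\beta_{tt}^{\Psi}\,\partial\theta_\Psi/\partial t\;\partial\theta_u/\partial t \tag{284} $$

where a factor $\Psi\csc\beta_{\Psi\Psi}$ has been divided out of both sides of equation (284). Combining equations (107), (121), (151), (155), (228) and (229) gives

$$ \theta'_{\xi\alpha} = \theta^{+}_{\xi t} - 2\theta_c \tag{285} $$

or

$$ \theta_{\xi\alpha} + \pi = \theta_{\xi t} - 2\theta_c - \pi \tag{286} $$

or equivalently

$$ \beta_{v\alpha v\alpha} - 2(\theta_\alpha + \beta_{\alpha\alpha}) + \pi = \beta_{uu} - 2(\theta_t^{\Psi} + \beta_{tt}^{\Psi}) - 2\theta_c - \pi \tag{287} $$

For the special case of an ultrafast wave propagation process in coherent space and incoherent time, equations (42), (47), (282) and (284) give

$$ \sum_\alpha \csc\beta^{uc}_{v\alpha v\alpha}\,\alpha^{-2}\,\partial\theta_\Psi/\partial\theta_\alpha\,(\partial\theta_\Psi/\partial\theta_\alpha - 1) = c^{-2}\csc\beta^{ui}_{uu}\,(\partial\theta_\Psi/\partial t)^2 \tag{288} $$

Combining equations (126), (131), (162), (164) and (288) gives

$$ \sum_\alpha \alpha^{-2}[(E^{uc}_{\Psi\alpha})^2 + (F^{uc}_{\Psi\alpha})^2]^{1/2} = c^{-2}[(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{289} $$

where $E^{uc}_{\Psi\alpha}$, $F^{uc}_{\Psi\alpha}$, $E^{ui}_{\Psi t}$ and $F^{ui}_{\Psi t}$ are given by equations (126), (127), (162) and (163) respectively. For one space dimension equation (289) becomes

$$ c^2x^{-2}[(E^{uc}_{\Psi x})^2 + (F^{uc}_{\Psi x})^2]^{1/2} = [(E^{ui}_{\Psi t})^2 + (F^{ui}_{\Psi t})^2]^{1/2} \tag{290} $$

From equations (282) and (287) it follows that

$$ \beta^{uc}_{v\alpha v\alpha} - 2\theta_\alpha = \beta^{ui}_{uu} - 2\theta_c - \pi \tag{291} $$

Combining equations (133), (166) and (291) gives

$$ \delta_{\Psi\alpha} - 2\theta_\alpha = \kappa_{\Psi t} - 2\theta_c \tag{292} $$

where $\delta_{\Psi\alpha}$ and $\kappa_{\Psi t}$ are defined by equations (134) and (167) respectively.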

Case  g.  Ultrafast  Process,  Incoherent  Space  and  Coherent  Time. 

This section gives the general wave equation for spacetime with broken internal symmetries that can be used to approach the limiting case of an ultrafast wave propagation process in incoherent space and coherent time whose characteristics are given by


8^  -  ir/2 


6-0 

a 


8  -  0 
aa 


8  -  n/2 

tt 


(293) 


From  equations  (66),  (71),  (189),  (203)  and  (227)  it  follows  that 


-  I  «.  ■  -  ‘S 


(294) 


o 

y  csc  8  cos  8  30U./3O  30  /3a 

£  vava  aa  T  va' 

-  c"2  esc  8uu  sin28^t  t"2  30^/30*  38u/30* 


(295) 
where  a  factor  ¥  esc  has  been  divided  out  of  both  sides  of  equation  (295).
Combining  equations  (69),  (72),  (190),  (204),  (228)  and  (229)  gives 


6 


t 


20 


c 


(296) 


or 


6_  -  it  =  0_  -  28  ,  +  ir 

5a  5t  c 


(297)

or equivalently

$$\beta_{\Psi\Psi a} - 2(\theta_a + \beta_{aa}) - \pi = \beta_{\Psi\Psi t} - 2(\theta_t + \beta_{tt}) - 2\theta_c + \pi$$ (298)


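For the special case of an ultrafast wave propagation process in incoherent space and coherent time, equations (36), (53), (293) and (295) give

$$\sum_a \csc\beta^{ui}_{\Psi\Psi a}\left(\frac{\partial\theta_\Psi}{\partial a}\right)^2 = c^{-2}t^{-2}\csc\beta^{uc}_{\Psi\Psi t}\,\frac{\partial\theta_\Psi}{\partial\theta_t}\left(\frac{\partial\theta_\Psi}{\partial\theta_t} - 1\right)$$ (299)

Combining equations (79), (81), (209), (214) and (299) gives

$$\sum_a \left[(E^{ui}_{\Psi a})^2 + (F^{ui}_{\Psi a})^2\right]^{1/2} = c^{-2}t^{-2}\left[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2\right]^{1/2}$$ (300)

where $E^{ui}_{\Psi a}$, $F^{ui}_{\Psi a}$, $E^{uc}_{\Psi t}$ and $F^{uc}_{\Psi t}$ are given by equations (79), (80), (209) and (210) respectively. For one space dimension equation (300) becomes

$$c^{2}t^{2}\left[(E^{ui}_{\Psi x})^2 + (F^{ui}_{\Psi x})^2\right]^{1/2} = \left[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2\right]^{1/2}$$ (301)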

Equations (293) and (298) give the internal phase angle equation for this special case of the wave equation as

$$\beta^{ui}_{\Psi\Psi a} - \pi = \beta^{uc}_{\Psi\Psi t} - 2\theta_t - 2\theta_c$$ (302)

Combining equations (83), (216) and (302) gives

$$\kappa_{\Psi a} = \delta_{\Psi t} - 2\theta_t - 2\theta_c$$ (303)

where $\kappa_{\Psi a}$ and $\delta_{\Psi t}$ are defined by equations (84) and (217) respectively.

Case  h.  Ultrafast  Process,  Coherent  Space  and  Coherent  Time. 

In  this  section  a  completely  general  form  of  the  wave  equation  in  broken 
symmetry  spacetime  is  developed  for  the  purpose  of  attaining  the  limiting  case 
of  an  ultrafast  wave  propagation  process  in  coherent  space  and  coherent  time 
which  is  described  by 

-  w/2  Baa  -  ir/2  “  */2  (304) 

From equations (106), (120), (189), (203) and (227) it follows that the scalar component of the wave equation is

-  K  -  - 


(305) 


or  equivalently 


l  esc  0  sin^g  a  2  30,.,/30  30  /36 

£  vava  aa  V  a  va  a 

-  c  CSC  6uu  sin48jt  t  30^/30^  30^36* 


(306) 


where  a  factor  S'  esc  0<j has  been  divided  out  of  both  sides  of  equation  (306)  . 
Combining  equations  (107),  (121),  (190),  (204),  (228)  and  (229)  gives  the  phase 
angle  condition  for  the  wave  equation  as 


01  =  0’  -  20 

£a  5t  c 


(307) 


or 


er  +  "  =  0r  -  20  +  ir  (308) 

5t  c  ' 

or equivalently as

$$\beta_{\Psi\Psi a} - 2(\theta_a + \beta_{aa}) = \beta_{\Psi\Psi t} - 2(\theta_t + \beta_{tt}) - 2\theta_c$$ (309)

where $+\pi$ has been cancelled from both sides of equation (309).

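In the special case of an ultrafast wave propagation process in coherent space and coherent time, equations (42), (53), (304) and (306) give

$$\sum_a \csc\beta^{uc}_{\Psi\Psi a}\; a^{-2}\,\frac{\partial\theta_\Psi}{\partial\theta_a}\left(\frac{\partial\theta_\Psi}{\partial\theta_a} - 1\right) = c^{-2}\csc\beta^{uc}_{\Psi\Psi t}\; t^{-2}\,\frac{\partial\theta_\Psi}{\partial\theta_t}\left(\frac{\partial\theta_\Psi}{\partial\theta_t} - 1\right)$$ (310)

Combining equations (126), (131), (209), (214) and (310) gives

$$\sum_a a^{-2}\left[(E^{uc}_{\Psi a})^2 + (F^{uc}_{\Psi a})^2\right]^{1/2} = (ct)^{-2}\left[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2\right]^{1/2}$$ (311)

where $E^{uc}_{\Psi a}$, $F^{uc}_{\Psi a}$, $E^{uc}_{\Psi t}$ and $F^{uc}_{\Psi t}$ are given by equations (126), (127), (209) and (210) respectively. For one space dimension equation (311) becomes

$$(ct/x)^{2}\left[(E^{uc}_{\Psi x})^2 + (F^{uc}_{\Psi x})^2\right]^{1/2} = \left[(E^{uc}_{\Psi t})^2 + (F^{uc}_{\Psi t})^2\right]^{1/2}$$ (312)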

From equations (304) and (309) it follows that the phase angle relationship for this special case of the wave equation is given by

$$\beta^{uc}_{\Psi\Psi a} - 2\theta_a = \beta^{uc}_{\Psi\Psi t} - 2\theta_t - 2\theta_c$$ (313)

Combining equations (133), (216) and (313) gives

$$\delta_{\Psi a} - 2\theta_a = \delta_{\Psi t} - 2\theta_t - 2\theta_c$$ (314)

where $\delta_{\Psi a}$ and $\delta_{\Psi t}$ are defined by equations (134) and (217) respectively. The results given above can also be obtained from equations (137) and (220) which give the wave equation for an ultrafast process in coherent space and coherent time as


-2-uc 


l  C  =  1/C? 


(315) 


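or equivalently

$$\sum_a \bar{a}^{-2}\left[\frac{\partial\theta_\Psi}{\partial\theta_a}\left(\frac{\partial\theta_\Psi}{\partial\theta_a} - 1\right) - j\frac{\partial^2\theta_\Psi}{\partial\theta_a^2}\right] = (\bar{c}\,\bar{t})^{-2}\left[\frac{\partial\theta_\Psi}{\partial\theta_t}\left(\frac{\partial\theta_\Psi}{\partial\theta_t} - 1\right) - j\frac{\partial^2\theta_\Psi}{\partial\theta_t^2}\right]$$ (316)

Equation (316) is equivalent to equations (311) and (313). The one dimensional analog of equation (316) is given by

$$(\bar{c}\,\bar{t}/\bar{x})^{2}\left[\frac{\partial\theta_\Psi}{\partial\theta_x}\left(\frac{\partial\theta_\Psi}{\partial\theta_x} - 1\right) - j\frac{\partial^2\theta_\Psi}{\partial\theta_x^2}\right] = \frac{\partial\theta_\Psi}{\partial\theta_t}\left(\frac{\partial\theta_\Psi}{\partial\theta_t} - 1\right) - j\frac{\partial^2\theta_\Psi}{\partial\theta_t^2}$$ (317)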

4. WAVE EQUATION SOLUTIONS. The general forms of the wave equations derived in Section 3 are too complicated to allow simple analytical solutions. This is true for equations (233), (240), (252), (264), (274), (284), (295) and (306) and their corresponding phase angle equations (232), (243), (255), (266), (276), (287), (298) and (309). For some special cases of the wave equation it is relatively easy to find analytical solutions, and it is these cases that are considered in this section.


Case  a.  Slow  Process,  Incoherent  Space  and  Incoherent  Time. 

The standard wave equation (236) has the following well known solution^{18-24}

$$\Psi = A\exp\left[i\left(\sum_a k_a a \pm \omega t\right)\right]$$ (318)

where a = x, y and z. For this case a and t are variables, and equation (318) is a solution to equation (236) only if the external space wave numbers $k_a$ are related to the external frequency $\omega$ by the following standard equation

$$\sum_a k_a^2 = \omega^2/c^2$$ (319)

where c = propagation speed. For the one dimensional case the solution to the wave equation is

$$\Psi = A\exp[ik_x(x \pm ct)]$$ (320)

where

$$k_x = \omega/c$$ (321)

The ± signs in equations (318) and (320) correspond to the two possible directions for the wave propagation along each axis.
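As a hedged numerical illustration of equations (318) and (319), the sketch below builds the plane wave for an arbitrarily chosen wave vector, fixes ω from the dispersion relation, and estimates the wave-equation residual by central finite differences. It assumes that equation (236) is the standard wave equation ∇²Ψ = c⁻² ∂²Ψ/∂t²; the function names, the step size `h` and all sample values are illustrative choices and do not come from the paper. Python's `1j` plays the role of the ordinary imaginary unit i.

```python
import cmath
import math

def plane_wave(pos, t, k, omega, amplitude=1.0, sign=+1):
    """Equation (318): A exp[i(sum_a k_a a +/- omega t)]."""
    phase = sum(ka * a for ka, a in zip(k, pos)) + sign * omega * t
    return amplitude * cmath.exp(1j * phase)

def wave_residual(pos, t, k, c, h=1e-3):
    """Finite-difference estimate of laplacian(Psi) - (1/c^2) d2Psi/dt2."""
    omega = c * math.sqrt(sum(ka * ka for ka in k))  # dispersion relation (319)
    psi0 = plane_wave(pos, t, k, omega)
    laplacian = 0.0
    for axis in range(3):
        fwd = list(pos)
        bwd = list(pos)
        fwd[axis] += h
        bwd[axis] -= h
        laplacian += (plane_wave(fwd, t, k, omega) - 2 * psi0
                      + plane_wave(bwd, t, k, omega)) / h**2
    d2_dt2 = (plane_wave(pos, t + h, k, omega) - 2 * psi0
              + plane_wave(pos, t - h, k, omega)) / h**2
    return laplacian - d2_dt2 / c**2

# Illustrative values only (assumed, not taken from the paper).
residual = wave_residual(pos=(0.3, -0.2, 0.5), t=0.1, k=(1.0, 2.0, 2.0), c=1.5)
print(abs(residual))  # small when (319) holds; limited by the step size h
```

If ω is chosen so that it violates equation (319), the residual no longer shrinks with `h`, which is the numerical counterpart of the statement that (318) solves the wave equation only under that condition.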

Case  b.  Slow  Process,  Coherent  Space  and  Incoherent  Time. 


For this special case the wave equation (245) has the following simple solution

$$\Psi = A\exp\left(-\sum_a a_a\theta_a\right)\exp\left[i\left(\sum_a k_a a \pm \omega t\right)\right]$$ (322)

where $\theta_a$ and t are variables, a = x, y and z are constants, $a_a$ are constants, and where as usual $k_a$ and $\omega$ are constants. From equations (98), (99) and (322) it follows that


$$E^{sc}_{\Psi a} = a_a\Psi \qquad F^{sc}_{\Psi a} = a_a^2\Psi$$ (323)

$$\partial^2\Psi/\partial t^2 = -\omega^2\Psi$$ (324)


Equations (98) and (323) give the condition $a_a > 0$. Combining equations (245), (323) and (324) gives the condition between the internal space parameters $a_a$ and the external time angular frequency $\omega$ required for a solution

$$\sum_a a^{-2}a_a(a_a^2 + 1)^{1/2} = (\omega/c)^2$$ (325)

From equation (97) it follows that for this solution

$$\tan\beta^{sc}_{\Psi\Psi a} = a_a^{-1}$$ (326)

For the one dimensional case the wave function is

$$\Psi = A\exp(-a_x\theta_x)\exp[ik_x(x \pm ct)]$$ (327)

which gives the condition for a solution as

$$a_x(a_x^2 + 1)^{1/2} = (\omega x/c)^2$$ (328)

which gives the constant $a_x$ as

$$a_x = \left\{-1/2 + 1/2\left[1 + 4(\omega x/c)^4\right]^{1/2}\right\}^{1/2}$$ (329)

where $a_x > 0$. This case corresponds to the wave amplitude $\Psi$ propagating evanescently in internal space $\theta_a$ for fixed spatial coordinate magnitudes a, and oscillating in external time. For this special case the measured space coordinates change in time according to

$$a_m = a\cos\theta_a \qquad da_m/dt = -a\sin\theta_a\, d\theta_a/dt$$ (330)

There is no wave propagation in external space for this case because a = constant in the phase factor of equation (322).
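The closed form in equation (329) can be checked numerically. The sketch below is a minimal illustration, assuming the one dimensional condition reads a_x(a_x² + 1)^{1/2} = (ωx/c)² as in equation (328); the function name and the sample values of ω, x and c are hypothetical.

```python
import math

def evanescent_constant(rhs):
    """Positive root a of a*(a^2 + 1)^(1/2) = rhs, using the form of equation (329)."""
    return math.sqrt(-0.5 + 0.5 * math.sqrt(1.0 + 4.0 * rhs**2))

# Illustrative values only (assumed, not taken from the paper).
omega, x, c = 2.0, 3.0, 4.0
rhs = (omega * x / c) ** 2                 # right side of equation (328)
a_x = evanescent_constant(rhs)
lhs = a_x * math.sqrt(a_x**2 + 1.0)        # left side of equation (328)
print(a_x, lhs, rhs)
```

The same function applies unchanged to equation (338) if `rhs` is set to (c t k_x)², since equations (328) and (337) share the same algebraic form.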

Case  c.  Slow  Process,  Incoherent  Space  and  Coherent  Time. 

The wave equation corresponding to this special case is given by equation (257) and has the solution

$$\Psi = A\exp(-b_t\theta_t)\exp\left[i\left(\sum_a k_a a \pm \omega t\right)\right]$$ (331)

where a and $\theta_t$ are variables, t = constant and $b_t$ = constant. Equations (181), (182) and (331) give

$$E^{sc}_{\Psi t} = b_t\Psi \qquad F^{sc}_{\Psi t} = b_t^2\Psi$$ (332)

$$\partial^2\Psi/\partial a^2 = -k_a^2\Psi$$ (333)

Equations (181) and (332) give $b_t > 0$. Combining equations (257), (332) and (333) gives the following connection between the external space wave numbers $k_a$ and the internal time parameter $b_t$

$$\sum_a k_a^2 = (ct)^{-2}b_t(b_t^2 + 1)^{1/2}$$ (334)

where a = x, y, z. From equation (334) it follows immediately that for $b_t > 0$

$$b_t = \left\{-1/2 + 1/2\left[1 + 4(ct)^4\left(\sum_a k_a^2\right)^2\right]^{1/2}\right\}^{1/2}$$ (334A)

For this solution equation (180) gives the following phase angle condition

$$\tan\beta^{sc}_{\Psi\Psi t} = b_t^{-1}$$ (335)

For the one space dimension case the solution to the wave equation is obtained from equation (331) as

$$\Psi = A\exp(-b_t\theta_t)\exp[ik_x(x \pm ct)]$$ (336)

subject to

$$b_t(b_t^2 + 1)^{1/2} = (ctk_x)^2$$ (337)

from which the constant $b_t$ is determined to have the value

$$b_t = \left\{-1/2 + 1/2\left[1 + 4(ctk_x)^4\right]^{1/2}\right\}^{1/2}$$ (338)

with $b_t > 0$. The wave function in equation (331) corresponds to t = constant so that the wave propagates in external space with an evanescent wave propagation in internal time. However, the measured time does change during the motion because

$$t_m = t\cos\theta_t \qquad dt_m/dx = -t\sin\theta_t\, d\theta_t/dx$$ (339)

There is no wave propagation in external time because t = constant in the phase angle of equation (331).

Case  d.  Slow  Process,  Coherent  Space  and  Coherent  Time. 


For the special case the wave equation (268) has the solution

$$\Psi = A\exp\left(-\sum_a d_a\theta_a - d_t\theta_t\right)\exp\left[i\left(\sum_a k_a a \pm \omega t\right)\right]$$ (340)

where $\theta_a$ and $\theta_t$ are variables, a and t = constants, and where $d_a$ and $d_t$ = constants. Equations (98), (99), (181), (182), (268) and (340) give

$$E^{sc}_{\Psi a} = d_a\Psi \qquad F^{sc}_{\Psi a} = d_a^2\Psi$$ (341)

$$E^{sc}_{\Psi t} = d_t\Psi \qquad F^{sc}_{\Psi t} = d_t^2\Psi$$ (342)

Combining equations (98), (181), (341) and (342) gives $d_a > 0$ and $d_t > 0$. Then combining equations (268), (341) and (342) gives the condition between $d_a$ and $d_t$ that is required for equation (340) to be a solution of the wave equation (268) as follows

$$\sum_a a^{-2}d_a(d_a^2 + 1)^{1/2} = (ct)^{-2}d_t(d_t^2 + 1)^{1/2}$$ (343)

or equivalently

$$d_t^2 = -1/2 + 1/2\left\{1 + 4(ct)^4\left[\sum_a a^{-2}d_a(d_a^2 + 1)^{1/2}\right]^2\right\}^{1/2}$$ (344)

where $d_t > 0$. For this solution the phase angle conditions in equations (97) and (180) give

$$\tan\beta^{sc}_{\Psi\Psi a} = d_a^{-1} \qquad \tan\beta^{sc}_{\Psi\Psi t} = d_t^{-1}$$ (345)

For the case of one space dimension the solution to equation (269) is written as

$$\Psi = A\exp(-d_x\theta_x - d_t\theta_t)\exp[i(k_x x \pm \omega t)]$$ (346)

subject to the following condition between the constants $d_x$ and $d_t$

$$(ct/x)^2 d_x(d_x^2 + 1)^{1/2} = d_t(d_t^2 + 1)^{1/2}$$ (347)

or equivalently

$$d_t = \left\{-1/2 + 1/2\left[1 + 4(ct/x)^4 d_x^2(d_x^2 + 1)\right]^{1/2}\right\}^{1/2}$$ (348)

where $d_x > 0$ and $d_t > 0$. Because a = constant and t = constant there is no wave propagation in external space or time. This case corresponds to an evanescent propagation of the wave amplitude $\Psi$ in both internal space and internal time. However, the measured space coordinates $a_m$ and the measured time coordinate $t_m$ are changing during this process as shown by equations (330) and (339)

$$da_m/d\theta_t = -a\sin\theta_a\, d\theta_a/d\theta_t$$ (349)

$$dt_m/d\theta_t = -t\sin\theta_t$$ (350)

Therefore this case should be physically observable.

Case e. Ultrafast Process, Incoherent Space and Incoherent Time.

For this special case a simple solution to equation (278) is written for the case of three dimensional cartesian coordinates as

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (351)

$$\theta_\Psi = \sum_a e_a a + e_t t + e_0$$ (352)


where a and t are variables with a = x, y, z, and where $e_a$, $e_t$ and $e_0$ = constants. In this case, equations (79), (80), (162) and (163) give

$$E^{ui}_{\Psi a} = e_a^2 \qquad F^{ui}_{\Psi a} = 0$$ (353)

$$E^{ui}_{\Psi t} = e_t^2 \qquad F^{ui}_{\Psi t} = 0$$ (354)

where $e_a$ and $e_t$ are positive or negative real numbers. Combining equations (278), (353) and (354) gives

$$\sum_a e_a^2 = c^{-2}e_t^2 \qquad e_t = \pm c\left(\sum_a e_a^2\right)^{1/2}$$ (355)

which gives the relation between the constants $e_a$ and $e_t$ that is required in order for equations (351) and (352) to be a solution to the wave equation (278). For this solution equations (78) and (161) give the phase angle conditions as

$$\beta^{ui}_{\Psi\Psi a} = \pi/2 \qquad \beta^{ui}_{\Psi\Psi t} = \pi/2$$ (356)


For the case of one space dimension the wave function is

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (357)

$$\theta_\Psi = e_x x + e_t t + e_0$$ (358)

subject to the condition

$$e_t^2 = c^2 e_x^2 \qquad e_t = \pm ce_x$$ (359)

so that

$$\theta_\Psi = e_x(x \pm ct)$$ (360)

where the constant $e_0$ can always be absorbed into the multiplicative constant, so that finally

$$\bar{\Psi} = A\exp[je_x(x \pm ct)]$$ (361)

where x and t are variables. This special case corresponds to the propagation of the internal phase angle $\theta_\Psi$ in external space and time.
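To make the ultrafast solution concrete, the sketch below evaluates the phase of equation (352) with $e_t$ fixed by equation (355), and confirms that the magnitude of the wave function stays equal to A while only its internal phase changes. This is an illustration under stated assumptions: Python's `1j` stands in for the internal-space unit j, and the function name and numerical values are hypothetical.

```python
import cmath
import math

def ultrafast_phase(pos, t, e, c, e0=0.0, sign=+1):
    """Phase of equation (352), with e_t = +/- c (sum_a e_a^2)^(1/2) from equation (355)."""
    e_t = sign * c * math.sqrt(sum(ea**2 for ea in e))
    return sum(ea * a for ea, a in zip(e, pos)) + e_t * t + e0

# Illustrative one dimensional example (values assumed, not from the paper).
A, c = 2.0, 3.0
e_x, x, t = 0.8, 1.0, 0.25
theta = ultrafast_phase((x, 0.0, 0.0), t, (e_x, 0.0, 0.0), c)
psi = A * cmath.exp(1j * theta)      # equation (351)
print(theta, abs(psi))
```

For a single nonzero $e_x > 0$ and the positive sign, the phase reduces to $e_x(x + ct)$, which is equation (360).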

Case  f.  Ultrafast  Process,  Coherent  Space  and  Incoherent  Time. 

In this special case a simple solution to equation (289) is

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (362)

$$\theta_\Psi = \sum_a f_a\theta_a + f_t t + f_0$$ (363)

where $\theta_a$ and t are variables, a = x, y, z = constants, and $f_a$, $f_t$ and $f_0$ = constants. For this case equations (126), (127), (162) and (163) give

$$E^{uc}_{\Psi a} = f_a(f_a - 1) \qquad F^{uc}_{\Psi a} = 0$$ (364)

$$E^{ui}_{\Psi t} = f_t^2 \qquad F^{ui}_{\Psi t} = 0$$ (365)

From equations (126) and (364) it follows that $E^{uc}_{\Psi a} < 0$ and $0 < f_a \le 1$. Combining equations (289), (364) and (365) gives

$$\sum_a a^{-2}|f_a(f_a - 1)| = c^{-2}f_t^2$$ (366)

which can be written equivalently as

$$\sum_a a^{-2}f_a(1 - f_a) = c^{-2}f_t^2$$ (367)

or as

$$f_t = \pm c\left[\sum_a a^{-2}f_a(1 - f_a)\right]^{1/2}$$ (368)

and therefore $f_t$ can be positive or negative but always $0 < f_a < 1$. The condition in equations (366) or (367) relates the constants $f_a$ and $f_t$ and is required in order to ensure that equations (362) and (363) are a solution to the wave equation (289). For this special case the phase angle equations (130) and (161) give

$$\beta^{uc}_{\Psi\Psi a} = -\pi/2 \qquad \beta^{ui}_{\Psi\Psi t} = \pi/2$$ (369)

For a one dimensional space the solution is

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (370)

$$\theta_\Psi = f_x\theta_x + f_t t + f_0$$ (371)

subject to the condition

$$(c/x)^2 f_x(1 - f_x) = f_t^2$$ (372)

or equivalently as

$$f_t = \pm (c/x)\left[f_x(1 - f_x)\right]^{1/2}$$ (373)

where $f_x$ is a positive number satisfying $0 < f_x < 1$ while $f_t$ can be positive or negative. This case corresponds to the propagation of the internal phase angle $\theta_\Psi$ of the wave in external time and internal space.
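The one dimensional relation between $f_x$ and $f_t$ can be sketched as follows. This is a minimal example, assuming equation (372) reads (c/x)² f_x(1 − f_x) = f_t²; the function name and the sample values are illustrative only.

```python
import math

def f_t_values(f_x, x, c):
    """Both signs of f_t from equation (373) for a given f_x in [0, 1]."""
    if not 0.0 <= f_x <= 1.0:
        raise ValueError("f_x must lie between 0 and 1 for a real f_t")
    root = (c / x) * math.sqrt(f_x * (1.0 - f_x))
    return root, -root

# Illustrative values only (assumed, not taken from the paper).
f_x, x, c = 0.3, 2.0, 5.0
f_t_plus, f_t_minus = f_t_values(f_x, x, c)
lhs = (c / x) ** 2 * f_x * (1.0 - f_x)    # left side of equation (372)
print(f_t_plus, f_t_minus, lhs)
```

The magnitude of $f_t$ is largest at $f_x = 1/2$ and vanishes at the ends of the interval, which follows directly from the factor $f_x(1 - f_x)$.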

Case  g.  Ultrafast  Process,  Incoherent  Space  and  Coherent  Time. 

For this case the wave equation (300) is valid and has a solution of the form

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (374)

$$\theta_\Psi = \sum_a g_a a + g_t\theta_t + g_0$$ (375)

where a and $\theta_t$ are variables and t = constant, and where $g_a$, $g_t$ and $g_0$ = constants. For this special case equations (79), (80), (209) and (210) give

$$E^{ui}_{\Psi a} = g_a^2 \qquad F^{ui}_{\Psi a} = 0$$ (376)

$$E^{uc}_{\Psi t} = g_t(g_t - 1) \qquad F^{uc}_{\Psi t} = 0$$ (377)

Equations (209) and (377) give $0 < g_t \le 1$. Combining equation (300) with equations (376) and (377) gives the condition required for equations (374) and (375) to be a solution of the wave equation (300) as follows

$$\sum_a g_a^2 = (ct)^{-2}|g_t(g_t - 1)| = (ct)^{-2}g_t(1 - g_t)$$ (378)

Equation (378) then gives

$$g_t = 1/2 \pm 1/2\left[1 - (2ct)^2\sum_a g_a^2\right]^{1/2}$$ (379)

subject to

$$\sum_a g_a^2 \le (2ct)^{-2}$$ (380)

The constants $g_a$ can be positive or negative real numbers, while $g_t$ is always a positive real number. For this special case the phase angle equations (78) and (213) give

$$\beta^{ui}_{\Psi\Psi a} = \pi/2 \qquad \beta^{uc}_{\Psi\Psi t} = -\pi/2$$ (381)

For the case of one spatial dimension the solution to equation (301) is written as

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (382)

$$\theta_\Psi = g_x x + g_t\theta_t + g_0$$ (383)

subject to the following equation that connects $g_x$ and $g_t$

$$g_x^2 = (ct)^{-2}|g_t(g_t - 1)| = (ct)^{-2}g_t(1 - g_t)$$ (384)

with $0 < g_t < 1$. Equation (384) gives

$$g_t = 1/2 \pm 1/2\left[1 - (2ct)^2 g_x^2\right]^{1/2}$$ (385)

with

$$g_x \le (2ct)^{-1}$$ (386)

and

$$g_x = \pm (ct)^{-1}\left[g_t(1 - g_t)\right]^{1/2}$$ (387)

so that $g_x$ can be positive or negative.
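Equations (385) and (386) amount to solving a quadratic in $g_t$ with a reality condition on the discriminant. The sketch below illustrates this under the assumption that equation (384) reads g_x² = (ct)⁻² g_t(1 − g_t); the function name and the numbers are hypothetical.

```python
import math

def g_t_roots(g_x, c, t):
    """Both roots of equation (385); raises if the bound of equation (386) is violated."""
    disc = 1.0 - (2.0 * c * t * g_x) ** 2
    if disc < 0.0:
        raise ValueError("|g_x| exceeds (2ct)^-1, so no real g_t exists")
    root = math.sqrt(disc)
    return 0.5 + 0.5 * root, 0.5 - 0.5 * root

# Illustrative values only (assumed, not taken from the paper).
g_x, c, t = 0.1, 2.0, 1.0
upper, lower = g_t_roots(g_x, c, t)
for g_t in (upper, lower):
    # Each root should reproduce equation (384).
    print(g_t, g_t * (1.0 - g_t) / (c * t) ** 2, g_x**2)
```

Both roots lie between 0 and 1 and are mirror images about 1/2, so a given $g_x$ is compatible with two values of the internal time parameter.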

Case  h.  Ultrafast  Process,  Coherent  Space  and  Coherent  Time. 
A solution to equation (311) can be written as

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (388)

$$\theta_\Psi = \sum_a h_a\theta_a + h_t\theta_t + h_0$$ (389)

where $\theta_a$ and $\theta_t$ are variables, a and t are constants, and $h_a$, $h_t$ and $h_0$ are constants. For this special case equations (126), (127), (209) and (210) give

$$E^{uc}_{\Psi a} = h_a(h_a - 1) \qquad F^{uc}_{\Psi a} = 0$$ (390)

$$E^{uc}_{\Psi t} = h_t(h_t - 1) \qquad F^{uc}_{\Psi t} = 0$$ (391)

Equations (126), (209), (390) and (391) give $0 < h_a \le 1$ and $0 < h_t < 1$ so that $h_a$ and $h_t$ are positive real numbers. Combining equation (311) with equations (390) and (391) gives

$$\sum_a a^{-2}|h_a(h_a - 1)| = (ct)^{-2}|h_t(h_t - 1)|$$ (392)

or since $0 < h_a < 1$ and $0 < h_t < 1$ equation (392) can be rewritten as

$$\sum_a a^{-2}h_a(1 - h_a) = (ct)^{-2}h_t(1 - h_t)$$ (393)

as the condition relating the internal space parameters $h_a$ and the internal time parameter $h_t$ in order for equations (388) and (389) to be a solution to the wave equation (311). From equation (393) it follows that

$$h_t = 1/2 \pm 1/2\left[1 - (2ct)^2\sum_a a^{-2}h_a(1 - h_a)\right]^{1/2}$$ (394)

with

$$\sum_a a^{-2}h_a(1 - h_a) \le (2ct)^{-2}$$ (395)

For this special case the phase angle equations (130) and (213) become

$$\beta^{uc}_{\Psi\Psi a} = -\pi/2 \qquad \beta^{uc}_{\Psi\Psi t} = -\pi/2$$ (396)


For a one dimensional space the solution to equation (312) is

$$\bar{\Psi} = A\exp(j\theta_\Psi)$$ (397)

$$\theta_\Psi = h_x\theta_x + h_t\theta_t + h_0$$ (398)

subject to $0 < h_x < 1$ and $0 < h_t < 1$, and equation (392) becomes

$$|h_x(h_x - 1)| = [x/(ct)]^2|h_t(h_t - 1)|$$ (399)

Because $0 \le h_x < 1$ and $0 < h_t < 1$ equation (399) can be written as

$$h_x(1 - h_x) = [x/(ct)]^2 h_t(1 - h_t)$$ (400)

which relates $h_x$ and $h_t$. From equation (400) it follows that

$$h_t = 1/2 \pm 1/2\left[1 - (2ct/x)^2 h_x(1 - h_x)\right]^{1/2}$$ (401)

with

$$h_x(1 - h_x) \le [x/(2ct)]^2$$ (402)

This case represents wave propagation of the internal phase $\theta_\Psi$ in internal space and internal time.

Consider now an alternative analysis of the special case of an ultrafast wave propagation process in coherent space and coherent time. The solution to the wave equation (16) is written as

$$\bar{\Psi} = \Psi\exp(j\theta_\Psi) = \bar{X}_x\bar{X}_y\bar{X}_z\bar{T}$$ (403)

Then it follows that

$$\Psi = X_x X_y X_z T$$ (404)

$$\theta_\Psi = \sum_a \theta_{Xa} + \theta_T = \theta_{Xx} + \theta_{Xy} + \theta_{Xz} + \theta_T$$ (405)

Substituting equation (403) into equation (16) yields the following complex number generalization of the standard scalar time equation and the standard one dimensional scalar homogeneous Helmholtz equations^{18-24}

$$d^2\bar{T}/d\bar{t}^2 + \bar{\omega}^2\bar{T} = 0$$ (406)

$$d^2\bar{X}_a/d\bar{a}^2 + \bar{k}_a^2\bar{X}_a = 0$$ (407)

where a = x, y, z and

$$\bar{T} = T\exp(j\theta_T)$$ (408)

$$\bar{X}_a = X_a\exp(j\theta_{Xa})$$ (409)

and where the complex number angular frequency and wave number are written as

$$\bar{\omega} = \omega\exp(j\theta_\omega)$$ (410)

$$\bar{k}_a = k_a\exp(j\theta_{ka})$$ (411)

It is required to solve equations (406) and (407).

The forms of the complex number second derivatives that appear in equations (406) and (407) have already appeared in equations (137) and (220). Applying equations (137) and (220) to equations (406) and (407) and dividing through by $\bar{T}$ and $\bar{X}_a$ respectively gives

$$\frac{d\theta_T}{d\theta_t}\left(1 - \frac{d\theta_T}{d\theta_t}\right) + j\frac{d^2\theta_T}{d\theta_t^2} = \bar{t}^2\bar{\omega}^2$$ (412)

$$\frac{d\theta_{Xa}}{d\theta_a}\left(1 - \frac{d\theta_{Xa}}{d\theta_a}\right) + j\frac{d^2\theta_{Xa}}{d\theta_a^2} = \bar{a}^2\bar{k}_a^2$$ (413)

where t and a = constants. But for wave propagation to be possible it has been shown that $\bar{\omega}\bar{t}$ and $\bar{k}_a\bar{a}$ must be real numbers.^{1} Therefore

$$\bar{t}^2\bar{\omega}^2 = t^2\omega^2 \qquad \theta_\omega = -\theta_t$$ (414)

$$\bar{a}^2\bar{k}_a^2 = a^2k_a^2 \qquad \theta_{ka} = -\theta_a$$ (415)

Combining equations (412) through (415) gives

$$\frac{d\theta_T}{d\theta_t}\left(1 - \frac{d\theta_T}{d\theta_t}\right) = t^2\omega^2$$ (416)

$$\frac{d^2\theta_T}{d\theta_t^2} = 0$$ (417)

$$\frac{d\theta_{Xa}}{d\theta_a}\left(1 - \frac{d\theta_{Xa}}{d\theta_a}\right) = a^2k_a^2$$ (418)

$$\frac{d^2\theta_{Xa}}{d\theta_a^2} = 0$$ (419)

The solutions to equations (416) through (419) are given by

$$\theta_T = h_t\theta_t + c_1$$ (420)

$$\theta_{Xa} = h_a\theta_a + c_2$$ (421)

provided that

$$h_t(1 - h_t) = t^2\omega^2$$ (422)

$$h_a(1 - h_a) = a^2k_a^2$$ (423)

so that $0 < h_t \le 1$ and $0 < h_a < 1$. Equivalently, equations (422) and (423) can be written as

$$h_t = 1/2 \pm 1/2(1 - 4t^2\omega^2)^{1/2}$$ (424)

$$h_a = 1/2 \pm 1/2(1 - 4a^2k_a^2)^{1/2}$$ (425)

subject to

$$\omega \le (2t)^{-1} \qquad k_a \le (2a)^{-1}$$ (426)

Equations (424) and (425) are equivalent to equation (394), and equation (426) is equivalent to equation (395). This can be seen by noting that equation (423) gives

$$\sum_a a^{-2}h_a(1 - h_a) = \sum_a k_a^2 = k^2 = \omega^2/c^2$$ (427)

so that equation (422) reproduces equation (393). Combining equations (405), (420) and (421) gives

$$\theta_\Psi = \sum_a h_a\theta_a + h_t\theta_t + h_0$$

which is just equation (389). Equations (412) and (413) or equivalently equations (416) through (419) describe internal phase wave propagation that occurs at a fixed point (a,t) in spacetime.
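The consistency between the separated solution and equation (393) can be illustrated numerically. The sketch below, which is an example under stated assumptions and not the author's procedure, picks wave numbers, sets ω from equation (427), computes the phase slopes from equations (424) and (425), and compares the two sides of equation (393). The function name and all values are hypothetical.

```python
import math

def phase_slopes(product):
    """Both roots h of h(1 - h) = product^2, in the form of equations (424) and (425)."""
    disc = 1.0 - 4.0 * product**2
    if disc < 0.0:
        raise ValueError("bound of equation (426) violated: product exceeds 1/2")
    root = math.sqrt(disc)
    return 0.5 + 0.5 * root, 0.5 - 0.5 * root

# Illustrative values only (assumed, not taken from the paper).
c, t = 2.0, 0.1
coords = (1.0, 2.0, 0.5)                           # magnitudes x, y, z
k = (0.1, 0.2, 0.2)                                # wave numbers k_a
omega = c * math.sqrt(sum(ka**2 for ka in k))      # equation (427)

h_t = phase_slopes(t * omega)[0]                   # equation (424), upper sign
h = [phase_slopes(a * ka)[0] for a, ka in zip(coords, k)]   # equation (425)

lhs = sum(ha * (1.0 - ha) / a**2 for ha, a in zip(h, coords))
rhs = h_t * (1.0 - h_t) / (c * t) ** 2
print(lhs, rhs)                                    # the two sides of equation (393)
```

Choosing the lower sign in either root leaves both sides unchanged, because only the product h(1 − h) enters equation (393).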


5. CONCLUSION. The wave equation is somewhat more complex than it is generally thought to be. In the presence of an external field such as gravitation or electromagnetism, the wave amplitude and the space and time coordinates can be taken to be complex numbers in an internal space. For this case the wave equation has eight possible limiting forms according to whether the wave amplitude changes slowly or in an ultrafast manner and according to whether the change in the space coordinates is coherent or incoherent and the change in the time coordinates is coherent or incoherent. For a slow wave propagation process the wave amplitude changes in magnitude whereas for the case of an ultrafast wave propagation process the wave function changes by a rotation in internal space. Similarly, incoherent space and time changes occur through a variation of the magnitudes of the space and time coordinates, whereas coherent space and time changes occur as rotations in an internal space. The eight possible forms of the wave equation have unique solutions with their own special characteristics and suggest that new and as yet unobserved phenomena can occur for wave propagation in matter in the presence of strong external fields such as may be realized in stellar compact objects.
CLEAN  FISSION  NUCLEAR  REACTORS


Richard  A.  Weiss 

U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. This paper develops the basic physics concepts necessary for the design of clean fission nuclear reactors. The concepts are based on the creation of a broken symmetry state of the atomic number and the atomic mass number when nuclei are placed in an electromagnetic field. In an electromagnetic field the atomic number and atomic mass number of a nucleus must be represented by special types of complex numbers in an internal space. The nuclear radius, binding energy and mass must also be represented by complex numbers, and therefore a liquid drop model type of nuclear mass formula is developed that incorporates complex number values of the atomic number and atomic mass number. The Bohr-Wheeler condition for spontaneous and thermal neutron induced nuclear fission is evaluated for nuclei in the presence of an electromagnetic field. The electromagnetic field introduces the internal phase angles of the atomic number and atomic mass number into the fission condition in such a way that nuclear fission with thermal neutrons can occur in nuclei lighter than the actinides. It is found that the wavelength of the electromagnetic waves required for the catalysis of clean fission nuclear reactions with thermal neutrons is in the γ ray region, and in particular that the energy of the γ rays corresponds to the giant dipole resonance frequency of the subactinide element that is chosen as nuclear fuel. This suggests that nuclei in the presence of γ rays can be fissioned by thermal neutrons even for values of Z and A sufficiently small as to prohibit fission under zero field conditions. A formula is presented that allows the calculation of the energy and power density of the γ rays required to catalyze the sustained nuclear fission of elements lighter than the actinides by thermal neutrons. The γ ray catalyzed clean fission nuclear reactors will produce only low level nuclear waste products or possibly no radioactive wastes at all. Design concepts for a γ ray catalyzed thermal neutron induced clean fission nuclear reactor are presented which will deliver clean power with little or no radioactive waste products. In fact, the nuclear wastes of present day uranium and plutonium reactors can be used as fuel elements in clean fission nuclear reactors, and in this way stored waste radionuclides can be eliminated in a useful way to generate power.

1. INTRODUCTION. The generation of power has always been at high cost to living organisms. For millennia, and still today, beasts of burden have been used to generate power for agricultural purposes, transportation and mining.^{1} With the advent of the industrial revolution came the engines and machines which required wood, coal and petroleum products as fuel. The combustion of millions of tons of coal and gallons of gasoline pollutes the atmosphere with carbon dioxide and sulfur dioxide among many other chemical compounds, some of which are carcinogens.^{2-8} These pollutants enter the atmosphere and cause damage to forests by producing acid rain, and damage to people by an increased incidence of lung cancer, emphysema and other diseases.^{2-8} Nuclear fission reactors, once thought to be the power source of the future, are now known to be a potentially dangerous way of generating power because they are susceptible to accidents and because they produce radioactive waste products whose safe disposal still challenges engineers and scientists a half century after the first nuclear reactor was developed.^{9-15} After the Three Mile Island and Chernobyl nuclear accidents the lack of public confidence in fission reactor power plants has brought the construction of new power plants to a standstill.

Alternative energy sources are now one of the major research endeavors of science and engineering.^{16-22} Wind power has been utilized on a small scale in a few areas but it cannot power a city like Chicago.^{23-25} Solar power can be used effectively to heat single family dwellings in geographic areas which have generally clear skies, but it is of limited use on a large scale and cannot be used to power New York City.^{26-29} Geothermal energy is clearly of use only in limited geographic regions and has produced no significant contribution to the world's power generation capacity and it cannot be used to power a city like Tokyo.^{30-32} Nuclear fusion power still remains a dream after nearly a half century of research and billions of dollars of investment.^{33-45} All of these energy sources are impractical alternatives to nuclear fission power. If the safety and nuclear waste problems can be solved it appears that nuclear fission reactors still represent the best hope for future power development.^{46-55} But new concepts and designs for clean and environmentally safe nuclear reactors are required. This paper presents the mathematical and physical theory for the development of γ ray catalyzed clean fission nuclear reactors which have little or no dangerous radionuclide waste products, and which are operationally safe. Design concepts for clean fission nuclear reactor cores are presented.

The  clean  fission  nuclear  processes  that  are  described  in  this  paper  are 
based  on  the  concept  of  the  broken  symmetry  of  quantum  numbers  and  space  and 
time  coordinates.56  It  has  been  suggested  that  space  and  time  coordinates  have 
internal  phase  angles  and  are  written  as  complex  numbers  as  follows56 

$$ \bar{x} = x \exp(j\theta_x) \qquad \bar{y} = y \exp(j\theta_y) \qquad \bar{z} = z \exp(j\theta_z) \tag{1} $$

$$ \bar{t} = t \exp(j\theta_t) \tag{2} $$

where the phase angles describe an internal space. For spherical polar
coordinates the complex number space coordinates are written as56

$$ \bar{r} = r \exp(j\theta_r) \qquad \bar{\phi} = \phi \exp(j\theta_\phi) \qquad \bar{\psi} = \psi \exp(j\theta_\psi) \tag{3} $$

Corresponding to the complex number azimuthal angle is the following complex
magnetic quantum number56

$$ \bar{M} = M \exp(j\theta_M) = m \cos\theta_\phi \exp(-j\theta_\phi) \tag{4} $$

where $m = 0, \pm 1, \pm 2, \pm 3, \ldots$ is the standard magnetic quantum number. Therefore56

$$ M = m \cos\theta_\phi \qquad \theta_M = -\theta_\phi \tag{5} $$

It is often convenient to define the following positive magnetic quantum number56

$$ \bar{M}' = M' \exp(j\theta'_M) = |m| \cos\theta_\phi \exp(-j\theta_\phi) \tag{6} $$

$$ M' = |m| \cos\theta_\phi \qquad \theta'_M = -\theta_\phi \tag{7} $$

For $\theta_\phi = 0$ these quantities reduce to the standard concepts.
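As a quick numerical illustration of equations (4) through (7), the following minimal Python sketch evaluates the complex magnetic quantum number for a given integer m and internal phase angle. The function names and the sample angle are assumptions chosen for illustration only; they do not appear in the paper.

```python
import cmath
import math


def complex_magnetic_number(m, theta_phi):
    """Equation (4): M_bar = m cos(theta_phi) exp(-j theta_phi)."""
    return m * math.cos(theta_phi) * cmath.exp(-1j * theta_phi)


def magnitude_and_phase(m, theta_phi):
    """Equation (5): M = m cos(theta_phi), theta_M = -theta_phi."""
    return m * math.cos(theta_phi), -theta_phi


def positive_magnitude_and_phase(m, theta_phi):
    """Equation (7): M' = |m| cos(theta_phi), theta'_M = -theta_phi."""
    return abs(m) * math.cos(theta_phi), -theta_phi


if __name__ == "__main__":
    theta_phi = 0.2  # illustrative internal phase angle in radians (assumed)
    for m in (-2, -1, 0, 1, 2):
        M, theta_M = magnitude_and_phase(m, theta_phi)
        print(m, complex_magnetic_number(m, theta_phi), M, theta_M)
```

Setting the phase angle to zero returns the ordinary integer m, which is the reduction to the standard concepts stated above.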

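Under the action of a time varying external field, such as an electromagnetic
wave, or simply a spontaneous decay, the magnetic quantum number has the
following time variation

$$ d\bar{M}/d\bar{t} = F_M \cos\beta_{tt} \exp[j(\theta_M - \theta_t - \beta_{tt})] \tag{8} $$

where

$$ \begin{aligned}
F_M &= dm/dt \, \cos\theta_M - m \sin\theta_M \, d\theta_M/dt + j m \cos\theta_M \, d\theta_M/dt \\
    &= dm/dt \, \cos\theta_M + m \, d\theta_M/dt \, \exp[j(\pi/2 + \theta_M)] \\
    &= dm/dt \, \cos\theta_\phi - m \sin\theta_\phi \, d\theta_\phi/dt - j m \cos\theta_\phi \, d\theta_\phi/dt \\
    &= dm/dt \, \cos\theta_\phi - m \, d\theta_\phi/dt \, \exp[j(\pi/2 - \theta_\phi)]
\end{aligned} \tag{9} $$

and where

$$ \tan\beta_{tt} = t \, \partial\theta_t/\partial t \tag{10} $$

It is assumed that $\theta_M = \theta_M(m,H)$ and $\theta_\phi = \theta_\phi(m,H)$ where H = applied magnetic field
strength. The application of the chain rule gives the following derivatives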

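$$ d\theta_M/dt = (\partial\theta_M/\partial m) \, dm/dt + (\partial\theta_M/\partial H) \, dH/dt \tag{11} $$

$$ d\theta_\phi/dt = (\partial\theta_\phi/\partial m) \, dm/dt + (\partial\theta_\phi/\partial H) \, dH/dt \tag{12} $$

Then equation (8) can be written as

$$ d\bar{M}/d\bar{t} = (\bar{B}_M \, dm/dt + \bar{C}_M \, dH/dt) \cos\beta_{tt} \exp[j(\theta_M - \theta_t - \beta_{tt})] \tag{13} $$

where

$$ \begin{aligned}
\bar{B}_M &= \cos\theta_M + m (\partial\theta_M/\partial m) \exp[j(\pi/2 + \theta_M)] \\
          &= \cos\theta_\phi - m (\partial\theta_\phi/\partial m) \exp[j(\pi/2 - \theta_\phi)]
\end{aligned} \tag{14} $$

$$ \begin{aligned}
\bar{C}_M &= m (\partial\theta_M/\partial H) \exp[j(\pi/2 + \theta_M)] \\
          &= -m (\partial\theta_\phi/\partial H) \exp[j(\pi/2 - \theta_\phi)]
\end{aligned} \tag{15} $$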
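The step from equation (9) to equations (13) through (15) is a direct substitution of the chain rule (11). A minimal Python sketch of that check follows; the function names and the numerical inputs are hypothetical, and the common factor cos β_tt exp[j(θ_M − θ_t − β_tt)] is omitted because it multiplies both forms equally.

```python
import cmath
import math


def f_direct(m, theta_M, dm_dt, dtheta_dt):
    """First line of equation (9), written with the total derivative of theta_M."""
    return (dm_dt * math.cos(theta_M)
            - m * math.sin(theta_M) * dtheta_dt
            + 1j * m * math.cos(theta_M) * dtheta_dt)


def f_chain_rule(m, theta_M, dm_dt, dH_dt, dth_dm, dth_dH):
    """Bracket of equation (13): B_M dm/dt + C_M dH/dt, with (14) and (15)."""
    phase = cmath.exp(1j * (math.pi / 2 + theta_M))
    B_M = math.cos(theta_M) + m * dth_dm * phase
    C_M = m * dth_dH * phase
    return B_M * dm_dt + C_M * dH_dt


if __name__ == "__main__":
    # Hypothetical sample values, chosen only to exercise the identity.
    m, theta_M = 2, 0.3
    dm_dt, dH_dt = -0.4, 0.7
    dth_dm, dth_dH = 0.05, 0.1
    dtheta_dt = dth_dm * dm_dt + dth_dH * dH_dt  # equation (11)
    print(f_direct(m, theta_M, dm_dt, dtheta_dt))
    print(f_chain_rule(m, theta_M, dm_dt, dH_dt, dth_dm, dth_dH))
```

The two printed values should agree to rounding error for any choice of inputs.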

Two special cases are of interest.

Case a. Constant Magnetic Field.

For this case equation (13) becomes

$$ (d\bar{M}/d\bar{t})_H = \bar{B}_M \cos\beta_{tt} \, dm/dt \, \exp[j(\theta_M - \theta_t - \beta_{tt})] \tag{16} $$

which corresponds to spontaneous decay for a fixed magnetic field.

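Case b. Constant m.

From equations (13) and (15) it follows that for this case

$$ \begin{aligned}
(d\bar{M}/d\bar{t})_m &= \bar{C}_M \, dH/dt \, \cos\beta_{tt} \exp[j(\theta_M - \theta_t - \beta_{tt})] \\
  &= m (\partial\theta_M/\partial H) \, dH/dt \, \cos\beta_{tt} \exp[j(\pi/2 + 2\theta_M - \theta_t - \beta_{tt})] \\
  &= -m (\partial\theta_\phi/\partial H) \, dH/dt \, \cos\beta_{tt} \exp[j(\pi/2 - 2\theta_\phi - \theta_t - \beta_{tt})] \\
  &= \bar{M} \sec\theta_M (\partial\theta_M/\partial H) \, dH/dt \, \cos\beta_{tt} \exp[j(\pi/2 + \theta_M - \theta_t - \beta_{tt})]
\end{aligned} \tag{17} $$

which corresponds to a decay associated with a change in the internal phase angle
of the azimuthal angle that is induced by a time varying magnetic field.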

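A more general expression for the time derivative of the complex magnetic
quantum number can be written as

$$ d\bar{M}/d\bar{t} = \sec\beta_{MM} \cos\beta_{tt} \, dM/dt \, \exp(j\Phi_{Mt}) \tag{18} $$

$$ = \csc\beta_{MM} \cos\beta_{tt} \, M \, d\theta_M/dt \, \exp(j\Phi_{Mt}) \tag{19} $$

$$ = \sec\beta_{MM} \sin\beta_{tt} \, t^{-1} \, dM/d\theta_t \, \exp(j\Phi_{Mt}) \tag{20} $$

$$ = \csc\beta_{MM} \sin\beta_{tt} \, (M/t) \, d\theta_M/d\theta_t \, \exp(j\Phi_{Mt}) \tag{21} $$

$$ \tan\beta_{MM} = M \, \partial\theta_M/\partial M = -M \, \partial\theta_\phi/\partial M \tag{22} $$

$$ \Phi_{Mt} = \theta_M + \beta_{MM} - \theta_t - \beta_{tt} \tag{23} $$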

From  equation  (5)  it  follows  that  the  time  rate  of  change  of  the  magnitude  of  the 
magnetic  quantum  number  is  given  by 

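$$ dM/dt = dm/dt \, \cos\theta_\phi - m \sin\theta_\phi \, d\theta_\phi/dt \tag{24} $$

Combining equations (12) and (24) gives

$$ \begin{aligned}
dM/dt &= dm/dt \, [\cos\theta_\phi - m \sin\theta_\phi \, \partial\theta_\phi/\partial m] - m \sin\theta_\phi (\partial\theta_\phi/\partial H) \, dH/dt \\
      &= dm/dt \, [\cos\theta_M - m \sin\theta_M \, \partial\theta_M/\partial m] - m \sin\theta_M (\partial\theta_M/\partial H) \, dH/dt
\end{aligned} \tag{25} $$

$$ \cos^2\theta_M = \tfrac{1}{2}[1 + (1 - 4f^2)^{1/2}] \tag{38} $$

$$ f = m^{-1}(m_1 \sin\theta_{M1} \cos\theta_{M1} + m_2 \sin\theta_{M2} \cos\theta_{M2}) \tag{39} $$

where m is given by equation (34).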

This brief summary of the theory of broken symmetry space and time coordinates
and broken symmetry magnetic quantum numbers is sufficient to begin a
study of the effects of broken symmetry on the quantum numbers of atomic nuclei.
It is suggested in this paper that in the presence of a γ ray electromagnetic
field the atomic number Z and the atomic mass number A are integer quantum numbers,
analogous to |m| of equation (7), which have complex number generalizations
$\bar{z}$ and $\bar{a}$ respectively, analogous to $\bar{M}'$ of equation (7), which determine the binding
energy, symmetry energy, Coulomb energy, nuclear surface energy, giant resonance
frequency, incompressibility, fissility and many other nuclear properties.
It is also suggested that the number of nuclei located in an electromagnetic
field or gravitational field has an associated internal phase angle, which allows
the possibility of coherent radioactive decays in which the integer number
of nuclei is fixed but the internal phase angle of the number of nuclei is changing.
The theory of nuclear fission will be affected by the concept that the atomic
number and atomic mass number must be taken to be complex numbers in an internal
space when nuclei are located in the presence of an electromagnetic field. It is
well known that some of the actinide elements either fission spontaneously
or undergo induced fission by thermal neutrons, but present day theory suggests
that spontaneous or thermal neutron induced fission does not occur for nuclei
lighter than the actinides. The main thrust of this paper is the development of
a theory of γ ray catalyzed spontaneous and thermal neutron induced fission
reactions of nuclei lighter than the actinides but heavier than 56Fe. The theory
suggests that thermal neutron induced clean fission of subactinide elements can
be accomplished by placing the potential nuclear fuel in a γ ray field whose
energy (frequency) is determined to correspond to the giant dipole resonance
frequency of the subactinide fuel element, and whose power density is adjusted
until the internal phase angle of the atomic number attains a critical value, at
which point the fuel nuclei are in a state for which clean nuclear fission is
possible. Because the fuel element to be fissioned has a relatively low atomic
mass number, such as 63Cu for example, the fission products will also be relatively
light elements which are in or not far removed from the valley of beta
stability and therefore are either not radioactive or are low level beta, alpha,
and neutron emitters. These lighter than actinide nuclei cannot be used as fuel
in conventional reactors.


An outline of this paper is as follows: Section 2 introduces the concept
of complex atomic numbers and complex atomic mass numbers and investigates their
addition laws and time dependence. Section 3 considers the radioactive decay of
nuclei located in an electromagnetic field. Section 4 develops a liquid drop
model type of nuclear mass formula for nuclei in an electromagnetic field.
Section 5 investigates the effect of an electromagnetic field on the spontaneous
and thermal neutron induced fission condition for nuclei and demonstrates the
possibility of clean fission in lighter than actinide nuclei. Section 6 studies
the final state energy conditions for the clean fission of nuclei lighter than
the actinides but heavier than 56Fe. Section 7 presents numerical calculations

Combining equations (11), (18), (19) and (25) gives

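$$ \begin{aligned}
d\bar{M}/d\bar{t} &= J_M \cos\beta_{tt} \sec\beta_{MM} \cos\theta_M \exp(j\Phi_{Mt}) \\
  &= I_M \cos\beta_{tt} \csc\beta_{MM} \cos\theta_M \exp(j\Phi_{Mt}) \\
  &= (I_M^2 + J_M^2)^{1/2} \cos\beta_{tt} \cos\theta_M \exp(j\Phi_{Mt})
\end{aligned} \tag{26} $$

where now

$$ \tan\beta_{MM} = I_M / J_M \tag{27} $$

with

$$ I_M = \tan\alpha^M_{mm} \, dm/dt + (m/H) \tan\alpha^M_{HH} \, dH/dt \tag{28} $$

$$ J_M = (1 - \tan\theta_M \tan\alpha^M_{mm}) \, dm/dt - (m/H) \tan\theta_M \tan\alpha^M_{HH} \, dH/dt \tag{29} $$

where

$$ \tan\alpha^M_{mm} = m \, \partial\theta_M/\partial m \tag{30} $$

$$ \tan\alpha^M_{HH} = H \, \partial\theta_M/\partial H \tag{31} $$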

Equation  (26)  is  equivalent  to  equation  (13). 
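The stated equivalence of equations (26) and (13) can be checked numerically. The following Python sketch is only an illustration under assumed inputs: the function names and sample values are hypothetical, the common factor cos β_tt exp[−j(θ_t + β_tt)] is dropped from both sides, and β_MM is obtained with `atan2(I_M, J_M)` so that the quadrant implied by equation (27) is resolved.

```python
import cmath
import math


def rate_eq13(m, H, theta_M, dth_dm, dth_dH, dm_dt, dH_dt):
    """Equation (13) with (14) and (15), without the common time-phase factor."""
    phase = cmath.exp(1j * (math.pi / 2 + theta_M))
    B_M = math.cos(theta_M) + m * dth_dm * phase
    C_M = m * dth_dH * phase
    return (B_M * dm_dt + C_M * dH_dt) * cmath.exp(1j * theta_M)


def rate_eq26(m, H, theta_M, dth_dm, dth_dH, dm_dt, dH_dt):
    """Third form of equation (26), using equations (27) through (31)."""
    tan_a_mm = m * dth_dm                      # equation (30)
    tan_a_HH = H * dth_dH                      # equation (31)
    I_M = tan_a_mm * dm_dt + (m / H) * tan_a_HH * dH_dt
    J_M = ((1 - math.tan(theta_M) * tan_a_mm) * dm_dt
           - (m / H) * math.tan(theta_M) * tan_a_HH * dH_dt)
    beta_MM = math.atan2(I_M, J_M)             # equation (27), quadrant-aware
    return (math.hypot(I_M, J_M) * math.cos(theta_M)
            * cmath.exp(1j * (theta_M + beta_MM)))


if __name__ == "__main__":
    args = (2, 1.5, 0.3, 0.05, 0.1, -0.4, 0.7)  # hypothetical sample values
    print(rate_eq13(*args))
    print(rate_eq26(*args))
```

Both functions should return the same complex number for any nonzero H.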

Consider two complex magnetic quantum numbers which can be written as

$$ \bar{M}_1 = M_1 \exp(j\theta_{M1}) = m_1 \cos\theta_{M1} \exp(j\theta_{M1}) \tag{32} $$

$$ \bar{M}_2 = M_2 \exp(j\theta_{M2}) = m_2 \cos\theta_{M2} \exp(j\theta_{M2}) \tag{33} $$

where the following integer addition law is always true

$$ m = m_1 + m_2 \tag{34} $$

where $m$, $m_1$ and $m_2 = 0, \pm 1, \pm 2, \pm 3, \ldots$ . Then the addition law for the complex
magnetic quantum numbers is written as

$$ \bar{M} + W = \bar{M}_1 + \bar{M}_2 \tag{35} $$

where

$$ \bar{M} = M \exp(j\theta_M) = m \cos\theta_M \exp(j\theta_M) \tag{36} $$

where the unknown quantities W and $\theta_M$ are given by56

$$ W = -(m/2)[1 + (1 - 4f^2)^{1/2}] + m_1 \cos^2\theta_{M1} + m_2 \cos^2\theta_{M2} \tag{37} $$

of the γ ray photon energy and flux density that are required to catalyze the
clean fission of subactinide nuclei and gives examples of clean fission nuclear
reactions. Finally, Section 8 presents design concepts for a γ ray catalyzed
thermal neutron induced clean fission nuclear reactor.

2. ATOMIC NUCLEI WITH BROKEN INTERNAL SYMMETRIES. This section considers
the broken internal symmetries associated with the atomic number, neutron number
and atomic mass number of nuclei that are located in an electromagnetic or
gravitational field.

A. Complex Atomic Number, Neutron Number and Atomic Mass Number.

It is assumed that the integer quantum numbers Z, N and A are analogous to
the magnetic quantum number |m| and occur in a solution to an azimuthal portion
of a wave equation in internal space.56 Then by an argument similar to that for
the complex number magnetic quantum number it follows that the atomic number,
neutron number and atomic mass number are complex numbers in an internal space
and can be written in a form similar to that of the complex magnetic quantum
number $\bar{M}'$ in equation (6) as follows56

$$ \bar{z} = z \exp(j\theta_z) = Z \cos\theta_z \exp(j\theta_z) \tag{40} $$

$$ \bar{n} = n \exp(j\theta_n) = N \cos\theta_n \exp(j\theta_n) \tag{41} $$

$$ \bar{a} = a \exp(j\theta_a) = A \cos\theta_a \exp(j\theta_a) \tag{42} $$


where the complex number azimuthal angles corresponding to the quantum numbers
$\bar{z}$, $\bar{n}$ and $\bar{a}$ are written, in analogy to the azimuthal angle of real space given in
equation (3), as follows

$$ \bar{\phi}_z = \phi_z \exp(j\theta_{\phi z}) \qquad \bar{\phi}_n = \phi_n \exp(j\theta_{\phi n}) \qquad \bar{\phi}_a = \phi_a \exp(j\theta_{\phi a}) \tag{43} $$

where $\theta_{\phi z}$, $\theta_{\phi n}$ and $\theta_{\phi a}$ are the internal phase angles of the complex number
azimuthal angles in the internal space of nucleons.56 The wave equations in
internal space are written, in analogy to the complex number azimuthal equation of
Reference 56, as

$$ d^2\bar{\Phi}_z/d\bar{\phi}_z^2 + \bar{z}^2 \bar{\Phi}_z = 0 \tag{44} $$

$$ d^2\bar{\Phi}_n/d\bar{\phi}_n^2 + \bar{n}^2 \bar{\Phi}_n = 0 \tag{45} $$

$$ d^2\bar{\Phi}_a/d\bar{\phi}_a^2 + \bar{a}^2 \bar{\Phi}_a = 0 \tag{46} $$

The nuclear wave function has a form which is analogous to the azimuthal
wave function of the central field problem with a broken internal symmetry of
the azimuthal angle and is written as

$$ \bar{\Phi} = \bar{\Phi}_z \bar{\Phi}_n \bar{\Phi}_a = C \exp[i(\bar{z}\bar{\phi}_z + \bar{n}\bar{\phi}_n + \bar{a}\bar{\phi}_a)] \tag{47A} $$

whereas the azimuthal wave function of the central field problem with broken
internal symmetries is written as56

$$ \bar{\Phi} = C \exp(i\bar{M}\bar{\phi}) \tag{47B} $$


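Strictly speaking $\bar{z}$, $\bar{n}$ and $\bar{a}$ correspond to $\bar{M}'$ of equation (6). The measured
values of the angles that appear in equation (47A) are given by

$$ \phi_{zm} = \phi_z \cos\theta_{\phi z} \qquad \phi_{nm} = \phi_n \cos\theta_{\phi n} \qquad \phi_{am} = \phi_a \cos\theta_{\phi a} \tag{48} $$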


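If the wave function is to remain unchanged under the transformations

$$ \phi_{zm} \to \phi_{zm} + 2\pi \qquad \phi_{nm} \to \phi_{nm} + 2\pi \qquad \phi_{am} \to \phi_{am} + 2\pi \tag{49} $$

then the argument in equation (47A) must be a real number

$$ \bar{z}\bar{\phi}_z = z\phi_z \qquad \theta_z + \theta_{\phi z} = 0 \tag{50} $$

$$ \bar{n}\bar{\phi}_n = n\phi_n \qquad \theta_n + \theta_{\phi n} = 0 \tag{51} $$

$$ \bar{a}\bar{\phi}_a = a\phi_a \qquad \theta_a + \theta_{\phi a} = 0 \tag{52} $$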

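By writing the complex quantum numbers $\bar{z}$, $\bar{n}$ and $\bar{a}$ as

$$ \bar{z} = z_R + j z_I = z(\cos\theta_z + j \sin\theta_z) \tag{53} $$

$$ \bar{n} = n_R + j n_I = n(\cos\theta_n + j \sin\theta_n) \tag{54} $$

$$ \bar{a} = a_R + j a_I = a(\cos\theta_a + j \sin\theta_a) \tag{55} $$

and the complex number azimuthal angles associated with $\bar{z}$, $\bar{n}$ and $\bar{a}$ as

$$ \bar{\phi}_z = \phi_{zR} + j\phi_{zI} = \phi_z(\cos\theta_{\phi z} + j \sin\theta_{\phi z}) \tag{56} $$

$$ \bar{\phi}_n = \phi_{nR} + j\phi_{nI} = \phi_n(\cos\theta_{\phi n} + j \sin\theta_{\phi n}) \tag{57} $$

$$ \bar{\phi}_a = \phi_{aR} + j\phi_{aI} = \phi_a(\cos\theta_{\phi a} + j \sin\theta_{\phi a}) \tag{58} $$

it follows that

$$ \bar{z}\bar{\phi}_z = z_R\phi_{zR} - z_I\phi_{zI} + j(z_I\phi_{zR} + z_R\phi_{zI}) \tag{59} $$

$$ \bar{n}\bar{\phi}_n = n_R\phi_{nR} - n_I\phi_{nI} + j(n_I\phi_{nR} + n_R\phi_{nI}) \tag{60} $$

$$ \bar{a}\bar{\phi}_a = a_R\phi_{aR} - a_I\phi_{aI} + j(a_I\phi_{aR} + a_R\phi_{aI}) \tag{61} $$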


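If the imaginary parts of equations (59) through (61) are zero, as they must be
if equations (50) through (52) are valid, then

$$ \phi_{zI} = -z_I\phi_{zR}/z_R \qquad \phi_{nI} = -n_I\phi_{nR}/n_R \qquad \phi_{aI} = -a_I\phi_{aR}/a_R \tag{62} $$

Substituting equation (62) into equations (59) through (61) gives

$$ \bar{z}\bar{\phi}_z = (z_R^2 + z_I^2)\phi_{zR}/z_R = z^2\phi_{zR}/z_R = z^2\phi_{zm}/z_R \tag{63} $$

$$ \bar{n}\bar{\phi}_n = (n_R^2 + n_I^2)\phi_{nR}/n_R = n^2\phi_{nR}/n_R = n^2\phi_{nm}/n_R \tag{64} $$

$$ \bar{a}\bar{\phi}_a = (a_R^2 + a_I^2)\phi_{aR}/a_R = a^2\phi_{aR}/a_R = a^2\phi_{am}/a_R \tag{65} $$

Therefore the argument in equation (47A) is linear in $\phi_{zm}$, $\phi_{nm}$ and $\phi_{am}$.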


If the wave function in equation (47A) is to remain unchanged under the
transformations in equation (49) it follows from equations (63) through (65) that

$$ z^2/z_R = Z \qquad n^2/n_R = N \qquad a^2/a_R = A \tag{66} $$


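where Z, N and A are integers. In analogy to equations (5) and (7) for the
magnetic quantum numbers, the combination of equations (66) with equations (53)
through (55) gives

$$ z = Z \cos\theta_z \qquad n = N \cos\theta_n \qquad a = A \cos\theta_a \tag{67} $$

$$ \theta_z = -\theta_{\phi z} \qquad \theta_n = -\theta_{\phi n} \qquad \theta_a = -\theta_{\phi a} \tag{68} $$

and therefore in analogy to equation (6)

$$ \bar{z} = Z \cos\theta_z \exp(j\theta_z) = Z \cos\theta_{\phi z} \exp(-j\theta_{\phi z}) \tag{69} $$

$$ \bar{n} = N \cos\theta_n \exp(j\theta_n) = N \cos\theta_{\phi n} \exp(-j\theta_{\phi n}) \tag{70} $$

$$ \bar{a} = A \cos\theta_a \exp(j\theta_a) = A \cos\theta_{\phi a} \exp(-j\theta_{\phi a}) \tag{71} $$

Sometimes it is convenient to consider the terms

$$ \bar{1}_z = \cos\theta_z \exp(j\theta_z) = \cos\theta_{\phi z} \exp(-j\theta_{\phi z}) \tag{72} $$

$$ \bar{1}_n = \cos\theta_n \exp(j\theta_n) = \cos\theta_{\phi n} \exp(-j\theta_{\phi n}) \tag{73} $$

$$ \bar{1}_a = \cos\theta_a \exp(j\theta_a) = \cos\theta_{\phi a} \exp(-j\theta_{\phi a}) \tag{74} $$

Then equations (69) through (71) become

$$ \bar{z} = Z\,\bar{1}_z \qquad \bar{n} = N\,\bar{1}_n \qquad \bar{a} = A\,\bar{1}_a \tag{75} $$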


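The real and imaginary parts of equations (69) through (71) are written as

$$ z_R = Z \cos^2\theta_z = Z \cos^2\theta_{\phi z} \tag{76} $$

$$ n_R = N \cos^2\theta_n = N \cos^2\theta_{\phi n} \tag{77} $$

$$ a_R = A \cos^2\theta_a = A \cos^2\theta_{\phi a} \tag{78} $$

$$ z_I = Z \cos\theta_z \sin\theta_z = -Z \cos\theta_{\phi z} \sin\theta_{\phi z} \tag{79} $$

$$ n_I = N \cos\theta_n \sin\theta_n = -N \cos\theta_{\phi n} \sin\theta_{\phi n} \tag{80} $$

$$ a_I = A \cos\theta_a \sin\theta_a = -A \cos\theta_{\phi a} \sin\theta_{\phi a} \tag{81} $$

These results follow from the periodicity in the variables $\phi_{zm}$, $\phi_{nm}$ and $\phi_{am}$.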

The law of addition for the complex atomic number, complex neutron number
and complex atomic mass number must be analogous to the form of the addition
law for complex magnetic quantum numbers given in equation (35), so that

$$ W + \bar{a} = \bar{z} + \bar{n} \tag{82} $$

or

$$ W + A\,\bar{1}_a = Z\,\bar{1}_z + N\,\bar{1}_n \tag{83} $$

subject to the integer relation of the universal law of baryon number
conservation

$$ A = Z + N \tag{84} $$


In equation (82) the known quantities are Z, $\theta_z$, N and $\theta_n$. The value of A is
immediately obtained from equation (84) while the quantities W and $\theta_a$ are
obtained by noting that equation (82) can be rewritten as

$$ W + A \cos^2\theta_a = Z \cos^2\theta_z + N \cos^2\theta_n \tag{85} $$

$$ A \cos\theta_a \sin\theta_a = Z \cos\theta_z \sin\theta_z + N \cos\theta_n \sin\theta_n \tag{86} $$

From equations (85) and (86) it follows that

$$ W = -(A/2)[1 + (1 - 4f^2)^{1/2}] + Z \cos^2\theta_z + N \cos^2\theta_n \tag{87} $$

$$ \cos^2\theta_a = \tfrac{1}{2}[1 + (1 - 4f^2)^{1/2}] \tag{88} $$

$$ f = A^{-1}(Z \cos\theta_z \sin\theta_z + N \cos\theta_n \sin\theta_n) \tag{89} $$

which have the same form as equations (37) through (39) for complex magnetic
quantum numbers. For small internal phase angles $\theta_z \approx 0$ and $\theta_n \approx 0$, it
follows that $f \approx 0$, $W \approx 0$ and $\theta_a \approx 0$. In this case the function W can be
neglected to a first approximation in nuclear physics calculations as, for
example, in the determination of the valley of beta stability which is done in
Section 4. If the approximation $\theta_z \approx \theta_n$ is assumed then

$$ f \approx \cos\theta_z \sin\theta_z \tag{89A} $$

For small values of $\theta_z \approx \theta_n$ equation (89A) becomes $f \approx \theta_z$, and equation (88)
gives $\theta_a \approx \theta_z$, and therefore for small internal phase angles the following
approximation is valid

$$ \theta_z \approx \theta_n \approx \theta_a \tag{89B} $$

Equation (88) shows that $\theta_a = \theta_a(\theta_z, \theta_n, Z, N)$ so that $\theta_a$ is not an independent
variable. For simplicity it will sometimes be assumed that $\theta_z = \theta_z(Z,H)$,
$\theta_n = \theta_n(N,H)$ and $\theta_a = \theta_a(Z,N,H) \approx \theta_a(A,H)$, where H = strength of an externally
applied field such as a magnetic field component of an applied electromagnetic
field.
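The procedure for obtaining A, θ_a and W from Z, θ_z, N and θ_n in equations (84) through (89) can be sketched in a few lines of Python. This is a minimal illustration rather than the paper's own calculation: the function names and sample inputs are hypothetical, the positive root of equation (88) is taken as written, and the sign of θ_a is assumed to follow the sign of f so that equation (86) is satisfied.

```python
import cmath
import math


def complex_number(Q, theta):
    """Form Q cos(theta) exp(j theta), as in equations (69) through (71)."""
    return Q * math.cos(theta) * cmath.exp(1j * theta)


def add_complex_numbers(Z, theta_z, N, theta_n):
    """Return (A, theta_a, W) from equations (84), (87), (88) and (89)."""
    A = Z + N                                                   # equation (84)
    f = (Z * math.cos(theta_z) * math.sin(theta_z)
         + N * math.cos(theta_n) * math.sin(theta_n)) / A      # equation (89)
    # Equation (88); max() guards against tiny negative rounding errors.
    cos2_a = 0.5 * (1.0 + math.sqrt(max(0.0, 1.0 - 4.0 * f * f)))
    # Assumption: theta_a takes the sign of f so that equation (86) holds.
    theta_a = math.copysign(math.acos(min(1.0, math.sqrt(cos2_a))), f)
    W = (-A * cos2_a + Z * math.cos(theta_z) ** 2
         + N * math.cos(theta_n) ** 2)                         # equation (87)
    return A, theta_a, W


if __name__ == "__main__":
    Z, N = 29, 34                    # illustrative nucleus (assumed)
    theta_z, theta_n = 0.1, 0.2      # illustrative phase angles (assumed)
    A, theta_a, W = add_complex_numbers(Z, theta_z, N, theta_n)
    lhs = W + complex_number(A, theta_a)                              # equation (82), left
    rhs = complex_number(Z, theta_z) + complex_number(N, theta_n)     # equation (82), right
    print(A, theta_a, W, abs(lhs - rhs))
```

When the two input phase angles are equal, the sketch returns the same angle for θ_a and a vanishing W, consistent with the small-angle discussion above.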


B.  Time  Variation  of  the  Complex  Atomic  Number,  Neutron  Number 
and  Atomic  Mass  Number. 

This subsection considers the time variation of $\bar{z}$, $\bar{n}$ and $\bar{a}$ for two physical
cases: the first is associated with changes of Z, N and A by radioactive
transmutations in a constant magnetic field, and the second case is associated with
the time variation of the magnetic field for fixed values of Z, N and A.
Equations (40) through (42) give

$$ d\bar{z}/d\bar{t} = \cos\beta_{tt} \, (dz/dt + j z \, d\theta_z/dt) \exp[j(\theta_z - \theta_t - \beta_{tt})] \tag{90} $$

$$ d\bar{n}/d\bar{t} = \cos\beta_{tt} \, (dn/dt + j n \, d\theta_n/dt) \exp[j(\theta_n - \theta_t - \beta_{tt})] \tag{91} $$

$$ d\bar{a}/d\bar{t} = \cos\beta_{tt} \, (da/dt + j a \, d\theta_a/dt) \exp[j(\theta_a - \theta_t - \beta_{tt})] \tag{92} $$

where $\beta_{tt}$ is given by equation (10). It is assumed that $\theta_z$, $\theta_n$ and $\theta_a$ are
functions of an applied electromagnetic field so that for simplicity

$$ \theta_z = \theta_z(Z,H) \qquad \theta_n = \theta_n(N,H) \qquad \theta_a = \theta_a(A,H) \tag{93} $$

where H = magnitude of a static magnetic field or the amplitude of the magnetic
field component of an electromagnetic field. Actually the precise value of $\theta_a$
is given by equation (88). The magnetic field H is chosen as a descriptive
value of an electromagnetic field because in Section 7 a comparison is made of
the effects of a static magnetic field and an electromagnetic wave field on the
clean fission process in nuclei, and the field strength H is a common variable
of description for the two fields and can serve as a means of comparison.

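For the case when Z, N, A and H are all time dependent the application of
the chain rule for derivatives gives

$$ d\theta_z/dt = (\partial\theta_z/\partial Z) \, dZ/dt + (\partial\theta_z/\partial H) \, dH/dt \tag{94} $$

$$ d\theta_n/dt = (\partial\theta_n/\partial N) \, dN/dt + (\partial\theta_n/\partial H) \, dH/dt \tag{95} $$

$$ d\theta_a/dt = (\partial\theta_a/\partial A) \, dA/dt + (\partial\theta_a/\partial H) \, dH/dt \tag{96} $$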

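Combining equations (67) with equations (90) through (96) gives

$$ d\bar{z}/d\bar{t} = \cos\beta_{tt} \, (\bar{B}_z \, dZ/dt + \bar{C}_z \, dH/dt) \exp[j(\theta_z - \theta_t - \beta_{tt})] \tag{97} $$

$$ d\bar{n}/d\bar{t} = \cos\beta_{tt} \, (\bar{B}_n \, dN/dt + \bar{C}_n \, dH/dt) \exp[j(\theta_n - \theta_t - \beta_{tt})] \tag{98} $$

$$ d\bar{a}/d\bar{t} = \cos\beta_{tt} \, (\bar{B}_a \, dA/dt + \bar{C}_a \, dH/dt) \exp[j(\theta_a - \theta_t - \beta_{tt})] \tag{99} $$

where

$$ \bar{B}_z = \cos\theta_z + Z (\partial\theta_z/\partial Z) \exp[j(\pi/2 + \theta_z)] \tag{100} $$

$$ \bar{B}_n = \cos\theta_n + N (\partial\theta_n/\partial N) \exp[j(\pi/2 + \theta_n)] \tag{101} $$

$$ \bar{B}_a = \cos\theta_a + A (\partial\theta_a/\partial A) \exp[j(\pi/2 + \theta_a)] \tag{102} $$

$$ \bar{C}_z = Z (\partial\theta_z/\partial H) \exp[j(\pi/2 + \theta_z)] \tag{103} $$

$$ \bar{C}_n = N (\partial\theta_n/\partial H) \exp[j(\pi/2 + \theta_n)] \tag{104} $$

$$ \bar{C}_a = A (\partial\theta_a/\partial H) \exp[j(\pi/2 + \theta_a)] \tag{105} $$

Two special cases are of interest.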

Two  special  cases  are  of  interest. 

Case  a.  Constant  Magnetic  Field. 


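This case corresponds to radioactive transmutations or nuclear reactions
in a constant magnetic field. For this case equations (97) through (99) give

$$ (d\bar{z}/d\bar{t})_H = \cos\beta_{tt} \, \bar{B}_z \, dZ/dt \, \exp[j(\theta_z - \theta_t - \beta_{tt})] \tag{106} $$

$$ (d\bar{n}/d\bar{t})_H = \cos\beta_{tt} \, \bar{B}_n \, dN/dt \, \exp[j(\theta_n - \theta_t - \beta_{tt})] \tag{107} $$

$$ (d\bar{a}/d\bar{t})_H = \cos\beta_{tt} \, \bar{B}_a \, dA/dt \, \exp[j(\theta_a - \theta_t - \beta_{tt})] \tag{108} $$

For this case the actual size of the nucleus is changing.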

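Case b. Constant Z, N and A.

This case corresponds to internal phase changes in a fixed nucleus in the
presence of a time varying electromagnetic field. Equations (97) through (99)
then give

$$ \begin{aligned}
(d\bar{z}/d\bar{t})_Z &= \cos\beta_{tt} \, \bar{C}_z \, dH/dt \, \exp[j(\theta_z - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, Z (\partial\theta_z/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_z - \theta_t - \beta_{tt})]
\end{aligned} \tag{109} $$

$$ \begin{aligned}
(d\bar{n}/d\bar{t})_N &= \cos\beta_{tt} \, \bar{C}_n \, dH/dt \, \exp[j(\theta_n - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, N (\partial\theta_n/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_n - \theta_t - \beta_{tt})]
\end{aligned} \tag{110} $$

$$ \begin{aligned}
(d\bar{a}/d\bar{t})_A &= \cos\beta_{tt} \, \bar{C}_a \, dH/dt \, \exp[j(\theta_a - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, A (\partial\theta_a/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_a - \theta_t - \beta_{tt})]
\end{aligned} \tag{111} $$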

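Case b corresponds to the decay of nuclei through internal phase changes
associated with a time varying magnetic field, and is considered in detail in Section 3.

The time derivatives of $\bar{z}$, $\bar{n}$ and $\bar{a}$ given in equations (97) through (99)
can also be written in a more general form involving only one exponential term
by noting that equations (40) through (42) give

$$ \begin{aligned}
d\bar{z}/d\bar{t} &= \cos\beta_{tt} \sec\beta_{zz} \, (dz/dt) \exp(j\Phi_{zt}) \\
  &= \cos\beta_{tt} \csc\beta_{zz} \, z (d\theta_z/dt) \exp(j\Phi_{zt}) \\
  &= \sin\beta_{tt} \sec\beta_{zz} \, t^{-1} (dz/d\theta_t) \exp(j\Phi_{zt}) \\
  &= \sin\beta_{tt} \csc\beta_{zz} \, (z/t)(d\theta_z/d\theta_t) \exp(j\Phi_{zt})
\end{aligned} \tag{112} $$

$$ \begin{aligned}
d\bar{n}/d\bar{t} &= \cos\beta_{tt} \sec\beta_{nn} \, (dn/dt) \exp(j\Phi_{nt}) \\
  &= \cos\beta_{tt} \csc\beta_{nn} \, n (d\theta_n/dt) \exp(j\Phi_{nt}) \\
  &= \sin\beta_{tt} \sec\beta_{nn} \, t^{-1} (dn/d\theta_t) \exp(j\Phi_{nt}) \\
  &= \sin\beta_{tt} \csc\beta_{nn} \, (n/t)(d\theta_n/d\theta_t) \exp(j\Phi_{nt})
\end{aligned} \tag{113} $$

$$ \begin{aligned}
d\bar{a}/d\bar{t} &= \cos\beta_{tt} \sec\beta_{aa} \, (da/dt) \exp(j\Phi_{at}) \\
  &= \cos\beta_{tt} \csc\beta_{aa} \, a (d\theta_a/dt) \exp(j\Phi_{at}) \\
  &= \sin\beta_{tt} \sec\beta_{aa} \, t^{-1} (da/d\theta_t) \exp(j\Phi_{at}) \\
  &= \sin\beta_{tt} \csc\beta_{aa} \, (a/t)(d\theta_a/d\theta_t) \exp(j\Phi_{at})
\end{aligned} \tag{114} $$

where $\beta_{tt}$ is given by equation (10), and where

$$ \tan\beta_{zz} = z \, \partial\theta_z/\partial z \qquad \tan\beta_{nn} = n \, \partial\theta_n/\partial n \qquad \tan\beta_{aa} = a \, \partial\theta_a/\partial a \tag{115} $$

$$ \Phi_{zt} = \theta_z + \beta_{zz} - \theta_t - \beta_{tt} \tag{116} $$

$$ \Phi_{nt} = \theta_n + \beta_{nn} - \theta_t - \beta_{tt} \tag{117} $$

$$ \Phi_{at} = \theta_a + \beta_{aa} - \theta_t - \beta_{tt} \tag{118} $$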

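The derivatives dz/dt, dn/dt and da/dt are evaluated using equation (67) as
follows

$$ \begin{aligned}
dz/dt &= dZ/dt \, \cos\theta_z - Z \sin\theta_z \, d\theta_z/dt \\
      &= dZ/dt \, (\cos\theta_z - Z \sin\theta_z \, \partial\theta_z/\partial Z) - Z \sin\theta_z (\partial\theta_z/\partial H) \, dH/dt
\end{aligned} \tag{119} $$

$$ \begin{aligned}
dn/dt &= dN/dt \, \cos\theta_n - N \sin\theta_n \, d\theta_n/dt \\
      &= dN/dt \, (\cos\theta_n - N \sin\theta_n \, \partial\theta_n/\partial N) - N \sin\theta_n (\partial\theta_n/\partial H) \, dH/dt
\end{aligned} \tag{120} $$

$$ \begin{aligned}
da/dt &= dA/dt \, \cos\theta_a - A \sin\theta_a \, d\theta_a/dt \\
      &= dA/dt \, (\cos\theta_a - A \sin\theta_a \, \partial\theta_a/\partial A) - A \sin\theta_a (\partial\theta_a/\partial H) \, dH/dt
\end{aligned} \tag{121} $$

These derivatives appear in equations (112) through (114).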

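Combining equation (67) and equations (112) through (115) gives for the
case where both H and Z, N, A are varying

$$ \begin{aligned}
d\bar{z}/d\bar{t} &= J_z \cos\beta_{tt} \sec\beta_{zz} \cos\theta_z \exp(j\Phi_{zt}) \\
  &= I_z \cos\beta_{tt} \csc\beta_{zz} \cos\theta_z \exp(j\Phi_{zt}) \\
  &= (I_z^2 + J_z^2)^{1/2} \cos\beta_{tt} \cos\theta_z \exp(j\Phi_{zt})
\end{aligned} \tag{122} $$

$$ \begin{aligned}
d\bar{n}/d\bar{t} &= J_n \cos\beta_{tt} \sec\beta_{nn} \cos\theta_n \exp(j\Phi_{nt}) \\
  &= I_n \cos\beta_{tt} \csc\beta_{nn} \cos\theta_n \exp(j\Phi_{nt}) \\
  &= (I_n^2 + J_n^2)^{1/2} \cos\beta_{tt} \cos\theta_n \exp(j\Phi_{nt})
\end{aligned} \tag{123} $$

$$ \begin{aligned}
d\bar{a}/d\bar{t} &= J_a \cos\beta_{tt} \sec\beta_{aa} \cos\theta_a \exp(j\Phi_{at}) \\
  &= I_a \cos\beta_{tt} \csc\beta_{aa} \cos\theta_a \exp(j\Phi_{at}) \\
  &= (I_a^2 + J_a^2)^{1/2} \cos\beta_{tt} \cos\theta_a \exp(j\Phi_{at})
\end{aligned} \tag{124} $$

where

$$ \tan\beta_{zz} = I_z/J_z \qquad \tan\beta_{nn} = I_n/J_n \qquad \tan\beta_{aa} = I_a/J_a \tag{125} $$

$$ I_z = \tan\alpha^Z_{ZZ} \, dZ/dt + (Z/H) \tan\alpha^Z_{HH} \, dH/dt \tag{126} $$

$$ I_n = \tan\alpha^N_{NN} \, dN/dt + (N/H) \tan\alpha^N_{HH} \, dH/dt \tag{127} $$

$$ I_a = \tan\alpha^A_{AA} \, dA/dt + (A/H) \tan\alpha^A_{HH} \, dH/dt \tag{128} $$

$$ J_z = (1 - \tan\theta_z \tan\alpha^Z_{ZZ}) \, dZ/dt - (Z/H) \tan\theta_z \tan\alpha^Z_{HH} \, dH/dt \tag{129} $$

$$ J_n = (1 - \tan\theta_n \tan\alpha^N_{NN}) \, dN/dt - (N/H) \tan\theta_n \tan\alpha^N_{HH} \, dH/dt \tag{130} $$

$$ J_a = (1 - \tan\theta_a \tan\alpha^A_{AA}) \, dA/dt - (A/H) \tan\theta_a \tan\alpha^A_{HH} \, dH/dt \tag{131} $$

and where

$$ \tan\alpha^Z_{ZZ} = Z \, \partial\theta_z/\partial Z \qquad \tan\alpha^N_{NN} = N \, \partial\theta_n/\partial N \qquad \tan\alpha^A_{AA} = A \, \partial\theta_a/\partial A \tag{132} $$

$$ \tan\alpha^Z_{HH} = H \, \partial\theta_z/\partial H \qquad \tan\alpha^N_{HH} = H \, \partial\theta_n/\partial H \qquad \tan\alpha^A_{HH} = H \, \partial\theta_a/\partial H \tag{133} $$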


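Two special cases are of interest for application to nuclear reactions
and transmutations, and both can be experimentally verified.

Case a. External Radioactive Decay in a Constant Magnetic Field.

For this case it follows from equations (125) through (133) that

$$ I^H_z = \tan\alpha^Z_{ZZ} \, dZ/dt \tag{134} $$

$$ I^H_n = \tan\alpha^N_{NN} \, dN/dt \tag{135} $$

$$ I^H_a = \tan\alpha^A_{AA} \, dA/dt \tag{136} $$

$$ J^H_z = (1 - \tan\theta_z \tan\alpha^Z_{ZZ}) \, dZ/dt \tag{137} $$

$$ J^H_n = (1 - \tan\theta_n \tan\alpha^N_{NN}) \, dN/dt \tag{138} $$

$$ J^H_a = (1 - \tan\theta_a \tan\alpha^A_{AA}) \, dA/dt \tag{139} $$

$$ \tan\beta^H_{zz} = I^H_z/J^H_z = \tan\alpha^Z_{ZZ} \, (1 - \tan\theta_z \tan\alpha^Z_{ZZ})^{-1} \tag{140} $$

$$ \tan\beta^H_{nn} = I^H_n/J^H_n = \tan\alpha^N_{NN} \, (1 - \tan\theta_n \tan\alpha^N_{NN})^{-1} \tag{141} $$

$$ \tan\beta^H_{aa} = I^H_a/J^H_a = \tan\alpha^A_{AA} \, (1 - \tan\theta_a \tan\alpha^A_{AA})^{-1} \tag{142} $$

$$ \alpha^Z_{HH} = \pi/2 \qquad \alpha^N_{HH} = \pi/2 \qquad \alpha^A_{HH} = \pi/2 \tag{143} $$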

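From equations (106) through (108) and (122) through (143) it follows for
constant H that

$$ \begin{aligned}
(d\bar{z}/d\bar{t})_H &= J^H_z \cos\beta_{tt} \sec\beta^H_{zz} \cos\theta_z \exp(j\Phi^H_{zt}) \\
  &= I^H_z \cos\beta_{tt} \csc\beta^H_{zz} \cos\theta_z \exp(j\Phi^H_{zt}) \\
  &= [(I^H_z)^2 + (J^H_z)^2]^{1/2} \cos\beta_{tt} \cos\theta_z \exp(j\Phi^H_{zt})
\end{aligned} \tag{144} $$

$$ \begin{aligned}
(d\bar{n}/d\bar{t})_H &= J^H_n \cos\beta_{tt} \sec\beta^H_{nn} \cos\theta_n \exp(j\Phi^H_{nt}) \\
  &= I^H_n \cos\beta_{tt} \csc\beta^H_{nn} \cos\theta_n \exp(j\Phi^H_{nt}) \\
  &= [(I^H_n)^2 + (J^H_n)^2]^{1/2} \cos\beta_{tt} \cos\theta_n \exp(j\Phi^H_{nt})
\end{aligned} \tag{145} $$

$$ \begin{aligned}
(d\bar{a}/d\bar{t})_H &= J^H_a \cos\beta_{tt} \sec\beta^H_{aa} \cos\theta_a \exp(j\Phi^H_{at}) \\
  &= I^H_a \cos\beta_{tt} \csc\beta^H_{aa} \cos\theta_a \exp(j\Phi^H_{at}) \\
  &= [(I^H_a)^2 + (J^H_a)^2]^{1/2} \cos\beta_{tt} \cos\theta_a \exp(j\Phi^H_{at})
\end{aligned} \tag{146} $$

where

$$ \Phi^H_{zt} = \theta_z + \beta^H_{zz} - \theta_t - \beta_{tt} \qquad \Phi^H_{nt} = \theta_n + \beta^H_{nn} - \theta_t - \beta_{tt} \qquad \Phi^H_{at} = \theta_a + \beta^H_{aa} - \theta_t - \beta_{tt} $$

Simple algebra shows that equations (106) through (108) are equivalent to
equations (144) through (146).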

Case b. Internal Phase Radioactive Decay due to a Time Dependent
Magnetic Field.

This case corresponds to constant values of Z, N and A. Equations (125)
through (133) give

(154)

$$ \alpha^Z_{ZZ} = \pi/2 \qquad \alpha^N_{NN} = \pi/2 \qquad \alpha^A_{AA} = \pi/2 $$

$$ I^Z_z = (Z/H) \tan\alpha^Z_{HH} \, dH/dt \tag{156} $$

$$ I^N_n = (N/H) \tan\alpha^N_{HH} \, dH/dt \tag{157} $$

$$ I^A_a = (A/H) \tan\alpha^A_{HH} \, dH/dt \tag{158} $$

$$ J^Z_z = -(Z/H) \tan\theta_z \tan\alpha^Z_{HH} \, dH/dt \tag{159} $$

$$ J^N_n = -(N/H) \tan\theta_n \tan\alpha^N_{HH} \, dH/dt \tag{160} $$

$$ J^A_a = -(A/H) \tan\theta_a \tan\alpha^A_{HH} \, dH/dt \tag{161} $$

For constant Z, N and A it follows that

$$ \begin{aligned}
[(I^Z_z)^2 + (J^Z_z)^2]^{1/2} &= (Z/H) \tan\alpha^Z_{HH} \sec\theta_z \, dH/dt \\
  &= Z (\partial\theta_z/\partial H) \sec\theta_z \, dH/dt
\end{aligned} \tag{162} $$

$$ \begin{aligned}
[(I^N_n)^2 + (J^N_n)^2]^{1/2} &= (N/H) \tan\alpha^N_{HH} \sec\theta_n \, dH/dt \\
  &= N (\partial\theta_n/\partial H) \sec\theta_n \, dH/dt
\end{aligned} \tag{163} $$

$$ \begin{aligned}
[(I^A_a)^2 + (J^A_a)^2]^{1/2} &= (A/H) \tan\alpha^A_{HH} \sec\theta_a \, dH/dt \\
  &= A (\partial\theta_a/\partial H) \sec\theta_a \, dH/dt
\end{aligned} \tag{164} $$

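Combining equations (122) through (124) and equations (162) through (164) gives
for Z, N and A fixed

$$ \begin{aligned}
(d\bar{z}/d\bar{t})_Z &= \cos\beta_{tt} \, Z (d\theta_z/dt)_Z \exp[j(\pi/2 + 2\theta_z - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, Z (\partial\theta_z/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_z - \theta_t - \beta_{tt})]
\end{aligned} \tag{165} $$

$$ \begin{aligned}
(d\bar{n}/d\bar{t})_N &= \cos\beta_{tt} \, N (d\theta_n/dt)_N \exp[j(\pi/2 + 2\theta_n - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, N (\partial\theta_n/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_n - \theta_t - \beta_{tt})]
\end{aligned} \tag{166} $$

$$ \begin{aligned}
(d\bar{a}/d\bar{t})_A &= \cos\beta_{tt} \, A (d\theta_a/dt)_A \exp[j(\pi/2 + 2\theta_a - \theta_t - \beta_{tt})] \\
  &= \cos\beta_{tt} \, A (\partial\theta_a/\partial H) \, dH/dt \, \exp[j(\pi/2 + 2\theta_a - \theta_t - \beta_{tt})]
\end{aligned} \tag{167} $$

Internal phase radioactive decay in a time varying electromagnetic field is
treated in Section 3.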

C.  Dependence  of  the  Internal  Phase  Angles  on  Z,  N,  A  and  H. 

The following material is a simple way of relating the internal space
phase angles $\theta_z$, $\theta_n$ and $\theta_a$ to the strength of an applied electromagnetic field.
It has been assumed that the complex atomic number $\bar{z}$, neutron number $\bar{n}$ and
atomic mass number $\bar{a}$ arise from solutions of the azimuthal equations (44) through
(46) so that $\theta_z$, $\theta_n$ and $\theta_a$ are given by equation (68). Then if it is assumed
that the internal space phase angles $\theta_{\phi z}$, $\theta_{\phi n}$ and $\theta_{\phi a}$ of the complex number
azimuthal angles $\bar{\phi}_z$, $\bar{\phi}_n$ and $\bar{\phi}_a$ corresponding to $\bar{z}$, $\bar{n}$ and $\bar{a}$ as given in equation
(43) are proportional to the internal phase angles $\theta_\alpha$ of the complex number
Cartesian coordinates $\bar{\alpha}$, where $\alpha = x, y, z$ with $\theta_\alpha = \theta_x = \theta_y = \theta_z$, it follows
that

$$ \theta_{\phi z} = \omega_z \theta_\alpha \tag{168} $$

$$ \theta_{\phi n} = \omega_n \theta_\alpha \tag{169} $$

$$ \theta_{\phi a} = \omega_a \theta_\alpha \tag{170} $$

where $\omega_z$, $\omega_n$ and $\omega_a$ = constants whose values may be taken to be of the order of
unity as a first approximation. The internal space phase angles of the complex
number Cartesian coordinates are related to the internal space phase angle of a
magnetic field by57
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For  free  particles  the  relation  0a  =  20 1  is  valid  so  that 
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(172) 
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(175) 


At the fission condition, equations (473), (487) and (488) give

$$ a_a(Z,A) \approx 2 a_z(Z,A) \tag{176} $$

The constants $a_z(Z,A)$, $a_n(Z,A)$ and $a_a(Z,A)$ are of the order of unity to a first
approximation so that at incipient fission

and in general to a first approximation near the valley of beta stability

(178)

Therefore the internal phase angles of the atomic number, neutron number and
atomic mass number can be controlled by applying an electromagnetic field whose
internal phase angle of the magnetic field component $\theta_H$ is determined by the
strength of the applied electromagnetic field.

It  is  expected  that  the  value  of  9^  will  depend  on  the  field  strength  of 
the  applied  electromagnetic  field  and  can  be  represented  as  a  power  series  as 
follows 

eH(B)  *  0R(O)  +  hjH  +  h2H2  +  •••  (179) 
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For  practical  calculations  it  will  be  assumed  that  ©^(O)  ^  0  .  Combining  equa¬ 
tions  (173)  through  (175)  with  equation  (179)  gives 


0=0  +  a  (h.H  +  h-H  +  •••) 

z  zo  z  1  2 

6  -  0  -fa  (h.H  +  h„H2  +  ••  •) 

q  no  n v  1  2 

6  -  0  +  a  (h.H  +  h«H2  +  •••) 

a  ao  a  1  2  ' 
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where 
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where  0ZO  ,  0no  and  0ao  ■  intrinsic  phase  angles  of  the  atomic  number,  neutron 
number  and  atomic  mass  number  for  zero  value  of  the  applies  electromagnetic  field. 
For  practical  calculations 
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The  magnetic  field  strength  H  can  refer  to  either  a  static  magnetic  field  or  the 
magnetic  field  component  of  an  electromagnetic  wave  such  as  in  the  case  of  y  rays. 


The following nuclear magnetic internal phase angle compliance coefficients
can be defined

$C^H_{\theta z}(Z,A) = \partial\theta_z/\partial H = a_z(h_1 + 2h_2 H + 3h_3 H^2 + \cdots)$

where

Combining equations (180) through (182) with equations (188) through (190) gives

$\theta_z = \theta_{zo} + C^H_{\theta zo} H + \tfrac{1}{2} C^H_{\theta z1} H^2 + \tfrac{1}{3} C^H_{\theta z2} H^3 + \cdots$    (191)

$\theta_n = \theta_{no} + C^H_{\theta no} H + \tfrac{1}{2} C^H_{\theta n1} H^2 + \tfrac{1}{3} C^H_{\theta n2} H^3 + \cdots$    (192)

$\theta_a = \theta_{ao} + C^H_{\theta ao} H + \tfrac{1}{2} C^H_{\theta a1} H^2 + \tfrac{1}{3} C^H_{\theta a2} H^3 + \cdots$    (193)

where $\theta_{zo} \approx 0$, $\theta_{no} \approx 0$ and $\theta_{ao} \approx 0$ and can generally be neglected.
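
The relation between the power series for a phase angle and its compliance coefficient can be checked numerically. The following is a minimal sketch, assuming the compliance coefficient is the derivative of the phase angle with respect to H and that its expansion coefficients are $C_o = a h_1$, $C_1 = 2a h_2$, $C_2 = 3a h_3$. The function names and the numerical values of `a_z`, `h` and `H` are hypothetical and are not taken from the paper.

```python
def phase_angle(H, a, h, theta0=0.0):
    """Power-series form: theta = theta0 + a*(h1*H + h2*H**2 + ...)."""
    return theta0 + a * sum(hk * H ** (k + 1) for k, hk in enumerate(h))


def compliance(H, a, h):
    """Compliance C = d(theta)/dH = a*(h1 + 2*h2*H + 3*h3*H**2 + ...)."""
    return a * sum((k + 1) * hk * H ** k for k, hk in enumerate(h))


def phase_angle_from_compliance(H, a, h, theta0=0.0):
    """Same angle written with C_o = a*h1, C_1 = 2*a*h2, C_2 = 3*a*h3, ..."""
    c = [(k + 1) * a * hk for k, hk in enumerate(h)]
    # theta = theta0 + C_o*H + C_1*H**2/2 + C_2*H**3/3 + ...
    return theta0 + sum(ck * H ** (k + 1) / (k + 1) for k, ck in enumerate(c))


# Hypothetical illustrative values (assumptions, not values from the paper).
a_z, h, H = 1.0, [0.1, 0.02, 0.003], 0.5
theta_direct = phase_angle(H, a_z, h)
theta_via_c = phase_angle_from_compliance(H, a_z, h)
```

Both routes give the same angle, which is the content of the factors 1/2 and 1/3 in the expanded form.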

Equations (191) through (193) can be combined with equations (97) through
(111) to calculate $d\bar{z}/dt$, $d\bar{n}/dt$ and $d\bar{a}/dt$ in terms of the applied
electromagnetic field. Combining equations (132) and (133) with equations (191) through
(193) gives

tan  JL  -  zae  /a z  +  zacH  /az  h  +  i/2zacH./az  h2  +  •••  (194) 


tan  a*  -  Z36  /3Z  +  Z3CH  /3Z  H  +  l/2ZaCH,/3Z  H2  +  ••• 

ZZ  zo  zo  zl 

tan  clL  -  N30  /3N  +  N3CH  /3N  H  +  1/2N3CH,/3N  H2  +  ••• 

nN  no  no  ni 

tan  a*J.  -  A30  /3A  +  A3CH  /3A  H  +  1/2A3CH,/3A  H2  +  (196) 

AA  3.0  30  3 1 
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“*  4  -  HCe«  -  H(Ceno  +  CL  H  +  C9„2  H2  +  •••) 

““  4  ■  HC0a  -  H(C9ao  +  ce.l  H  +  C9a2  ^  +  —  > 


(197) 
The expressions in equations (194) through (199) can be inserted into equations
(122) through (167) to calculate $d\bar{z}/dt$, $d\bar{n}/dt$ and $d\bar{a}/dt$ in terms of a strength
parameter H of the applied electromagnetic field. For example, in the case of
nuclei with fixed values of Z, N and A the effect of a changing magnetic field
is obtained from equations (150) through (152) and (165) through (167) to be

$(\partial\bar{z}/\partial H)_Z = Z(\partial\theta_z/\partial H)_Z \exp[j(2\theta_z + \pi/2)] = Z C^H_{\theta z} \exp[j(\theta_z + \beta_{zz})]$    (200)

$(\partial\bar{n}/\partial H)_N = N(\partial\theta_n/\partial H)_N \exp[j(2\theta_n + \pi/2)] = N C^H_{\theta n} \exp[j(\theta_n + \beta_{nn})]$    (201)

$(\partial\bar{a}/\partial H)_A = A(\partial\theta_a/\partial H)_A \exp[j(2\theta_a + \pi/2)] = A C^H_{\theta a} \exp[j(\theta_a + \beta_{aa})]$    (202)

The previous equations are all exact.

For  simple  estimations  of  the  effects  of  an  electromagnetic  field,  the 
following  simple  linear  approximations  can  be  made  by  dropping  all  nonlinear 
effects  of  the  electromagnetic  field  in  the  magnetic  compliance  coefficients 
in  equations  (191)  through  (193) 
It is often convenient to invert equation (203) as

where the nuclear magnetic internal angle stiffness coefficients are given by

Generally it is more convenient to use the magnetic induction field B in
calculations so that the following corresponding nuclear magnetic compliance and
stiffness coefficients are defined by

In most cases the higher order terms in equations (185) through (187) and (191)
through (193) can be ignored so that the subscript "o" can be dropped and the
following are taken to be exact relationships

3.  RADIOACTIVE  DECAY  OF  ATOMIC  NUCLEI  IN  AN  ELECTROMAGNETIC  FIELD.  This
section considers the radioactive decay of an assemblage of identical radioactive
nuclei located in an electromagnetic field, or other external field such
as gravity, which induces internal phase angles in the number of nuclei, in the
total number of constituent nucleons contained within the nuclei and in the
atomic mass number of each nucleus. The addition law is developed for the
complex total nucleon number, complex number of atomic nuclei and complex atomic
mass number.




A.  Addition  Law  for  Complex  Total  Nucleon  Number,  Complex  Number 
of  Atomic  Nuclei  and  Complex  Atomic  Mass  Number. 

Consider a system of atomic nuclei in an external electromagnetic field.
The presence of the electromagnetic field requires that particle number be
represented as a complex number in an internal space. Therefore the complex
number of total nucleons (protons and neutrons within the nuclei), the complex
number of atomic nuclei and the complex atomic mass number are written as

$\bar{N}_n = N_n \exp(j\theta_{Nn})$    (215)

$\bar{N} = N \exp(j\theta_N)$    (216)

$\bar{a} = a \exp(j\theta_a)$    (217)

where $\bar{N}_n$, $N_n$ and $\theta_{Nn}$ = complex number value, magnitude and internal phase angle
of the total number of nucleons situated within all of the atomic nuclei; $\bar{N}$, $N$ and
$\theta_N$ = complex number value, magnitude and internal phase angle of the number of
atomic nuclei; and as before $\bar{a}$, $a$ and $\theta_a$ = complex number value, magnitude and
internal phase angle of the atomic mass number of each nucleus. In analogy to
equation (67) the magnitudes that appear in equations (215) through (217) are
written as


$N_n = n_n \cos\theta_{Nn}$    (218)

$N = n \cos\theta_N$    (219)

$a = A \cos\theta_a$    (220)

where $n_n$ = integer total number of nucleons within the atomic nuclei,
$n$ = integer number of atomic nuclei, and as before $A$ = atomic mass number which
by definition is an integer. These integer numbers satisfy the equation

$n_n = nA$    (221)

which represents the fundamental law of baryon number conservation which is
universally valid. The measured nucleon number, nuclei number and atomic mass
number are given by the real parts of equations (215) through (217) respectively

$N_{nm} = n_n \cos^2\theta_{Nn}$    $N_m = n \cos^2\theta_N$    $a_m = A \cos^2\theta_a$    (222)

and are not integers.
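
As an illustration of equations (217), (220) and (222), the following minimal sketch builds the complex value of a count from its integer value and an internal phase angle, then takes the real part as the measured value. The helper name `complex_count` and the numerical inputs are hypothetical choices made only for this example.

```python
import cmath
import math


def complex_count(integer_value, theta):
    """Return (magnitude, complex value, measured value) for an integer count."""
    magnitude = integer_value * math.cos(theta)   # e.g. a = A cos(theta_a)
    value = magnitude * cmath.exp(1j * theta)     # e.g. a_bar = a exp(j theta_a)
    return magnitude, value, value.real           # measured value = real part


# Hypothetical values chosen only for illustration.
n_nuclei, A, theta_a = 1000, 60, 0.1
n_nucleons = n_nuclei * A                         # integer relation of equation (221)
a, a_bar, a_measured = complex_count(A, theta_a)
```

The measured value equals $A\cos^2\theta_a$, so it falls below the integer A for any nonzero phase angle.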

In analogy to equation (82) which applies to a single nucleus the addition
law for the complex total nucleon number is given by

$W' + \bar{N}_n = n\bar{a}$    (223)

which is subject to the validity of equation (221). The component equations of
equation (223) are

$W' + n_n \cos^2\theta_{Nn} = nA \cos^2\theta_a$    (224)

$n_n \cos\theta_{Nn} \sin\theta_{Nn} = nA \cos\theta_a \sin\theta_a$    (225)

which are two equations for the two unknown quantities $W'$ and $\theta_{Nn}$. It is easy
to show that the solutions to equations (224) and (225) are

$\cos^2\theta_{Nn} = \tfrac{1}{2}[1 + (1 - 4f'^2)^{1/2}]$    (226)

$W' = nA(\cos^2\theta_a - \cos^2\theta_{Nn}) = nA\{\cos^2\theta_a - \tfrac{1}{2}[1 + (1 - 4f'^2)^{1/2}]\}$    (227)

where

$f' = \cos\theta_a \sin\theta_a$    (228)

where $\theta_a$ is assumed to be a known function of the applied electromagnetic field
strength. The internal phase angle $\theta_N$ of the number of atomic nuclei is also
assumed to be a known function of the external electromagnetic field strength.
Equations (218) through (220) can be used to determine $N_n$, $N$ and $a$ in terms of
the electromagnetic field strength H. The baryon number conservation equation
(221) is a universal law which is valid whether or not an electromagnetic field
is present.
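
The solution (226) through (228) can be evaluated and checked against the component equations (224) and (225). The sketch below is a minimal illustration; the function name `addition_law`, the choice of giving $\theta_{Nn}$ the sign of $f'$, and the input values are assumptions made for the example.

```python
import math


def addition_law(n, A, theta_a):
    """Evaluate equations (226)-(228): return (W', theta_Nn)."""
    f = math.cos(theta_a) * math.sin(theta_a)                      # equation (228)
    root = math.sqrt(max(0.0, 1.0 - 4.0 * f * f))
    cos2_Nn = 0.5 * (1.0 + root)                                   # equation (226)
    W = n * A * (math.cos(theta_a) ** 2 - cos2_Nn)                 # equation (227)
    # Assumption: theta_Nn takes the sign of f' so that (225) holds with sign.
    theta_Nn = math.copysign(math.acos(math.sqrt(cos2_Nn)), f)
    return W, theta_Nn


# Hypothetical inputs for illustration only.
n, A, theta_a = 10, 60, 1.0
W, theta_Nn = addition_law(n, A, theta_a)
n_n = n * A                                                        # equation (221)
lhs_real = W + n_n * math.cos(theta_Nn) ** 2                       # left side of (224)
lhs_imag = n_n * math.cos(theta_Nn) * math.sin(theta_Nn)           # left side of (225)
```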

B.  Radioactive  Decay  of  Nuclei  in  the  Presence  of  an  Electromagnetic  Field. 

Radioactive decay of heavy elements has been studied for many years.58,59
This section considers the radioactive decay of heavy elements located in an
external electromagnetic field. The generalization of the standard radioactive
decay law for elements is written as

$d\bar{N}/d\bar{t} = -\bar{\lambda}\bar{N}$    (229)

where $\bar{N}$ = complex number of atomic nuclei and $\bar{\lambda}$ = complex number radioactive
decay constant which can be written as

$\bar{\lambda} = \lambda \exp(j\theta_\lambda)$    (230)

From equations (216) and (219) it follows that $\bar{N}$ can be written as

$\bar{N} = n \cos\theta_N \exp(j\theta_N)$    $N = n \cos\theta_N$    (231)

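The time derivative of equation (231) can be written as

$d\bar{N}/d\bar{t} = \cos\beta_{tt} \sec\beta_{NN}\, (dN/dt) \exp(j\Phi_{Nt})$    (232)

$\qquad = \cos\beta_{tt} \csc\beta_{NN}\, N (d\theta_N/dt) \exp(j\Phi_{Nt})$    (233)

$\qquad = \sin\beta_{tt} \sec\beta_{NN}\, t^{-1} (dN/d\theta_t) \exp(j\Phi_{Nt})$    (234)

$\qquad = \sin\beta_{tt} \csc\beta_{NN}\, t^{-1} N (d\theta_N/d\theta_t) \exp(j\Phi_{Nt})$    (235)

where

$\tan\beta_{NN} = N\,\partial\theta_N/\partial N$    (236)

$\Phi_{Nt} = \theta_N + \beta_{NN} - \theta_t - \beta_{tt}$    (237)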


Then the law of radioactive decay of elements given in equation (229) can be
written in any of the following forms

$\cos\beta_{tt} \sec\beta_{NN}\, dN/dt = -\lambda N$    (238)

$\cos\beta_{tt} \csc\beta_{NN}\, d\theta_N/dt = -\lambda$    (239)

$\sin\beta_{tt} \sec\beta_{NN}\, t^{-1}\, dN/d\theta_t = -\lambda N$    (240)

$\sin\beta_{tt} \csc\beta_{NN}\, t^{-1}\, d\theta_N/d\theta_t = -\lambda$    (241)

combined with the following phase angle relationship

$\Phi_{Nt} = \theta_\lambda + \theta_N$    (242)

Combining equations (237) and (242) gives

$\beta_{NN} - \theta_t - \beta_{tt} = \theta_\lambda$    (243)

The derivative $d\theta_N/dt$ that appears in equation (233) is written as

$d\theta_N/dt = \partial\theta_N/\partial n\; dn/dt + \partial\theta_N/\partial H\; dH/dt$    (244)

whereas the derivative $dN/dt$ that appears in equation (232) is obtained from
equation (231) to be

$dN/dt = dn/dt\, \cos\theta_N - n \sin\theta_N\, d\theta_N/dt$    (245)

$\qquad = dn/dt\,(\cos\theta_N - n \sin\theta_N\, \partial\theta_N/\partial n) - n \sin\theta_N\, \partial\theta_N/\partial H\; dH/dt$

Equations (238) through (243) give the general forms of the law of radioactive
decay of elements in the presence of an external field.

An additional form of the law of radioactive decay can be obtained by
noting that in an analogous fashion to equations (122) through (124) the time
derivative $d\bar{N}/d\bar{t}$ that is required in equation (229) can be obtained from
equations (232), (233), (244) and (245) to be

$d\bar{N}/d\bar{t} = \cos\beta_{tt} \sec\beta_{NN} \cos\theta_N\, J_N \exp(j\Phi_{Nt})$    (246)

$\qquad = \cos\beta_{tt} \csc\beta_{NN} \cos\theta_N\, I_N \exp(j\Phi_{Nt})$    (247)

$\qquad = \cos\beta_{tt} \cos\theta_N\, (I_N^2 + J_N^2)^{1/2} \exp(j\Phi_{Nt})$    (248)

where

$I_N = \tan\alpha^H_{nn}\, dn/dt + (n/H) \tan\alpha^n_{HH}\, dH/dt$    (249)

$J_N = (1 - \tan\theta_N \tan\alpha^H_{nn})\, dn/dt - (n/H) \tan\theta_N \tan\alpha^n_{HH}\, dH/dt$    (250)

$\tan\beta_{NN} = N\,\partial\theta_N/\partial N = I_N/J_N$    (251)

$\tan\alpha^H_{nn} = n\,\partial\theta_N/\partial n$    (252)

$\tan\alpha^n_{HH} = H\,\partial\theta_N/\partial H$    (253)

Then equation (229) can be written as

$\cos\beta_{tt} \cos\theta_N\, (I_N^2 + J_N^2)^{1/2} \exp(j\Phi_{Nt}) = -\lambda n \cos\theta_N \exp[j(\theta_\lambda + \theta_N)]$    (254)

which gives the radioactive decay law as

$\cos\beta_{tt}\, (I_N^2 + J_N^2)^{1/2} = -\lambda n$    (255)

and equations (242) or (243) for the phase angle relationship. Equation (255)
is equivalent to equation (238) as can be seen from equations (245) through
(248).

At this point the standard law of radioactive decay can be regained when
$\theta_N$ = constant, $\theta_t = 0$ and $\beta_{tt} = 0$ in which case equations (229), (232), (245)
and (254) reduce to

$\cos\theta_N\, dn/dt\, \exp(j\theta_N) = -\lambda n \cos\theta_N \exp[j(\theta_N + \theta_\lambda)]$    (256)

which gives the standard law of radioactive decay58,59

$dn/dt = -\lambda n$    $\theta_\lambda = 0$    (257)

which corresponds to incoherent radioactive decay of atomic nuclei. This
limiting case can also be obtained from equations (246) through (255) by
noting that

$\tan\beta_{NN} = \tan\alpha^H_{nn} = \tan\alpha^n_{HH} = 0$    (258)

$\beta_{NN} = 0$    $\alpha^H_{nn} = 0$    $\alpha^n_{HH} = 0$    $I_N = 0$    $J_N = dn/dt$    (259)

and equations (255) and (243) become equation (257) which is the standard
equation for incoherent radioactive decay.


For the case of a static magnetic field, equations (249), (250) and (255)
reduce respectively to

$I^H_N = \tan\alpha^H_{nn}\, dn/dt$    (260)

$J^H_N = (1 - \tan\theta_N \tan\alpha^H_{nn})\, dn/dt$    (261)

$Y \cos\beta_{tt}\, dn/dt = -\lambda n$    (262)

where

$Y = [\tan^2\alpha^H_{nn} + (1 - \tan\theta_N \tan\alpha^H_{nn})^2]^{1/2}$    (263)

Equation (262) can be rewritten as

$dn/dt = -\lambda' n$

where the effective decay constant is given by

$\lambda' = \lambda Y^{-1} \sec\beta_{tt}$    (265)

Therefore the effective radioactive decay constant is predicted to be an
increasing function of the strength of an applied static magnetic field. The
phase angle condition for this special case is obtained by first noting that
from equation (251) it follows that for H = constant

$\tan\beta^H_{NN} = I^H_N/J^H_N = \tan\alpha^H_{nn}\,(1 - \tan\theta_N \tan\alpha^H_{nn})^{-1}$    (266)

and equation (243) gives the phase angle condition as

$\theta^H_\lambda = \beta^H_{NN} - \theta_t - \beta_{tt}$    (267)

where $\theta^H_\lambda$ = internal phase angle of the radioactive decay constant for atomic
nuclei in a constant magnetic field.
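
For a quick numerical feel for equations (263) and (265), the following minimal sketch evaluates the factor Y and the effective decay constant from given angles. The function names and the sample angle values are hypothetical; the paper does not supply numerical values for $\theta_N$, $\alpha^H_{nn}$ or $\beta_{tt}$ at this point.

```python
import math


def y_factor(theta_N, alpha_nn):
    """Equation (263): Y = [tan^2(alpha) + (1 - tan(theta_N) tan(alpha))^2]^(1/2)."""
    t_a = math.tan(alpha_nn)
    return math.hypot(t_a, 1.0 - math.tan(theta_N) * t_a)


def effective_decay_constant(lam, theta_N, alpha_nn, beta_tt=0.0):
    """Equation (265): lambda' = lambda / (Y cos(beta_tt))."""
    return lam / (y_factor(theta_N, alpha_nn) * math.cos(beta_tt))


# Hypothetical inputs for illustration only.
lam = 0.1
lam_no_field = effective_decay_constant(lam, theta_N=0.2, alpha_nn=0.0)
lam_with_field = effective_decay_constant(lam, theta_N=0.2, alpha_nn=0.3, beta_tt=0.05)
```

With $\alpha^H_{nn} = 0$ and $\beta_{tt} = 0$ the factor Y is unity and the sketch returns the ordinary decay constant, which matches the incoherent limit of equation (257).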

C.  Coherent  Radioactive  Decay  of  Atomic  Nuclei. 

Consider now the case when n = constant which corresponds to a radioactive
decay where the integer number of atomic nuclei remains constant and
only the internal phase angle $\theta_N$ changes due to the presence of a time
dependent electromagnetic field. For n = constant equations (249) and (250) give

$I^n_N = (n/H) \tan\alpha^n_{HH}\, dH/dt$    (268)

$J^n_N = -(n/H) \tan\theta_N \tan\alpha^n_{HH}\, dH/dt$    (269)

while equation (251) gives for n = constant

$\tan\beta^n_{NN} = -\cot\theta_N$    $\beta^n_{NN} = \pi/2 + \theta_N$    (270)

Then equations (253), (255), (268) and (269) and equations (243) and (270)
become the following internal phase radioactive decay equations

$\cos\beta_{tt} \sec\theta_N\, d\theta_N/dt = -\lambda$    (271)

$\theta_\lambda = \beta^n_{NN} - \theta_t - \beta_{tt} = \pi/2 + \theta_N - \theta_t - \beta_{tt}$    (272)

where $\beta_{tt}$ is given by equation (10). Equations (271) and (272) can also be
derived directly from equations (229) and (231) by noting that for n = constant

$d\bar{N}/d\bar{t} = n \cos\beta_{tt}\, d\theta_N/dt\, \exp[j(\pi/2 + 2\theta_N - \theta_t - \beta_{tt})]$    (273)

$\qquad = \bar{N} \cos\beta_{tt} \sec\theta_N\, d\theta_N/dt\, \exp[j(\pi/2 + \theta_N - \theta_t - \beta_{tt})]$

and

$d\bar{N} = n\, d\theta_N \exp[j(\pi/2 + 2\theta_N)]$    $|d\bar{N}| = n\, d\theta_N$    (274)

$\qquad = jn\, d\theta_N \exp(j2\theta_N)$

$\qquad = j\bar{N} \sec\theta_N\, d\theta_N \exp(j\theta_N)$

$\qquad = \bar{N} \sec\theta_N\, d\theta_N \exp[j(\pi/2 + \theta_N)]$

Equation (274) represents the coherent form of change of the special type of
complex number given in equations (216) and (219). Equations (271) and (272)
are coupled simultaneous differential equations that determine $\theta_N$ and $\theta_t$.

Thus for slowly changing values of $\theta_t$ equation (272) can be written as

$\theta_\lambda \approx \pi/2 + \theta_N - \theta_t - t\,\partial\theta_t/\partial t$    (275)

which is an approximate equation.

The solution to the coupled internal phase radioactive decay equations
(271) and (272) is in general not simply obtained and can in general be found
only by numerical methods on a computer. Some insight can be obtained, however, by
assuming that $\theta_t(t)$ is a known function and solving equation (271) for $\theta_N$.
Simple integration of equation (271) gives

$(\sec\theta_N + \tan\theta_N)/(\sec\theta^o_N + \tan\theta^o_N) = \exp[-\lambda g(t)]$    (276)

where

$g(t) = \int_0^t \sec\beta_{tt}\, dt$    (277)

and where $\theta_N = \theta^o_N$ for t = 0 is an initial condition. For $t \to \infty$ it will be
assumed that $g(t) \to \infty$ so that for this limit

$\sec\theta^\infty_N + \tan\theta^\infty_N = 0$    (278)

which gives

$\theta^\infty_N = -\pi/2$    (279)

From equations (272) and (275) it follows that the asymptotic values of $\theta_\lambda$ are
given for $t \to \infty$ by

$\theta_\lambda = -\theta_t - \beta_{tt} \approx -\theta_t - t\,\partial\theta_t/\partial t$    (280)

So that for $t \to \infty$ the following approximate equation is valid

$\theta_t \approx -\theta_\lambda + c/t$    (281)

where c = constant. Therefore for $t \to \infty$ it follows that $\theta_t \to -\theta_\lambda$. Therefore
for a system of radioactive nuclei that is decaying by changes in the internal
phase angle of the nuclei number for a fixed integer number of nuclei, the
equilibrium value of $\theta_t$ that is obtained after a long period of time is given
by $\theta_t = -\theta_\lambda$. Chemical reactions obeying the mass action law can also occur by
changes in the internal phase angles of the reactant species numbers.
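
The integrated form (276) can be inverted in closed form when $\beta_{tt}$ is taken as constant, so that $g(t) = t \sec\beta_{tt}$. The sketch below is a minimal illustration under that assumption, using the identity $\sec\theta + \tan\theta = \tan(\pi/4 + \theta/2)$; the function name `theta_N_of_t` and all numerical values are hypothetical.

```python
import math


def theta_N_of_t(t, lam, theta0, beta_tt=0.0):
    """Invert equation (276) for constant beta_tt (assumption): g(t) = t / cos(beta_tt)."""
    g = t / math.cos(beta_tt)
    k0 = 1.0 / math.cos(theta0) + math.tan(theta0)     # sec(theta0) + tan(theta0)
    # sec(theta) + tan(theta) = tan(pi/4 + theta/2), solved for theta
    return 2.0 * math.atan(k0 * math.exp(-lam * g)) - math.pi / 2.0


# Hypothetical inputs for illustration only.
lam, theta0, beta_tt = 0.5, 0.3, 0.1
theta_start = theta_N_of_t(0.0, lam, theta0, beta_tt)
theta_late = theta_N_of_t(100.0, lam, theta0, beta_tt)

# Central-difference check that the result satisfies equation (271) at t = 1.
dt = 1e-5
dtheta_dt = (theta_N_of_t(1.0 + dt, lam, theta0, beta_tt)
             - theta_N_of_t(1.0 - dt, lam, theta0, beta_tt)) / (2.0 * dt)
residual = math.cos(beta_tt) / math.cos(theta_N_of_t(1.0, lam, theta0, beta_tt)) * dtheta_dt + lam
```

The late-time value approaches $-\pi/2$, in line with equation (279).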

4.  NUCLEAR  MASS  FORMULA  FOR  ATOMIC  NUCLEI  IN  AN  EXTERNAL  ELECTROMAGNETIC
OR  GRAVITATIONAL  FIELD.  Nuclear mass formulas are important tools for
investigating nuclear structure and nuclear processes such as fission and the neutron
and proton capture by atomic nuclei.60-67 The Weizsäcker-Bethe liquid drop
mass formula was developed years ago to represent nuclear masses by combined
classical and quantum mechanical techniques.60-67 This section develops a
broken symmetry form of the liquid drop nuclear mass formula that is required
to investigate the clean fission of lighter than actinide nuclei by thermal
neutrons in the presence of a $\gamma$ ray field that is considered in Sections 5
through 7.




A.  Complex  Number  Nuclear  Radius  for  Subactinide  Nuclei.

The concept of a nuclear radius enters directly into the derivation of
the liquid drop type of nuclear mass formula. The simplest nuclear radius
formula is60-67

$R = bA^{1/3}$    $b = 1.2 \times 10^{-15}$ m = 1.2 fm    (282)

In an external field the nuclear radius must be represented by a complex
number in an internal space. The complex number generalization of equation (282)
is written as

$\bar{R} = R \exp(j\theta_R) = \bar{b}\bar{a}^{1/3}$    (283)

where $\bar{b} = b \exp(j\theta_b)$ = complex number constant and $\bar{a}$ = complex number atomic
mass number given by equation (42). Combining equations (217), (220) and (283)
gives for subactinide nuclei

$\bar{R} = bA^{1/3} \cos^{1/3}\theta_a \exp[j(\theta_b + \theta_a/3)]$    (284)

$R = bA^{1/3} \cos^{1/3}\theta_a$    $\theta_R = \theta_b + \theta_a/3$    (285)

The measured nuclear radius is given by the real part of equation (284)

$R_m = R \cos\theta_R = bA^{1/3} \cos^{1/3}\theta_a \cos(\theta_b + \theta_a/3)$    (286)

Equation (286) can be rewritten as

$R_m = b'A^{1/3}$    (287)

where for subactinide nuclei

$b' = b \cos^{1/3}\theta_a \cos(\theta_b + \theta_a/3)$    (288)

The effective radius constant b' decreases with increasing strength of the
applied gravitational or electromagnetic field. For the relatively weak
gravitational field of the earth the internal phase angles are very small and
therefore for this case

$b' \approx b \approx 1.2$ fm    (289)

For strong gravitational or electromagnetic fields b' is given by equation
(288) with b = 1.2 fm and b' < b.
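
Equations (286) through (288) are simple to evaluate. The following minimal sketch computes the effective radius constant b' and the measured radius for given phase angles; the function names and the sample angles are hypothetical, and only b = 1.2 fm comes from equation (282).

```python
import math


def effective_radius_constant(theta_a, theta_b=0.0, b=1.2):
    """Equation (288): b' = b cos^(1/3)(theta_a) cos(theta_b + theta_a/3), in fm."""
    return b * math.cos(theta_a) ** (1.0 / 3.0) * math.cos(theta_b + theta_a / 3.0)


def measured_radius(A, theta_a, theta_b=0.0, b=1.2):
    """Equation (287): R_m = b' A^(1/3), in fm."""
    return effective_radius_constant(theta_a, theta_b, b) * A ** (1.0 / 3.0)


# Hypothetical phase angles for illustration only.
b_weak = effective_radius_constant(0.0)        # zero phase angles recover b
b_strong = effective_radius_constant(0.5, 0.1)
r_64 = measured_radius(64, 0.0)
```

The formula assumes $|\theta_a| < \pi/2$ so that the cube root of the cosine is real.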

B.  Binding  Energy  of  Atomic  Nuclei  Located  in  an  Electromagnetic 
or  Gravitational  Field. 


The standard expression for the binding energy B of a nucleus (Z,A) is
given by the liquid drop model as60-67

$B = E_v - E_s - E_c - E_{sym} + E_{pair} + E_{shell}$    (290)

$\quad = \alpha A - \gamma A^{2/3} - \delta Z^2/A^{1/3} - \beta(N - Z)^2/A + E_{pair} + E_{shell}$

where $E_v$, $E_s$, $E_c$, $E_{sym}$, $E_{pair}$ and $E_{shell}$ = volume, surface, Coulomb,
symmetry, nuclear pairing and nuclear shell energies respectively, and where $\alpha$,
$\gamma$, $\delta$ and $\beta$ = volume, surface, Coulomb and symmetry energy coefficients
respectively, and where from equation (84) it follows that N - Z = A - 2Z. The
average binding energy per nucleon $\varepsilon = B/A$ is written as60-67

$\varepsilon = \varepsilon_v - \varepsilon_s - \varepsilon_c - \varepsilon_{sym} + \varepsilon_{pair} + \varepsilon_{shell}$    (291)

$\quad = \alpha - \gamma A^{-1/3} - \delta Z^2/A^{4/3} - \beta[(N - Z)/A]^2 + \varepsilon_{pair} + \varepsilon_{shell}$

where $\varepsilon_v$, $\varepsilon_s$, $\varepsilon_c$, $\varepsilon_{sym}$, $\varepsilon_{pair}$ and $\varepsilon_{shell}$ = average volume energy per
nucleon, average surface energy per nucleon, average Coulomb energy per nucleon,
average symmetry energy per nucleon, average pairing energy per nucleon and the
average shell energy per nucleon respectively. More complicated forms of the
nuclear symmetry energy have been considered by including the effects of the
nuclear bulk modulus.68 However, in this paper only the simple Weizsäcker-Bethe
form given in equation (291) is considered.

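For a nucleus in the presence of an electromagnetic or gravitational field
the complex number nuclear binding energy is written as

$\bar{B} = \bar{E}_v - \bar{E}_s - \bar{E}_c - \bar{E}_{sym} + \bar{E}_{pair} + \bar{E}_{shell}$    (292)

$\quad = \bar{\alpha}\bar{a} - \bar{\gamma}\bar{a}^{2/3} - \bar{\delta}\bar{z}^2/\bar{a}^{1/3} - \bar{\beta}(\bar{n} - \bar{z})^2/\bar{a} + \bar{E}_{pair} + \bar{E}_{shell}$

where $\bar{z}$, $\bar{n}$ and $\bar{a}$ are given by equations (40) through (42) respectively, $\bar{E}_v$, $\bar{E}_s$,
$\bar{E}_c$, $\bar{E}_{sym}$, $\bar{E}_{pair}$ and $\bar{E}_{shell}$ = complex number volume, surface, Coulomb, symmetry,
pairing and shell energies respectively, and where $\bar{\alpha}$, $\bar{\gamma}$, $\bar{\delta}$ and $\bar{\beta}$ = complex
number volume, surface, Coulomb and symmetry energy coefficients respectively. The
mass formula coefficients are represented as

$\bar{\alpha} = \alpha \exp(j\theta_\alpha)$    $\bar{\gamma} = \gamma \exp(j\theta_\gamma)$    (293)

$\bar{\delta} = \delta \exp(j\theta_\delta)$    $\bar{\beta} = \beta \exp(j\theta_\beta)$    (294)

$\bar{E}_{pair} = E_{pair} \exp(j\theta_{Epair})$    $\bar{E}_{shell} = E_{shell} \exp(j\theta_{Eshell})$    (295)

The average complex number binding energy per nucleon $\bar{\varepsilon}$ is written as

$\bar{\varepsilon} = \bar{\varepsilon}_v - \bar{\varepsilon}_s - \bar{\varepsilon}_c - \bar{\varepsilon}_{sym} + \bar{\varepsilon}_{pair} + \bar{\varepsilon}_{shell}$    (296)

$\quad = \bar{\alpha} - \bar{\gamma}/\bar{a}^{1/3} - \bar{\delta}\bar{z}^2/\bar{a}^{4/3} - \bar{\beta}[(\bar{n} - \bar{z})/\bar{a}]^2 + \bar{\varepsilon}_{pair} + \bar{\varepsilon}_{shell}$

where $\bar{\varepsilon}_v$, $\bar{\varepsilon}_s$, $\bar{\varepsilon}_c$, $\bar{\varepsilon}_{sym}$, $\bar{\varepsilon}_{pair}$ and $\bar{\varepsilon}_{shell}$ = complex number average volume,
surface, Coulomb, symmetry, pairing and shell energies per nucleon respectively.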

The complex number neutron excess that appears in the symmetry energy
terms in equations (292) and (296) is written as

$\bar{\zeta} = \zeta \exp(j\theta_\zeta) = \bar{n} - \bar{z} = \bar{W} + \bar{a} - 2\bar{z}$    (297)

where $\bar{n}$, $\bar{z}$ and $\bar{a}$ are given by equations (40) through (42) and are related by
equation (82). Then for subactinide nuclei

$\zeta^2 = n^2 + z^2 - 2zn \cos(\theta_n - \theta_z)$    (298)

$\quad = N^2 \cos^2\theta_n + Z^2 \cos^2\theta_z - 2ZN \cos\theta_z \cos\theta_n \cos(\theta_n - \theta_z)$

$\tan\theta_\zeta = (n \sin\theta_n - z \sin\theta_z)/(n \cos\theta_n - z \cos\theta_z)$    (299)

$\quad = (N \cos\theta_n \sin\theta_n - Z \cos\theta_z \sin\theta_z)/(N \cos^2\theta_n - Z \cos^2\theta_z)$

For the approximation $\theta_z \approx \theta_n$, which is valid near the valley of beta stability,
it follows from equations (298) and (299) that for subactinide nuclei

$\zeta \approx n - z = N \cos\theta_n - Z \cos\theta_z \approx (N - Z)\cos\theta_z$    (300)

$\theta_\zeta \approx \theta_z \approx \theta_n$    (301)

Combining equations (296) and (297) gives

$\bar{\varepsilon} = \bar{\alpha} - \bar{\gamma}/\bar{a}^{1/3} - \bar{\delta}\bar{z}^2/\bar{a}^{4/3} - \bar{\beta}(\bar{\zeta}/\bar{a})^2 + \bar{\varepsilon}_{pair} + \bar{\varepsilon}_{shell}$    (302)

From equation (297) it follows that the following approximation is valid for
$\bar{W} \approx 0$ in the subactinide elements

$\bar{\zeta}/\bar{a} = (\zeta/a) \exp[j(\theta_\zeta - \theta_a)]$    (303)

$\quad \approx 1 - 2\bar{z}/\bar{a}$

$\quad = 1 - 2(Z \cos\theta_z)/(A \cos\theta_a) \exp[j(\theta_z - \theta_a)]$

Combining equations (302) and (303) gives the following approximation

$\bar{\varepsilon} \approx \bar{\alpha} - \bar{\gamma}/\bar{a}^{1/3} - \bar{\delta}\bar{z}^2/\bar{a}^{4/3} - \bar{\beta}(1 - 2\bar{z}/\bar{a})^2 + \bar{\varepsilon}_{pair} + \bar{\varepsilon}_{shell}$    (304)

which is an approximate form of the exact equation (302).

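Equation (292) can be written for the subactinides as

$\bar{B} = \alpha A \cos\theta_a \exp[j(\theta_\alpha + \theta_a)] - \gamma A^{2/3} \cos^{2/3}\theta_a \exp[j(\theta_\gamma + \tfrac{2}{3}\theta_a)]$    (305)

$\quad - \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a \exp[j(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a)]$

$\quad - \beta\zeta^2 A^{-1} \cos^{-1}\theta_a \exp[j(\theta_\beta + 2\theta_\zeta - \theta_a)]$

$\quad + E_{pair} \exp(j\theta_{Epair}) + E_{shell} \exp(j\theta_{Eshell})$

As an approximation the phase angles of the terms in equation (305) are taken
to be equal

$\theta_B \approx \theta_\alpha + \theta_a \approx \theta_\gamma + \tfrac{2}{3}\theta_a \approx \theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a \approx \theta_\beta + 2\theta_\zeta - \theta_a \approx \theta_{Epair} \approx \theta_{Eshell}$    (306)

Then the magnitude of the binding energy is obtained from equation (305) to be

$B \approx \alpha A \cos\theta_a - \gamma A^{2/3} \cos^{2/3}\theta_a - \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a$    (307)

$\quad - \beta\zeta^2 A^{-1} \cos^{-1}\theta_a + E_{pair} + E_{shell}$

Equation (88) gives $\theta_a = \theta_a(\theta_z, \theta_n, Z, A)$ so that in all further calculations it
should be understood that $\theta_a$ is not really an independent variable. The
approximation $\theta_z \approx \theta_n$ allows equation (307) to be written as

$B \approx \alpha A \cos\theta_a - \gamma A^{2/3} \cos^{2/3}\theta_a - \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a$    (308)

$\quad - \beta(N - Z)^2 A^{-1} \cos^2\theta_z \cos^{-1}\theta_a + E_{pair} + E_{shell}$

The further approximation $\theta_\zeta \approx \theta_z \approx \theta_n \approx \theta_a$ allows equations (306) and (307) to
be written for the subactinides as

$\theta_B \approx \theta_\alpha + \theta_a \approx \theta_\gamma + \tfrac{2}{3}\theta_a \approx \theta_\delta + \tfrac{5}{3}\theta_a \approx \theta_\beta + \theta_a \approx \theta_{Epair} \approx \theta_{Eshell}$    (309)

$B \approx \alpha A \cos\theta_a - \gamma A^{2/3} \cos^{2/3}\theta_a - \delta Z^2 A^{-1/3} \cos^{5/3}\theta_a$    (310)

$\quad - \beta(N - Z)^2 A^{-1} \cos\theta_a + E_{pair} + E_{shell}$

which are valid in the valley of beta stability.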
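
Equation (310) can be evaluated directly once coefficient values are chosen. The sketch below is a minimal illustration: the default coefficients (15.75, 17.8, 0.711 and 23.7 MeV) are commonly quoted textbook liquid drop values and are an assumption here, since the paper does not list numerical coefficients in this section. The pairing and shell terms are set to zero by default, and the function name is hypothetical.

```python
import math


def binding_energy(Z, A, theta_a=0.0, alpha=15.75, gamma=17.8,
                   delta=0.711, beta=23.7, E_pair=0.0, E_shell=0.0):
    """Equation (310) in MeV; the default coefficients are assumed textbook values."""
    N = A - Z
    c = math.cos(theta_a)
    return (alpha * A * c
            - gamma * A ** (2.0 / 3.0) * c ** (2.0 / 3.0)
            - delta * Z ** 2 * A ** (-1.0 / 3.0) * c ** (5.0 / 3.0)
            - beta * (N - Z) ** 2 / A * c
            + E_pair + E_shell)


# Iron-56 with zero phase angle reduces to the standard liquid drop estimate.
B_zero_field = binding_energy(26, 56)
# A hypothetical nonzero phase angle, chosen only for illustration.
B_with_angle = binding_energy(26, 56, theta_a=0.2)
```

For zero phase angle the expression reduces to equation (290) without the pairing and shell terms.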

Each term in the expression for the binding energy and average binding
energy per nucleon will now be considered separately with the exception of the
nuclear pairing and shell effects which are more complicated and are not
considered in this paper.

a.  Volume  Energy  Term  for  Subactinide  Nuclei. 

The volume energy terms are written as

$\bar{E}_v = \bar{\alpha}\bar{a} = \alpha a \exp[j(\theta_\alpha + \theta_a)] = \alpha A \cos\theta_a \exp[j(\theta_\alpha + \theta_a)]$    (311)

$\bar{\varepsilon}_v = \bar{\alpha} = \alpha \exp(j\theta_\alpha)$    (311A)

The volume energy per nucleon $\bar{\alpha}$ describes the energy per nucleon of infinite
nuclear matter but evaluated at the central density of a nucleus.60-68 The
following complex number generalization of a simple density dependent expression
for $\bar{\alpha}$ can be used68

$\bar{\alpha} = \bar{b}_2 \bar{k}_c^2 - \bar{b}_3 \bar{k}_c^3 + \bar{b}_5 \bar{k}_c^5 + \bar{\alpha}_{ex}$    (312)

where

$\bar{b}_2 = b_2 \exp(j\theta_{b2})$    $\bar{b}_3 = b_3 \exp(j\theta_{b3})$    (313)

$\bar{b}_5 = b_5 \exp(j\theta_{b5})$    $\bar{\alpha}_{ex} = \alpha_{ex} \exp(j\theta_{\alpha ex})$    (314)

where $\bar{\alpha}_{ex}$ = complex number exchange energy term, and where $\bar{k}_c$ = complex number
Fermi wave number which is related to the complex number central density of a
nucleus. The complex particle number density at the center of a nucleus is
given by the following generalization of the standard scalar result56

$\bar{n}_c = 2/(3\pi^2)\, \bar{k}_c^3$    (315)

where

$\bar{n}_c = n_c \exp(j\theta_{nc})$    $\bar{k}_c = k_c \exp(j\theta_{kc})$    (316)

$n_c = 2/(3\pi^2)\, k_c^3$    $\theta_{nc} = 3\theta_{kc}$    (317)

where $k_c \approx 1.35$ fm$^{-1}$. The values of $\alpha$ and $\theta_\alpha$ can be obtained by taking the real
and imaginary parts of equation (312) as follows

$\alpha \cos\theta_\alpha = b_2 k_c^2 \cos(\theta_{b2} + 2\theta_{kc}) - b_3 k_c^3 \cos(\theta_{b3} + 3\theta_{kc})$    (318)

$\quad + b_5 k_c^5 \cos(\theta_{b5} + 5\theta_{kc}) + \alpha_{ex} \cos\theta_{\alpha ex}$

$\alpha \sin\theta_\alpha = b_2 k_c^2 \sin(\theta_{b2} + 2\theta_{kc}) - b_3 k_c^3 \sin(\theta_{b3} + 3\theta_{kc})$    (319)

$\quad + b_5 k_c^5 \sin(\theta_{b5} + 5\theta_{kc}) + \alpha_{ex} \sin\theta_{\alpha ex}$

In general $\theta_{b2} = 0$
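b.  Surface  Energy  Term  for  Subactinide  Nuclei.

The surface energy terms are written as

$\bar{E}_s = \bar{\gamma}\bar{a}^{2/3} = \gamma a^{2/3} \exp[j(\theta_\gamma + \tfrac{2}{3}\theta_a)]$    (320)

$\quad = \gamma A^{2/3} \cos^{2/3}\theta_a \exp[j(\theta_\gamma + \tfrac{2}{3}\theta_a)]$

$\bar{\varepsilon}_s = \bar{\gamma}/\bar{a}^{1/3} = \gamma a^{-1/3} \exp[j(\theta_\gamma - \tfrac{1}{3}\theta_a)]$    (321)

$\quad = \gamma A^{-1/3} \cos^{-1/3}\theta_a \exp[j(\theta_\gamma - \tfrac{1}{3}\theta_a)]$

The complex number surface energy coefficient can be written as the following
generalization of the scalar result68

$\bar{\gamma} = \bar{c}_2 \bar{k}_c^2 - \bar{c}_3 \bar{k}_c^3 + \bar{c}_5 \bar{k}_c^5 + \bar{\gamma}_{ex}$    (322)

where

$\bar{c}_2 = c_2 \exp(j\theta_{c2})$    $\bar{c}_3 = c_3 \exp(j\theta_{c3})$    (323)

$\bar{c}_5 = c_5 \exp(j\theta_{c5})$    $\bar{\gamma}_{ex} = \gamma_{ex} \exp(j\theta_{\gamma ex})$    (324)

where $\bar{k}_c$ is related to the complex particle number density at the center of a
nucleus by equation (315), and where $\bar{\gamma}_{ex}$ = exchange energy term for the nuclear
surface. Formally one can calculate $\gamma$ and $\theta_\gamma$ by taking the real and imaginary
parts of equation (322) as follows

$\gamma \cos\theta_\gamma = c_2 k_c^2 \cos(\theta_{c2} + 2\theta_{kc}) - c_3 k_c^3 \cos(\theta_{c3} + 3\theta_{kc})$    (325)

$\quad + c_5 k_c^5 \cos(\theta_{c5} + 5\theta_{kc}) + \gamma_{ex} \cos\theta_{\gamma ex}$

$\gamma \sin\theta_\gamma = c_2 k_c^2 \sin(\theta_{c2} + 2\theta_{kc}) - c_3 k_c^3 \sin(\theta_{c3} + 3\theta_{kc})$    (326)

$\quad + c_5 k_c^5 \sin(\theta_{c5} + 5\theta_{kc}) + \gamma_{ex} \sin\theta_{\gamma ex}$

from which $\gamma$ and $\theta_\gamma$ are easily obtained.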
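
Extracting a magnitude and phase from the real and imaginary part equations is the same as summing the complex terms and converting the result to polar form. The following minimal sketch does this for the surface coefficient with Python's `cmath` module. Every numerical value below is a hypothetical placeholder, since the paper gives no values for $c_2$, $c_3$, $c_5$, $\gamma_{ex}$ or their phase angles here.

```python
import cmath
import math


def polar_coefficient(c2, c3, c5, g_ex, k_c):
    """Sum the complex terms of equation (322) and return (magnitude, phase)."""
    total = c2 * k_c ** 2 - c3 * k_c ** 3 + c5 * k_c ** 5 + g_ex
    return abs(total), cmath.phase(total)


# Hypothetical placeholder magnitudes and phase angles (not from the paper).
c2_m, th_c2 = 30.0, 0.0
c3_m, th_c3 = 20.0, 0.02
c5_m, th_c5 = 2.0, 0.03
gex_m, th_gex = 5.0, 0.01
k_m, th_k = 1.35, 0.01

gamma, theta_gamma = polar_coefficient(
    cmath.rect(c2_m, th_c2), cmath.rect(c3_m, th_c3),
    cmath.rect(c5_m, th_c5), cmath.rect(gex_m, th_gex),
    cmath.rect(k_m, th_k))

# Real part written out term by term, as in equation (325).
real_part = (c2_m * k_m ** 2 * math.cos(th_c2 + 2 * th_k)
             - c3_m * k_m ** 3 * math.cos(th_c3 + 3 * th_k)
             + c5_m * k_m ** 5 * math.cos(th_c5 + 5 * th_k)
             + gex_m * math.cos(th_gex))
```

The same routine applies unchanged to the volume and symmetry energy coefficients, which have the same polynomial form in $\bar{k}_c$.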

c.  Coulomb  Energy  Term  for  Subactinide  Nuclei. 

The complex number Coulomb energy terms are written as

$\bar{E}_c = \bar{\delta}\bar{z}^2/\bar{a}^{1/3} = \delta z^2 a^{-1/3} \exp[j(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a)]$    (327)

$\quad = \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a \exp[j(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a)]$

$\bar{\varepsilon}_c = \bar{\delta}\bar{z}^2/\bar{a}^{4/3} = \delta z^2 a^{-4/3} \exp[j(\theta_\delta + 2\theta_z - \tfrac{4}{3}\theta_a)]$    (328)

$\quad = \delta Z^2 A^{-4/3} \cos^2\theta_z \cos^{-4/3}\theta_a \exp[j(\theta_\delta + 2\theta_z - \tfrac{4}{3}\theta_a)]$

The complex number Coulomb energy coefficient can be written as a generalization
of the scalar result69

$\bar{\delta} = \tfrac{3}{5}(e^2/\bar{b}) = 0.863/\bar{b}$ MeV    (329)

$\quad = (0.863/1.523)\,\bar{k}_c$ MeV

where $\bar{b}$ = complex number radius parameter defined in equation (283). Equation
(329) is equivalent to

$\delta = 0.863/b = (0.863/1.523)\,k_c$ MeV    (330)

$\theta_\delta = -\theta_b = \theta_{kc}$    (331)

For simplicity it will be assumed that the wave number $k_c$ of the central density
of an atomic nucleus is approximately equal to the wave number of infinite
nuclear matter $k_F$, so that

$k_c \approx k_F = 1.35$ fm$^{-1}$    (332)

but in fact $k_c$ is slightly larger or smaller than $k_F$ due to Coulomb and surface
forces.68

d.  Symmetry  Energy  Term  for  Subactinide  Nuclei. 

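The complex number symmetry energy terms are written as

$$ \bar{E}_{sym} = \bar\beta \bar\xi^2/\bar{a} = \beta \xi^2 a^{-1} \exp[j(\theta_\beta + 2\theta_\xi - \theta_a)] = \beta \xi^2 A^{-1} \cos^{-1}\theta_a \exp[j(\theta_\beta + 2\theta_\xi - \theta_a)] \qquad (333) $$

$$ \bar{\varepsilon}_{sym} = \bar\beta (\bar\xi/\bar{a})^2 = \beta \xi^2 a^{-2} \exp[j(\theta_\beta + 2\theta_\xi - 2\theta_a)] = \beta \xi^2 A^{-2} \cos^{-2}\theta_a \exp[j(\theta_\beta + 2\theta_\xi - 2\theta_a)] \qquad (334) $$

where $\bar\xi$, $\xi$ and $\theta_\xi$ are given by equations (297) through (299) respectively. Equations (333) and (334) can be simplified by assuming the approximation $\theta_z \approx \theta_n$; then equations (298) and (299) give

$$ \xi \approx (N - Z)\cos\theta_z \qquad \theta_\xi \approx \theta_z \approx \theta_n \qquad (335) $$

and equations (333) and (334) become

$$ \bar{E}_{sym} \approx \beta (N - Z)^2 A^{-1} \cos^{-1}\theta_a \cos^2\theta_z \exp[j(\theta_\beta + 2\theta_z - \theta_a)] \qquad (336) $$

$$ \bar{\varepsilon}_{sym} \approx \beta (N - Z)^2 A^{-2} \cos^{-2}\theta_a \cos^2\theta_z \exp[j(\theta_\beta + 2\theta_z - 2\theta_a)] \qquad (337) $$

As a further approximation let $\theta_z \approx \theta_n \approx \theta_a$, which follows from equation (88) for small values of $\theta_z \approx \theta_n$; then equations (336) and (337) become

$$ \bar{E}_{sym} \approx \beta (N - Z)^2 A^{-1} \cos\theta_a \exp[j(\theta_\beta + \theta_a)] \qquad (338) $$

$$ \bar{\varepsilon}_{sym} \approx \beta (N - Z)^2 A^{-2} \exp(j\theta_\beta) \qquad (339) $$

Equations (338) and (339) are valid in the vicinity of the valley of beta stability where $\theta_z \approx \theta_n \approx \theta_a$.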

The complex number symmetry energy coefficient can be written as a generalization of a corresponding scalar form as follows (ref. 68)

$$ \bar\beta = \bar{e}_2 \bar{k}_c^2 - \bar{e}_3 \bar{k}_c^3 + \bar{e}_5 \bar{k}_c^5 + \bar\beta_{ex} \qquad (340) $$

where

$$ \bar{e}_2 = e_2 \exp(j\theta_{e2}) \qquad \bar{e}_3 = e_3 \exp(j\theta_{e3}) \qquad (341) $$

$$ \bar{e}_5 = e_5 \exp(j\theta_{e5}) \qquad \bar\beta_{ex} = \beta_{ex} \exp(j\theta_{\beta ex}) \qquad (342) $$

where $\bar\beta_{ex}$ = complex number exchange energy contribution to the quadratic symmetry energy coefficient. The values of $\beta$ and $\theta_\beta$ can be determined from the real and imaginary components of equation (340), which are given by

$$ \beta \cos\theta_\beta = e_2 k_c^2 \cos(\theta_{e2} + 2\theta_{kc}) - e_3 k_c^3 \cos(\theta_{e3} + 3\theta_{kc}) + e_5 k_c^5 \cos(\theta_{e5} + 5\theta_{kc}) + \beta_{ex} \cos\theta_{\beta ex} \qquad (343) $$

$$ \beta \sin\theta_\beta = e_2 k_c^2 \sin(\theta_{e2} + 2\theta_{kc}) - e_3 k_c^3 \sin(\theta_{e3} + 3\theta_{kc}) + e_5 k_c^5 \sin(\theta_{e5} + 5\theta_{kc}) + \beta_{ex} \sin\theta_{\beta ex} \qquad (344) $$

In general $\theta_{e2} = 0$ because this term arises from the kinetic energy of a noninteracting Fermi gas; however, $\theta_{e3} \neq 0$ and $\theta_{e5} \neq 0$ because the cubic and fifth order terms arise from nuclear potentials that are complex numbers. In more complicated forms of the nuclear mass formula the symmetry energy is separated into volume and surface components (ref. 68).

C.  Measured  Binding  Energies  of  Subactinide  Nuclei  Located 
in  an  External  Field. 

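The real and imaginary parts of equation (292) are given by

$$ \begin{aligned} B \cos\theta_B = {} & \alpha A \cos\theta_a \cos(\theta_\alpha + \theta_a) - \gamma A^{2/3} \cos^{2/3}\theta_a \cos(\theta_\gamma + \tfrac{2}{3}\theta_a) \\ & - \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a \cos(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a) \\ & - \beta \xi^2 A^{-1} \cos^{-1}\theta_a \cos(\theta_\beta + 2\theta_\xi - \theta_a) \\ & + E_{pair} \cos\theta_{Epair} + E_{shell} \cos\theta_{Eshell} \end{aligned} \qquad (345) $$

$$ \begin{aligned} B \sin\theta_B = {} & \alpha A \cos\theta_a \sin(\theta_\alpha + \theta_a) - \gamma A^{2/3} \cos^{2/3}\theta_a \sin(\theta_\gamma + \tfrac{2}{3}\theta_a) \\ & - \delta Z^2 A^{-1/3} \cos^2\theta_z \cos^{-1/3}\theta_a \sin(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a) \\ & - \beta \xi^2 A^{-1} \cos^{-1}\theta_a \sin(\theta_\beta + 2\theta_\xi - \theta_a) \\ & + E_{pair} \sin\theta_{Epair} + E_{shell} \sin\theta_{Eshell} \end{aligned} \qquad (346) $$

Equations (345) and (346) immediately determine $B$ and $\theta_B$. The measured binding energy is just the real part of the complex number binding energy, so that

$$ B_m = \alpha_m A - \gamma_m A^{2/3} - \delta_m Z^2 A^{-1/3} - \beta'_m \xi^2 A^{-1} + E^m_{pair} + E^m_{shell} \qquad (347) $$

where for subactinide nuclei

$$ \alpha_m = \alpha \cos\theta_a \cos(\theta_\alpha + \theta_a) \qquad (348) $$

$$ \gamma_m = \gamma \cos^{2/3}\theta_a \cos(\theta_\gamma + \tfrac{2}{3}\theta_a) \qquad (349) $$

$$ \delta_m = \delta \cos^2\theta_z \cos^{-1/3}\theta_a \cos(\theta_\delta + 2\theta_z - \tfrac{1}{3}\theta_a) \qquad (350) $$

$$ \beta'_m = \beta \cos^{-1}\theta_a \cos(\theta_\beta + 2\theta_\xi - \theta_a) \qquad (351) $$

where $\xi = \xi(Z,N,\theta_z,\theta_n)$ is defined by equation (298), and $\theta_\xi = \theta_\xi(Z,N,\theta_z,\theta_n)$ is defined by equation (299). The internal phase angle $\theta_a$ is given by equation (88) to be $\theta_a = \theta_a(Z,N,\theta_z,\theta_n)$.

If the approximation $\theta_\xi \approx \theta_z \approx \theta_n$ is assumed, then the approximations in equations (335) and (336) allow equation (347) to be written as

$$ B_m = \alpha_m A - \gamma_m A^{2/3} - \delta_m Z^2 A^{-1/3} - \beta_m (N - Z)^2 A^{-1} + E^m_{pair} + E^m_{shell} \qquad (352) $$

where now

$$ \beta_m = \beta \cos^2\theta_z \cos^{-1}\theta_a \cos(\theta_\beta + 2\theta_z - \theta_a) \qquad (353) $$

If the further approximation $\theta_n \approx \theta_z \approx \theta_a$ is made, then equation (352) is the measured binding energy with

$$ \alpha_m = \alpha \cos\theta_a \cos(\theta_\alpha + \theta_a) \qquad (354) $$

$$ \gamma_m = \gamma \cos^{2/3}\theta_a \cos(\theta_\gamma + \tfrac{2}{3}\theta_a) \qquad (355) $$

$$ \delta_m = \delta \cos^{5/3}\theta_a \cos(\theta_\delta + \tfrac{5}{3}\theta_a) \qquad (356) $$

$$ \beta_m = \beta \cos\theta_a \cos(\theta_\beta + \theta_a) \qquad (357) $$

which are useful for nuclei near the valley of beta stability. The measured values of the symmetry energy coefficients are (refs. 60, 61)

$$ \alpha_m = 15.5 \qquad \gamma_m = 17.2 \qquad \delta_m = 0.698 \qquad \beta_m = 23.3 \ \mathrm{MeV} \qquad (358) $$

Equations (354) through (357) show that

$$ \alpha_m < \alpha \qquad \gamma_m < \gamma \qquad \delta_m < \delta \qquad \beta_m < \beta \qquad (359) $$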
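The reduction from intrinsic to measured coefficients in equations (354) through (357) can be sketched numerically as follows. The intrinsic magnitudes and all phase angles used here are placeholders chosen only for illustration (the paper does not give them); the magnitudes simply reuse the numbers of equation (358).

```python
import math


def measured_coefficients(alpha, gamma, delta, beta,
                          th_alpha, th_gamma, th_delta, th_beta, th_a):
    """Measured mass formula coefficients for theta_z ~ theta_n ~ theta_a."""
    c = math.cos(th_a)  # assumed positive so the fractional powers are real
    alpha_m = alpha * c * math.cos(th_alpha + th_a)                      # (354)
    gamma_m = gamma * c ** (2 / 3) * math.cos(th_gamma + 2 / 3 * th_a)   # (355)
    delta_m = delta * c ** (5 / 3) * math.cos(th_delta + 5 / 3 * th_a)   # (356)
    beta_m = beta * c * math.cos(th_beta + th_a)                         # (357)
    return alpha_m, gamma_m, delta_m, beta_m


# Hypothetical intrinsic values (MeV) and phase angles (rad), for illustration.
intrinsic = (15.5, 17.2, 0.698, 23.3)
measured = measured_coefficients(*intrinsic, 0.02, 0.02, 0.02, 0.02, 0.10)
for name, x, x_m in zip(("alpha", "gamma", "delta", "beta"), intrinsic, measured):
    print(f"{name}: intrinsic {x:.4f}, measured {x_m:.4f}")
```

For nonzero phase angles every measured value falls below its intrinsic magnitude, which is the content of equation (359); with all angles set to zero the two sets coincide.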


The values of the nuclear mass formula parameters $\theta_z$, $\theta_n$, $\theta_a$, $\alpha$, $\theta_\alpha$, $\gamma$, $\theta_\gamma$, $\delta$, $\theta_\delta$, $\beta$ and $\theta_\beta$ can be obtained by fitting the real part of the complex number binding energy given by equation (347) to the measured values of the atomic masses of the elements. A simplified procedure uses the approximation $\theta_z \approx \theta_n$ and equation (352) for the fit to atomic masses. Expressions for the atomic masses of the elements will now be considered.

D.  Masses  of  Atoms  Located  in  an  Electromagnetic  or  Gravitational  Field. 

The conventional relationship between atomic mass and nuclear binding energy is written as (ref. 61)

$$ M = Z m_H + N m_n - B \qquad (360) $$

where $M$ = atomic mass of an element, $m_H$ = mass of hydrogen atom and $m_n$ = neutron mass. In the presence of an external field the atomic mass is a complex number in an internal space and is given by

$$ \bar{M} = \bar{z} m_H + \bar{n} m_n - \bar{B} \qquad (361) $$

where the complex number atomic mass is written as

$$ \bar{M} = M \exp(j\theta_M) \qquad (362) $$

Using equations (76), (77), (79) and (80) allows the real and imaginary parts of equation (361) to be written as

$$ M \cos\theta_M = G \qquad (363) $$

$$ M \sin\theta_M = F \qquad (364) $$

where for subactinide nuclei

$$ F = m_H Z \cos\theta_z \sin\theta_z + m_n N \cos\theta_n \sin\theta_n - B \sin\theta_B \qquad (365) $$

$$ G = m_H Z \cos^2\theta_z + m_n N \cos^2\theta_n - B \cos\theta_B \qquad (366) $$

Equations (363) through (366) can be used to obtain $M$ and $\theta_M$ as

$$ \tan\theta_M = F/G \qquad (367) $$

$$ \begin{aligned} M^2 = F^2 + G^2 = {} & B^2 + m_H^2 Z^2 \cos^2\theta_z + m_n^2 N^2 \cos^2\theta_n \\ & + 2 m_H m_n Z N \cos\theta_z \cos\theta_n \cos(\theta_z - \theta_n) \\ & - 2 m_H B Z \cos\theta_z \cos(\theta_z - \theta_B) \\ & - 2 m_n B N \cos\theta_n \cos(\theta_n - \theta_B) \end{aligned} \qquad (368) $$

The measured atomic mass is given by equation (363) which can be rewritten as

$$ M_m = m_H Z \cos^2\theta_z + m_n N \cos^2\theta_n - B_m \qquad (369) $$
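Equations (361) through (369) can be checked with ordinary complex arithmetic. The sketch below assumes the subactinide forms $z = Z\cos\theta_z$ and $n = N\cos\theta_n$ for the magnitudes (the form attributed to equation (67) later in the text), uses approximate standard rest masses, and takes an illustrative binding energy and illustrative phase angles that are not from the paper.

```python
import cmath
import math

M_H = 938.783   # hydrogen atom mass in MeV (approximate, assumed)
M_N = 939.565   # neutron mass in MeV (approximate, assumed)


def complex_atomic_mass(Z, N, B, th_z, th_n, th_B):
    """Complex atomic mass of equation (361)."""
    z_bar = cmath.rect(Z * math.cos(th_z), th_z)   # assumed: z = Z cos(theta_z)
    n_bar = cmath.rect(N * math.cos(th_n), th_n)   # assumed: n = N cos(theta_n)
    B_bar = cmath.rect(B, th_B)
    return M_H * z_bar + M_N * n_bar - B_bar


def mass_squared(Z, N, B, th_z, th_n, th_B):
    """Right-hand side of equation (368)."""
    cz, cn = math.cos(th_z), math.cos(th_n)
    return (B ** 2 + (M_H * Z * cz) ** 2 + (M_N * N * cn) ** 2
            + 2 * M_H * M_N * Z * N * cz * cn * math.cos(th_z - th_n)
            - 2 * M_H * B * Z * cz * math.cos(th_z - th_B)
            - 2 * M_N * B * N * cn * math.cos(th_n - th_B))


# Illustrative input: Z = 26, N = 30, B = 492.3 MeV, small arbitrary angles.
args = (26, 30, 492.3, 0.10, 0.12, 0.08)
M_bar = complex_atomic_mass(*args)
M, theta_M = cmath.polar(M_bar)      # magnitude and phase, equation (362)
M_measured = M_bar.real              # equation (363): M cos(theta_M)
print(f"M = {M:.1f} MeV, theta_M = {theta_M:.4f} rad, M_m = {M_measured:.1f} MeV")
```

The squared magnitude from the complex sum agrees with the expanded form (368), and with all phase angles set to zero the result reduces to the conventional relation (360).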

The approximation $\theta_z \approx \theta_n \approx \theta_B$ combined with equations (361) through (369) gives

$$ \theta_M \approx \theta_z \approx \theta_n \approx \theta_B \qquad (370) $$

$$ M \approx (m_H Z + m_n N)\cos\theta_z - B \qquad (371) $$

$$ M_m \approx (m_H Z + m_n N)\cos^2\theta_z - B_m \qquad (372) $$

where $\theta_a$ is given by equations (88) and (89A) within this approximation. Note that $M$, $M_m$ and $\theta_M$ vary with the strength of the applied electromagnetic field because $\theta_z = \theta_z(H)$ in equations (370) through (372), but the following intrinsic mass is a constant independent of the applied electromagnetic (or gravitational) field

$$ m_H Z + m_n N = \mathrm{constant} \qquad (373) $$

and represents the universal law of the conservation of rest mass and baryon number. For the case $\theta_z \approx \theta_n \approx \theta_a \approx \theta_B$ the mass relations in equations (370) through (372) become for subactinide nuclei

$$ \theta_M \approx \theta_a \qquad (374) $$

$$ M \approx (m_H Z + m_n N)\cos\theta_a - B(Z,N,\theta_a) \qquad (375) $$

$$ M_m \approx (m_H Z + m_n N)\cos^2\theta_a - B_m(Z,N,\theta_a) \qquad (376) $$

which are valid near the valley of beta stability of subactinide nuclei.


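The variation of the measured atomic mass and the magnitude of the atomic mass with the strength of the external electromagnetic field can be obtained from equations (368) and (369) by the following formulas

$$ dM_m/dH = \partial M_m/\partial\theta_z \, d\theta_z/dH + \partial M_m/\partial\theta_n \, d\theta_n/dH + \partial M_m/\partial\theta_a \, d\theta_a/dH \qquad (377) $$

$$ dM/dH = \partial M/\partial\theta_z \, d\theta_z/dH + \partial M/\partial\theta_n \, d\theta_n/dH + \partial M/\partial\theta_a \, d\theta_a/dH \qquad (378) $$

The internal phase angle $\theta_a = \theta_a(\theta_z,\theta_n,Z,N)$ is given by equation (88) so that the derivative $d\theta_a/dH$ can be evaluated as

$$ d\theta_a/dH = \partial\theta_a/\partial\theta_z \, d\theta_z/dH + \partial\theta_a/\partial\theta_n \, d\theta_n/dH \qquad (379) $$

Therefore equations (377) and (378) can be written as

$$ dM_m/dH = (\partial M_m/\partial\theta_z + \partial M_m/\partial\theta_a \, \partial\theta_a/\partial\theta_z)\, d\theta_z/dH + (\partial M_m/\partial\theta_n + \partial M_m/\partial\theta_a \, \partial\theta_a/\partial\theta_n)\, d\theta_n/dH \qquad (380) $$

$$ dM/dH = (\partial M/\partial\theta_z + \partial M/\partial\theta_a \, \partial\theta_a/\partial\theta_z)\, d\theta_z/dH + (\partial M/\partial\theta_n + \partial M/\partial\theta_a \, \partial\theta_a/\partial\theta_n)\, d\theta_n/dH \qquad (381) $$

where for example equation (369) gives for subactinide nuclei

$$ \partial M_m/\partial\theta_z = -2 m_H Z \cos\theta_z \sin\theta_z - \partial B_m/\partial\theta_z \qquad (382) $$

$$ \partial M_m/\partial\theta_n = -2 m_n N \cos\theta_n \sin\theta_n - \partial B_m/\partial\theta_n \qquad (383) $$

$$ \partial M_m/\partial\theta_a = -\partial B_m/\partial\theta_a \qquad (384) $$

where $B_m$ is given by equation (347).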


For the approximate case $\theta_z \approx \theta_n \approx \theta_a$, which follows from equation (88) for small arguments, it follows that

$$ dM_m/dH = dM_m/d\theta_a \, d\theta_a/dH \qquad (385) $$

$$ dM/dH = dM/d\theta_a \, d\theta_a/dH \qquad (386) $$

where equations (375) and (376) give for $\theta_z \approx \theta_n \approx \theta_a$ and for subactinide nuclei

$$ dM_m/d\theta_a = -2(m_H Z + m_n N)\cos\theta_a \sin\theta_a - dB_m/d\theta_a \qquad (387) $$

$$ dM/d\theta_a = -(m_H Z + m_n N)\sin\theta_a - dB/d\theta_a \qquad (388) $$

where the approximate value of $B_m$ given by equations (352) and (354) through (357) is used in conjunction with equation (387), while the approximate value of $B$ given by equation (310) is used in conjunction with equation (388). Therefore from equation (352) and within the approximation $\theta_z \approx \theta_n \approx \theta_a$, the derivative in equation (387) is given by

$$ dB_m/d\theta_a = d\alpha_m/d\theta_a \, A - d\gamma_m/d\theta_a \, A^{2/3} - d\delta_m/d\theta_a \, Z^2 A^{-1/3} - d\beta_m/d\theta_a \, (N - Z)^2 A^{-1} + dE^m_{pair}/d\theta_a + dE^m_{shell}/d\theta_a \qquad (389) $$

where from equations (354) through (357) for $\theta_z \approx \theta_n \approx \theta_a$ in subactinide nuclei

$$ d\alpha_m/d\theta_a = -\alpha[\sin\theta_a \cos(\theta_\alpha + \theta_a) + \cos\theta_a \sin(\theta_\alpha + \theta_a)] \qquad (390) $$

$$ d\gamma_m/d\theta_a = -\tfrac{2}{3}\gamma[\cos^{-1/3}\theta_a \sin\theta_a \cos(\theta_\gamma + \tfrac{2}{3}\theta_a) + \cos^{2/3}\theta_a \sin(\theta_\gamma + \tfrac{2}{3}\theta_a)] \qquad (391) $$

$$ d\delta_m/d\theta_a = -\tfrac{5}{3}\delta[\cos^{2/3}\theta_a \sin\theta_a \cos(\theta_\delta + \tfrac{5}{3}\theta_a) + \cos^{5/3}\theta_a \sin(\theta_\delta + \tfrac{5}{3}\theta_a)] \qquad (392) $$

$$ d\beta_m/d\theta_a = -\beta[\sin\theta_a \cos(\theta_\beta + \theta_a) + \cos\theta_a \sin(\theta_\beta + \theta_a)] \qquad (393) $$

For the approximation $\theta_z = \theta_n = \theta_a$, the derivative of the magnitude of the binding energy that appears in equation (388) is obtained from equation (310) to be

$$ dB/d\theta_a = \sin\theta_a[-\alpha A + \tfrac{2}{3}\gamma A^{2/3} \cos^{-1/3}\theta_a + \tfrac{5}{3}\delta Z^2 A^{-1/3} \cos^{2/3}\theta_a + \beta (N - Z)^2 A^{-1}] + dE_{pair}/d\theta_a + dE_{shell}/d\theta_a \qquad (394) $$

From equations (387) through (394) it follows that for subactinide nuclei

$$ dB_m/d\theta_a < 0 \qquad dB/d\theta_a < 0 \qquad (395) $$

$$ dM_m/d\theta_a < 0 \qquad dM/d\theta_a < 0 \qquad (396) $$

For example, within the approximation $\theta_z = \theta_n = \theta_a$ these equations give

$$ dM/d\theta_a = -\sin\theta_a[m_H Z + m_n N - \alpha A + \tfrac{2}{3}\gamma A^{2/3} \cos^{-1/3}\theta_a + \tfrac{5}{3}\delta Z^2 A^{-1/3} \cos^{2/3}\theta_a + \beta (N - Z)^2 A^{-1}] - dE_{pair}/d\theta_a - dE_{shell}/d\theta_a \qquad (397) $$

which is a negative number because of the dominant contribution of the rest mass terms.

E.  Valley  of  Beta  Stability  for  Nuclei  with  Broken  Internal  Symmetries. 

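Radioactive beta decays require that the complex atomic number $\bar{z}$ adjusts itself so as to minimize the binding energy of a nucleus given by equation (292) but subject to the constraints represented in equations (82) and (84) (ref. 66). Combining equations (82) and (292) gives after neglecting shell and pairing energy effects

$$ \bar{B} = \bar\alpha \bar{a} - \bar\gamma \bar{a}^{2/3} - \bar\delta \bar{z}^2/\bar{a}^{1/3} - \bar\beta (\bar{a} - 2\bar{z} + \bar{W})^2/\bar{a} \qquad (398) $$

The minimum binding energy condition is

$$ \partial\bar{B}/\partial\bar{z}\,|_{\bar{a}} = -2\bar\delta \bar{z}/\bar{a}^{1/3} + 4\bar\beta (\bar{a} - 2\bar{z} + \bar{W})/\bar{a} = 0 \qquad (399) $$

which gives using equation (82) with $\bar{W} = 0$

$$ \bar{z}^{vs} = \tfrac{1}{2}\bar{a}\,(1 + \bar{c}\bar{a}^{2/3})^{-1} \qquad (400) $$

$$ \bar{n}^{vs} = \tfrac{1}{2}\bar{a}\,(1 + 2\bar{c}\bar{a}^{2/3})(1 + \bar{c}\bar{a}^{2/3})^{-1} \qquad (401) $$

where vs = valley of beta stability, and where

$$ \bar{c} = \bar\delta/(4\bar\beta) \qquad c = \delta/(4\beta) \qquad \theta_c = \theta_\delta - \theta_\beta \qquad (402) $$

For relatively small nuclei the beta stability condition is

$$ \bar{z}^{vs} = \bar{a}/2 \qquad z^{vs} = a/2 \qquad \theta_z^{vs} = \theta_a \qquad (403) $$

$$ \bar{n}^{vs} = \bar{a}/2 \qquad n^{vs} = a/2 \qquad \theta_n^{vs} = \theta_a \qquad (404) $$

Combining equations (40), (41) and (404) gives the beta stability condition for light nuclei as

$$ Z^{vs} = A/2 \qquad N^{vs} = A/2 \qquad (405) $$

which is the standard beta stability condition for small nuclei (ref. 65). Therefore for light nuclei, the standard scalar theory given by equation (405) is valid because $\theta_z^{vs} = \theta_a$. For medium weight nuclei the following approximation to equations (400) and (401) can be used

$$ \bar{z}^{vs} = \tfrac{1}{2}\bar{a}\,(1 - \bar{c}\bar{a}^{2/3}) \qquad (406) $$

$$ \bar{n}^{vs} = \tfrac{1}{2}\bar{a}\,(1 + \bar{c}\bar{a}^{2/3}) \qquad (407) $$

where $\bar{c}$ is given by equation (402), and where $\bar{W} = 0$.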

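For heavy nuclei the exact equation (400) must be used to calculate $\bar{z}^{vs}$. Equation (400) can be written as

$$ \bar{z}^{vs} = (G + jF)/D \qquad (408) $$

$$ \tan\theta_z^{vs} = F/G \qquad z^{vs} = (G^2 + F^2)^{1/2}/D \qquad (409) $$

where

$$ G = \tfrac{1}{2} a \cos\theta_a [1 + c a^{2/3} \cos(\theta_c + \tfrac{2}{3}\theta_a)] + \tfrac{1}{2} c\, a^{5/3} \sin\theta_a \sin(\theta_c + \tfrac{2}{3}\theta_a) \qquad (410) $$

$$ F = \tfrac{1}{2} a \sin\theta_a [1 + c a^{2/3} \cos(\theta_c + \tfrac{2}{3}\theta_a)] - \tfrac{1}{2} c\, a^{5/3} \cos\theta_a \sin(\theta_c + \tfrac{2}{3}\theta_a) \qquad (411) $$

$$ D = [1 + c a^{2/3} \cos(\theta_c + \tfrac{2}{3}\theta_a)]^2 + c^2 a^{4/3} \sin^2(\theta_c + \tfrac{2}{3}\theta_a) \qquad (412) $$

Combining equations (67) and (410) through (412) gives for subactinide nuclei

$$ G = \tfrac{1}{2} A \cos^2\theta_a [1 + c A^{2/3} \cos^{2/3}\theta_a \cos(\theta_c + \tfrac{2}{3}\theta_a)] + \tfrac{1}{2} c A^{5/3} \cos^{5/3}\theta_a \sin\theta_a \sin(\theta_c + \tfrac{2}{3}\theta_a) \qquad (413) $$

$$ F = \tfrac{1}{2} A \cos\theta_a \sin\theta_a [1 + c A^{2/3} \cos^{2/3}\theta_a \cos(\theta_c + \tfrac{2}{3}\theta_a)] - \tfrac{1}{2} c A^{5/3} \cos^{8/3}\theta_a \sin(\theta_c + \tfrac{2}{3}\theta_a) \qquad (414) $$

$$ D = [1 + c A^{2/3} \cos^{2/3}\theta_a \cos(\theta_c + \tfrac{2}{3}\theta_a)]^2 + c^2 A^{4/3} \cos^{4/3}\theta_a \sin^2(\theta_c + \tfrac{2}{3}\theta_a) \qquad (415) $$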
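The decomposition of equation (400) into the real functions $G$, $F$ and $D$ of equations (410) through (412) can be verified numerically. The sketch below is only a consistency check: the value of $c$ is formed from the measured coefficients of equation (358) as a stand-in for the intrinsic ones, and the magnitude $a$ and both phase angles are arbitrary illustrative inputs.

```python
import cmath
import math


def valley_z_complex(a, th_a, c, th_c):
    """Equation (400) evaluated directly with complex numbers (W = 0)."""
    a_bar = cmath.rect(a, th_a)
    c_bar = cmath.rect(c, th_c)
    # Principal branch of the 2/3 power; adequate for small phase angles.
    return 0.5 * a_bar / (1 + c_bar * a_bar ** (2 / 3))


def valley_GFD(a, th_a, c, th_c):
    """Real functions G, F, D of equations (410) through (412)."""
    phi = th_c + 2 / 3 * th_a
    u = 1 + c * a ** (2 / 3) * math.cos(phi)
    s = math.sin(phi)
    G = a / 2 * math.cos(th_a) * u + c / 2 * a ** (5 / 3) * math.sin(th_a) * s
    F = a / 2 * math.sin(th_a) * u - c / 2 * a ** (5 / 3) * math.cos(th_a) * s
    D = u ** 2 + c ** 2 * a ** (4 / 3) * s ** 2
    return G, F, D


c = 0.698 / (4 * 23.3)            # delta / (4 beta), stand-in magnitudes
a, th_a, th_c = 100.0, 0.10, 0.03  # illustrative inputs only
z_direct = valley_z_complex(a, th_a, c, th_c)
G, F, D = valley_GFD(a, th_a, c, th_c)
z_from_GFD = complex(G, F) / D                     # equation (408)
theta_z_vs = math.atan2(F, G)                      # equation (409)
print(z_direct, z_from_GFD, theta_z_vs)
```

Both routes give the same complex number, and the phase from $\tan\theta_z^{vs} = F/G$ matches the phase of the direct complex result.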


If $A$ and $\theta_a$ are taken to be the known quantities, then equation (82) involves five unknown quantities $W$, $Z$, $N$, $\theta_z$ and $\theta_n$. The complex number equation (82) and the scalar equation (84) supply three equations for determining the five unknown quantities. The complex number valley of beta stability equation (399) supplies two additional equations and so a complete solution is possible. From equations (409) and (413) through (415) that describe the valley of beta stability for the general case of nuclei of arbitrary size it follows that in general

$$ \theta_z^{vs} = \theta_z^{vs}(A,\theta_a) \qquad Z^{vs} = Z^{vs}(A,\theta_a) \qquad (416) $$

Combining equations (67) and (409) gives for subactinide nuclei

$$ Z^{vs} = z^{vs}/\cos\theta_z^{vs} = (G^2 + F^2)/(DG) \qquad (417) $$

$$ N^{vs} = A - Z^{vs} = A - (G^2 + F^2)/(DG) \qquad (418) $$

The value of $\theta_n^{vs}$ is obtained exactly from equation (82) which is written in the form

$$ \bar{n}^{vs} = \bar{a} + \bar{W}^{vs} - \bar{z}^{vs} \qquad (419) $$

but it is most easily obtained approximately by taking $\bar{W}^{vs} = 0$ with the result

$$ \tan\theta_n^{vs} = (A \cos\theta_a \sin\theta_a - Z^{vs} \cos\theta_z^{vs} \sin\theta_z^{vs})(A \cos^2\theta_a - Z^{vs} \cos^2\theta_z^{vs})^{-1} \qquad (420) $$

where $\theta_z^{vs}$ and $Z^{vs}$ are given by equations (409). The measured values of the atomic number, neutron number and atomic mass number in the valley of beta stability are given for subactinide nuclei by

$$ z_m^{vs} = Z^{vs} \cos^2\theta_z^{vs} \qquad n_m^{vs} = N^{vs} \cos^2\theta_n^{vs} \qquad a_m = A \cos^2\theta_a \qquad (421) $$

which are generally not integers.

Consider now the determination of the specific nucleus within the valley of beta stability that has the greatest average binding energy per nucleon when an external field is present. Combining equations (304) and (403) gives the average binding energy per nucleon for nuclei within the valley of beta stability approximately as

$$ \bar\varepsilon^{vs} = \bar\alpha - \bar\gamma/\bar{a}^{1/3} - \tfrac{1}{4}\bar\delta \bar{a}^{2/3} + \bar\varepsilon_{pair}(\bar{a}) + \bar\varepsilon_{shell}(\bar{a}) \qquad (422) $$

Then the nucleus having the greatest average binding energy per nucleon after neglecting nuclear shell and pairing effects is given by

$$ d\bar\varepsilon^{vs}/d\bar{a} = \tfrac{1}{3}\bar\gamma/\bar{a}^{4/3} - \tfrac{1}{6}\bar\delta/\bar{a}^{1/3} = 0 \qquad (423) $$

which gives approximately

$$ \bar{a}^{gb} = 2\bar\gamma/\bar\delta \qquad (424) $$

$$ a^{gb} = 2\gamma/\delta \qquad \theta_a^{gb} = \theta_\gamma - \theta_\delta \qquad (425) $$

where gb = greatest binding. From equation (425) it follows that approximately

$$ A^{gb} \cos\theta_a^{gb} = 2\gamma/\delta \qquad (426) $$

or equivalently

$$ A^{gb} = (2\gamma/\delta)\sec(\theta_\gamma - \theta_\delta) \qquad (427) $$

The conventional prediction of the most stable nucleus is (refs. 59-66)

$$ A_c^{gb} = 2\gamma/\delta \qquad (428) $$

and therefore for subactinide nuclei

$$ A^{gb} = A_c^{gb} \sec(\theta_\gamma - \theta_\delta) > A_c^{gb} \qquad (429\text{-}444) $$

The peak in the average binding energy per nucleon curve for nuclei in an external electromagnetic or gravitational field is shifted to a higher value of atomic mass number than is predicted by conventional calculations, but this effect is small if $\theta_\gamma \approx \theta_\delta$. The calculation of the nucleus of greatest average binding energy per nucleon within the valley of beta stability has been done using the approximation given in equations (403) and (404), for which case the nuclear symmetry energy term vanishes. An exact calculation requires the use of equations (400) and (401), for which the symmetry energy term in equation (304) does not vanish as it does in equation (422). The inclusion of the symmetry energy term in the calculation of the subactinide nucleus of greatest binding within the valley of beta stability is algebraically difficult and will not be done in this paper.
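The size of the shift described by equations (427) and (428) can be estimated with a short calculation. In this sketch the coefficient magnitudes are the measured values of equation (358), used as stand-ins for the intrinsic ones, and the phase difference $\theta_\gamma - \theta_\delta$ is a hypothetical input.

```python
import math

GAMMA_MEV = 17.2    # surface coefficient, equation (358), used as a stand-in
DELTA_MEV = 0.698   # Coulomb coefficient, equation (358), used as a stand-in


def greatest_binding_mass_number(phase_difference):
    """A^gb of equation (427) for a given theta_gamma - theta_delta (rad)."""
    return (2 * GAMMA_MEV / DELTA_MEV) / math.cos(phase_difference)


conventional = greatest_binding_mass_number(0.0)   # equation (428)
shifted = greatest_binding_mass_number(0.2)        # hypothetical 0.2 rad
print(f"conventional A = {conventional:.2f}, shifted A = {shifted:.2f}")
```

Because $\sec x \ge 1$, any nonzero phase difference moves the peak to a larger mass number, and the shift is second order in the phase difference when it is small.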

This section suggests that a nuclear mass formula which is suitable for subactinide nuclei in an external electromagnetic or gravitational field must include complex number values of the atomic number and atomic mass number and complex values of the volume, surface, Coulomb, symmetry, pairing and shell energy coefficients. Nuclear properties such as neutron binding energies and neutron capture and fission cross sections will also be complex numbers in an internal space, and will depend on the strength of the external field because the magnitudes and phase angles of all terms of a nuclear mass formula are affected by the external field. The measured nuclear properties will correspond to the calculated real values of the complex number nuclear properties. The particular forms chosen for the complex number representations of the atomic number, neutron number and atomic mass number refer generally speaking only to subactinide nuclei. More precisely, the forms chosen in equation (67) refer only to nuclei for which $\chi < 1$ where $\chi$ = fissility parameter, and this condition is generally valid for subactinide nuclei. A forthcoming paper will treat the case $\chi \gtrsim 1$ which begins to occur in the actinides.

5. LOW ENERGY FISSION OF LIGHTER THAN ACTINIDE NUCLEI IN AN ELECTROMAGNETIC FIELD. This section derives the conditions required for the spontaneous or thermal neutron induced fission of lighter than actinide nuclei (subactinide nuclei) that are located in an electromagnetic field, determines the magnitude of the internal phase angle of the atomic number that is required to bring a subactinide nucleus into a state of incipient fission, and gives a simple relationship between this critical internal phase angle and the strength of the electromagnetic field required to catalyze the clean fission of subactinide nuclei. Under zero field conditions spontaneous or thermal neutron induced fission occurs only in some of the actinide and transactinide elements but does not occur in the subactinide elements (refs. 69-80). Some of these heavy elements such as $^{235}$U and $^{239}$Pu are used in conventional nuclear reactors. The possibility of thermal neutron induced fission in elements lighter than the actinides in the presence of an electromagnetic field will have practical applications to the design of clean fission nuclear reactors because the fission products will be relatively light elements which are either stable against beta decay or are only low level beta emitters. An atomic nucleus is a quantum many-body system which has both collective effects such as volume and surface energies, and independent particle motion effects such as shell structure. Both collective motion and independent particle motion are important to the description of the fission process (refs. 69-80). For instance, the shell structure determines the nature of the doubly humped fission barrier (refs. 69-80). In this paper only the simple collective terms such as volume, surface, Coulomb and symmetry energies are considered for the description of the catalysis of thermal neutron induced fission in the subactinide nuclei by an applied electromagnetic field, which in this case must be a $\gamma$ ray field.

A.  Bohr-Wheeler  Fission  Condition  for  Nuclei  in  an  Electromagnetic  Field. 

The standard Bohr-Wheeler analysis for spontaneous or thermal neutron induced nuclear fission utilizes the fissility parameter which is defined by (refs. 69-80)

$$ \chi = (Z^2/A)(\kappa\gamma/\delta)^{-1} \qquad (445) $$

and the spontaneous and thermal neutron induced fission condition is written for actinide nuclei as (refs. 69-80)

$$ \chi > 1 \qquad Z^2/A > \kappa\gamma/\delta \qquad (446) $$

where $\gamma$ and $\delta$ = surface and Coulomb energy coefficients that appear in the liquid drop nuclear mass formula treated in Section 4, and where theoretically for spontaneous fission

$$ \kappa = g/h = 2 \qquad (447) $$

where $g = 2/5$ and $h = 1/5$ are the second order series expansion coefficients of the surface and Coulomb energies respectively when these terms are expanded in terms of an ellipsoidal deformation parameter (refs. 69-80). The values of $\kappa$, $\gamma$ and $\delta$ along with the other mass formula parameters are determined empirically (refs. 69-80). The values of $\kappa$ are different for spontaneous and for thermal neutron induced fission, and in fact $\kappa$ is dependent on the energy of the incident neutrons (refs. 69-80). For thermal neutron induced fission (refs. 69-80)

$$ \kappa = 1.471 \qquad (448) $$

Choosing $\gamma = 17.2$ MeV and $\delta = 0.698$ MeV yields the following fission conditions for a zero value of the externally applied field (refs. 69-80)

$$ Z^2/A > 49.28 \qquad \text{spontaneous fission} \qquad (449) $$

$$ Z^2/A > 36.25 \qquad \text{thermal neutron induced fission} \qquad (450) $$

These inequalities show that, loosely speaking, only the actinides and transactinides can undergo spontaneous or thermal neutron induced fission, but not all of these heavy elements undergo fission. Within this group of heavy elements the more neutron rich isotopes tend to be more stable against fission; for example $^{238}$U is stable against thermal neutron induced fission but $^{235}$U is fissile. The empirical value of $\kappa$ that describes thermal neutron induced fission will depend on the values selected for the mass formula parameters $\gamma$ and $\delta$. In general $\kappa$ can be taken to be a decreasing function of the kinetic energy of the incident neutrons. The fission criteria presented above ignore all shell structure effects and are therefore approximate relations which show only general behavior and for which counterexamples can always be found in the border region between fissile and non-fissile nuclei.
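The thresholds in equations (449) and (450) follow directly from equations (445) through (448) and can be reproduced with a few lines of arithmetic. In the sketch below the function names are illustrative, and the two nuclei are chosen only as examples of a heavy and a light case.

```python
GAMMA_MEV = 17.2          # surface energy coefficient used in the text
DELTA_MEV = 0.698         # Coulomb energy coefficient used in the text
KAPPA_SPONTANEOUS = 2.0   # equation (447)
KAPPA_THERMAL = 1.471     # equation (448)


def fission_threshold(kappa):
    """Zero-field threshold on Z^2/A from equation (446): kappa*gamma/delta."""
    return kappa * GAMMA_MEV / DELTA_MEV


def fissility(Z, A, kappa):
    """Fissility parameter chi of equation (445)."""
    return (Z ** 2 / A) / fission_threshold(kappa)


print(round(fission_threshold(KAPPA_SPONTANEOUS), 2))   # compare with (449)
print(round(fission_threshold(KAPPA_THERMAL), 2))       # compare with (450)
print(fissility(94, 239, KAPPA_THERMAL))   # 239Pu: slightly above 1
print(fissility(26, 56, KAPPA_THERMAL))    # 56Fe: about one third
```

A light nucleus such as $^{56}$Fe has a fissility of roughly 0.33 for thermal neutrons, far below the zero-field threshold, which is why the remainder of this section asks what phase angle would be needed to bring such nuclei to the instability boundary.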

The generalization of equation (446) to the case of atomic nuclei located in an electromagnetic or gravitational field, which breaks the symmetry of the atomic number, neutron number and atomic mass number, can be written as

$$ \bar{z}^2/\bar{a} > \kappa\bar\gamma/\bar\delta \qquad (451) $$

where $\bar{z}$, $\bar{a}$, $\bar\gamma$ and $\bar\delta$ are given by equations (40), (42), (293) and (294) respectively. The fission instability boundary is given by

$$ \bar{z}^2/\bar{a} = \kappa\bar\gamma/\bar\delta \qquad (452) $$

or equivalently the two scalar fission stability boundary conditions are

$$ z^2/a = \kappa\gamma/\delta \qquad (453) $$

$$ \theta_a = 2\theta_z - \theta_\gamma + \theta_\delta \qquad (454) $$

Therefore in an external field the internal phase angles of the atomic number, atomic mass number, surface energy coefficient and the Coulomb energy coefficient enter into the fission instability condition. Equations (453) and (454) will now be solved to determine the critical value of $\theta_z$ and the critical value of the electromagnetic field strength that are required to catalyze the fission of subactinide nuclei.

B.  Critical  Value  of  the  Internal  Phase  Angle  of  the  Atomic  Number  that 
is  Required  for  the  Low  Energy  Fission  of  Subactinide  Nuclei  in  an 
External  Field. 

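Combining equation (67) with equations (453) and (454) gives the fission instability boundary for nuclei located in an external field as

$$ Z^2/A = (\kappa\gamma/\delta) \cos\theta_a \cos^{-2}\theta_z = (\kappa\gamma/\delta) \cos(2\theta_z + \theta_\delta - \theta_\gamma) \cos^{-2}\theta_z \qquad (455) $$

Equation (455) must be solved for $\theta_z$. This can be done by noting that simple trigonometry gives

$$ \cos(2\theta_z + \theta_\delta - \theta_\gamma) \cos^{-2}\theta_z = (1 - p^2)\cos(\theta_\gamma - \theta_\delta) + 2p \sin(\theta_\gamma - \theta_\delta) \qquad (456) $$

where

$$ p = \tan\theta_z \qquad (457) $$

Equation (455) can then be written as a quadratic equation

$$ a p^2 + b p + c = 0 \qquad (458) $$

where

$$ a = \cos(\theta_\gamma - \theta_\delta) \qquad (459) $$

$$ b = 2\sin(\theta_\delta - \theta_\gamma) = -2\sin(\theta_\gamma - \theta_\delta) \qquad (460) $$

$$ c = \delta/(\kappa\gamma)\, Z^2/A - \cos(\theta_\gamma - \theta_\delta) = \chi - \cos(\theta_\gamma - \theta_\delta) \qquad (461) $$

Then the critical angle for fission $\theta_z^F$ is given by

$$ \tan\theta_z^F = \tan(\theta_\gamma - \theta_\delta) \pm \sec(\theta_\gamma - \theta_\delta)[1 - \delta/(\kappa\gamma)\, Z^2/A \, \cos(\theta_\gamma - \theta_\delta)]^{1/2} = \tan(\theta_\gamma - \theta_\delta) \pm \sec(\theta_\gamma - \theta_\delta)[1 - \chi \cos(\theta_\gamma - \theta_\delta)]^{1/2} \qquad (462) $$

and from equation (454) the corresponding critical angle for fission $\theta_a^F$ is given by

$$ \theta_a^F = 2\theta_z^F + \theta_\delta - \theta_\gamma \qquad (463) $$

The angles $\theta_z^F$ and $\theta_a^F$ are the critical values of the phase angles $\theta_z$ and $\theta_a$ respectively that are required to bring a nucleus $(Z,A)$ into a state of incipient fission.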
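The solution of the quadratic (458) and its closed form (462) can be cross-checked numerically. In this sketch `phi` stands for the phase difference $\theta_\gamma - \theta_\delta$; both `phi` and the fissility value are hypothetical inputs, and the function names are illustrative.

```python
import math


def critical_tan_roots(chi, phi):
    """Roots p = tan(theta_z^F) of a p^2 + b p + c = 0, equations (458)-(461)."""
    a = math.cos(phi)
    b = -2.0 * math.sin(phi)
    c = chi - math.cos(phi)
    disc = b * b - 4.0 * a * c            # equals 4 (1 - chi cos(phi))
    if disc < 0:
        raise ValueError("no real root: chi exceeds sec(phi)")
    root = math.sqrt(disc)
    return (-b + root) / (2 * a), (-b - root) / (2 * a)


def critical_tan_closed_form(chi, phi):
    """Closed form of equation (462), positive and negative modes."""
    s = math.sqrt(1.0 - chi * math.cos(phi)) / math.cos(phi)
    return math.tan(phi) + s, math.tan(phi) - s


def boundary_chi(theta_z, phi):
    """Right-hand side of equation (455) divided by kappa*gamma/delta."""
    return math.cos(2 * theta_z - phi) / math.cos(theta_z) ** 2


chi, phi = 0.333, 0.05                    # hypothetical inputs
p_plus, p_minus = critical_tan_roots(chi, phi)
theta_z_plus = math.atan(p_plus)
theta_a_plus = 2 * theta_z_plus - phi     # equation (463)
print(theta_z_plus, theta_a_plus)
```

Substituting either root back into equation (455) recovers the input fissility, and setting the fissility to zero gives $\tan\theta_z^F = \tan(\theta_\gamma - \theta_\delta) \pm \sec(\theta_\gamma - \theta_\delta)$.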


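Equation (462) is the equation for the instability boundary for the fission of a nucleus in the presence of an external field, and is valid for

$$ 0 < Z^2/A < (\kappa\gamma/\delta)\sec(\theta_\gamma - \theta_\delta) \qquad (464A) $$

or in terms of the fissility parameter

$$ 0 < \chi < \sec(\theta_\gamma - \theta_\delta) \qquad (464B) $$

The values of $\theta_z^F$ corresponding to $\chi = 0$ for the positive and negative modes are given by

$$ \tan\theta_{zo}^{F\pm} = \tan(\theta_\gamma - \theta_\delta) \pm \sec(\theta_\gamma - \theta_\delta) \qquad (465) $$

Using simple trigonometric identities gives

$$ \theta_{zo}^{F\pm} = \pm\pi/4 + \tfrac{1}{2}(\theta_\gamma - \theta_\delta) \qquad (466) $$

The common value of $\theta_z^F$ and $\theta_a^F$ for both positive and negative modes corresponding to $\chi = \sec(\theta_\gamma - \theta_\delta)$ is given by

$$ \theta_z^{Fc} = \theta_a^{Fc} = \theta_\gamma - \theta_\delta \qquad (467) $$

The ranges of variation of $\theta_z^F$ and $\theta_a^F$ for the positive and negative angle modes subject to $\chi \le \sec(\theta_\gamma - \theta_\delta)$ are given by

$$ \theta_z^{Fc} < \theta_z^F < \theta_{zo}^{F+} \qquad \theta_a^{Fc} < \theta_a^F < \pi/2 \qquad (468) $$

$$ \theta_{zo}^{F-} < \theta_z^F < \theta_z^{Fc} \qquad -\pi/2 < \theta_a^F < \theta_a^{Fc} \qquad (469) $$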

The condition for spontaneous or thermal neutron induced fission in an external field is obtained from equation (462) to be for subactinide nuclei

$$ F < \chi < \sec(\theta_\gamma - \theta_\delta) \qquad (470) $$

where $\chi$ = fissility parameter, and where

$$ F = (1 - \Lambda^2)\sec(\theta_\gamma - \theta_\delta) \qquad (471) $$

$$ \Lambda = [\tan\theta_z - \tan(\theta_\gamma - \theta_\delta)]\cos(\theta_\gamma - \theta_\delta) \qquad (472) $$

If the external field is shut off all the internal phase angles have zero values and $\Lambda = 0$ and $F = 1$ so that equation (470) reduces to the result $\chi = 1$ for subactinide nuclei.


As a first approximation the condition $\theta_\gamma = \theta_\delta$ can be taken in equation (463) and the phase angle condition for spontaneous or thermal neutron induced fission in an external field is

$$ \theta_a^F = 2\theta_z^F = 2\theta_n^F \qquad (473) $$

Combining equations (455) and (473) gives the approximate fission instability boundary for nuclei in an external field as

$$ Z^2/A = (\kappa\gamma/\delta)(1 - \tan^2\theta_z^F) \qquad (474) $$

or equivalently as

$$ Z^2/A = (\kappa\gamma/\delta)[1 - \tan^2(\theta_a^F/2)] \qquad (475) $$

The condition for fission is therefore

$$ (Z^2/A)(\kappa\gamma/\delta)^{-1} > 1 - \tan^2\theta_z \qquad (476A) $$

or equivalently

$$ 1 - \tan^2\theta_z < \chi < 1 \qquad \theta_z > \theta_z^F \qquad (476B) $$

which follows directly from equations (470) through (472) when $\theta_\gamma = \theta_\delta$. In the presence of an electromagnetic field the fission condition is given by equation (476B), while for a zero external field equation (446) gives the fission condition. From equations (474) through (476) it is clear that in the presence of an external field nuclei lighter than the actinides can fission spontaneously or by thermal neutron absorption unless this is prevented by nuclear shell effects, as will be the case for some subactinide nuclei. In general the application of an electromagnetic or gravitational field tends to lead to more fission instability. The conclusion of this paragraph that subactinide nuclei in the presence of an external field can be fissioned by thermal neutrons depends on the validity of the following conditions

$$ \theta_\gamma - \theta_\delta \ll \theta_z \qquad (477) $$

$$ \Lambda \approx \tan\theta_z \qquad (478) $$

$$ F \approx 1 - \tan^2\theta_z \qquad (479) $$

where the functions $F$ and $\Lambda$ are given by equations (471) and (472) respectively. Equations (477) through (479) are generally true if $\theta_\gamma \approx \theta_\delta$, and therefore $F < 1$ in equation (470).


From  equation  (474)  it  follows  that  the  internal  phase  angle  of  the  atomic 
number  that  is  required  for  the  fission  instability  of  subactinide  nuclei  is 
given  approximately  for  the  positive  angle  mode  by 


\tan\theta_z^F = [1 - \delta/(\kappa\gamma) Z^2/A]^{1/2} = (1 - x)^{1/2}    (480)

Equation (480) can also be obtained directly from the exact equation (462) by
making the approximation $\theta_\gamma = \theta_\delta$. Equation (480) can
also be written as

\cos\theta_z^F = [2 - \delta/(\kappa\gamma) Z^2/A]^{-1/2} = (2 - x)^{-1/2}    (481)

\sin\theta_z^F = \{[1 - \delta/(\kappa\gamma) Z^2/A]/[2 - \delta/(\kappa\gamma) Z^2/A]\}^{1/2} = [(1 - x)/(2 - x)]^{1/2}    (482)

Equivalently equations (480) and (473) give approximately

\theta_z^F = \tan^{-1}[1 - \delta/(\kappa\gamma) Z^2/A]^{1/2} = \tan^{-1}(1 - x)^{1/2}    (483)

\theta_a^F = 2\tan^{-1}[1 - \delta/(\kappa\gamma) Z^2/A]^{1/2} = 2\tan^{-1}(1 - x)^{1/2}    (484)


Equations (480) through (482) are valid for $\theta_\gamma = \theta_\delta$ and
the following range of the fissility parameter

0 < Z^2/A < \kappa\gamma/\delta    (485)

or

0 < x < 1    (486)

for which the ranges of values of $\theta_z^F$ and $\theta_a^F$ are

0 < \theta_z^F < \pi/4    (487)

0 < \theta_a^F < \pi/2    (488)

which correspond to the exact relations given in equations (468) and (469)
respectively. For small values of x the following approximations are valid

\tan\theta_z^F \approx 1 - x/2    (489)

\sin\theta_z^F \approx 2^{-1/2}(1 - x/4)    (490)

\cos\theta_z^F \approx 2^{-1/2}(1 + x/4)    (491)

The internal phase angle of the atomic number at incipient fission $\theta_z^F$
as given by the approximation in equation (483) is presented in Figure 1.
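
As a rough numerical illustration of equations (480) through (484), the sketch
below evaluates the fissility parameter and the approximate incipient-fission
angle. It assumes the parameter values quoted later in the paper for Table 1
($\kappa$ = 1.471, $\gamma$ = 17.2 MeV, $\delta$ = 0.698 MeV); the function
names and the choice of $^{208}$Pb as an example are illustrative assumptions
and are not taken from the paper.

```python
import math

# Mass-formula parameters quoted in the text for Table 1 (gamma, delta in MeV).
KAPPA, GAMMA, DELTA = 1.471, 17.2, 0.698


def fissility(Z, A):
    """Fissility parameter x = (Z^2/A) * delta / (kappa * gamma)."""
    return (Z * Z / A) * DELTA / (KAPPA * GAMMA)


def theta_z_fission(x):
    """Approximate incipient-fission angle of equation (483), in radians."""
    if not 0.0 < x < 1.0:
        raise ValueError("equation (483) is stated only for 0 < x < 1")
    return math.atan(math.sqrt(1.0 - x))


if __name__ == "__main__":
    # 208Pb is an illustrative choice, not a row copied from Table 1.
    x = fissility(82, 208)
    theta = theta_z_fission(x)
    print(f"x = {x:.3f}  theta_z^F = {math.degrees(theta):.1f} deg")
```

The angle returned for any 0 < x < 1 can be checked against the cosine and sine
forms in equations (481) and (482), which is a convenient consistency test for
the reconstruction of those formulas.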

C.  Determination  of  the  Electromagnetic  Field  Strength  Required  to 
Catalyze  Fission  in  Subactinide  Nuclei  by  Thermal  Neutrons. 


The value of the magnetic field required to bring a lighter than actinide
nucleus to the point of fission instability is obtained from equations (209) and
(210) as

H^F = \theta_z^F / C^H_{\theta z} = K^H_{\theta z} \theta_z^F    (492)

where $\theta_z^F$ is given for the general case by equation (462). Therefore in
general

H^F = H^F(K^H_{\theta z}, \theta_\gamma, \theta_\delta, x)    (493)

where the fissility parameter x is given by equation (445). If $\theta_z^F$ is
given by the approximate equation (483) the critical value of the magnetic field
required to catalyze fission in a subactinide nucleus using thermal neutrons is
given by

H^F = K^H_{\theta z} \tan^{-1}(1 - x)^{1/2} = K^H_{\theta z} \tan^{-1}[1 - \delta/(\kappa\gamma) Z^2/A]^{1/2}    (494)

so that within this approximation

H^F = H^F(K^H_{\theta z}, x)    (495)

Equation (494) is valid for x < 1.

The condition x < 1 generally occurs in subactinide nuclei or in the
actinides with large neutron excess, while the condition $x \geq 1$ generally
occurs in the actinide and transactinide nuclei. Equation (494) gives the
magnetic field strength required to catalyze spontaneous or thermal neutron
induced fission in nuclei for which x < 1 except in those nuclei where nuclear
shell effects add increased stability and do not allow fission to occur. For
the case when x > 1 nuclei will generally fission spontaneously or by thermal
neutron induced fission without the presence of an electromagnetic field except
for those nuclei where shell effects give increased stability against fission,
as for example in the case of $^{238}$U. If the value of $K^H_{\theta z}$ is
very large the magnetic field strength required to catalyze clean fission in the
subactinide nuclei will be too large for laboratory demonstration and practical
nuclear reactor design. In Section 7 it will be shown that the static magnetic
field required to catalyze clean fission in the subactinide elements is in the
teratesla range and is too large for practical purposes. However, in Section 7
it is also shown that the application of a $\gamma$ ray electromagnetic field
with a relatively low magnetic field vector strength can catalyze clean fission
in the subactinide elements using thermal neutrons but only when nuclear shell
stability does not prohibit the occurrence of the phenomenon.

6. FINAL STATE ENERGY CONDITIONS FOR THE FISSION OF NUCLEI IN AN EXTERNAL
FIELD. This section considers a comparison between the initial and final energy
states of an atomic nucleus that has fissioned in the presence of an external
electromagnetic or gravitational field. A fission reaction in which a nucleus
$(Z,A)$ has split into two nuclei $(Z_1, A_1)$ and $(Z_2, A_2)$ is written in
the form$^{69-80}$

(Z,A) \to (Z_1, A_1) + (Z_2, A_2)    (496)

Then the nucleus $(Z_2, A_2)$ is assumed to eject a neutron

(Z_2, A_2) \to (Z_2, A_2 - 1) + (0,1)    (497)

where in this notation (0,1) is a single neutron. In this way the general
process of nuclear fission can be represented by a nuclear transformation of the
general form given in equation (496). In an external field the nuclei
represented in equation (496) are also associated with complex atomic numbers,
neutron numbers and atomic mass numbers that in analogy to equations (40)
through (42) are represented for subactinide nuclei by

\bar{z} = z \exp(j\theta_z) = Z \cos\theta_z \exp(j\theta_z)    (498)

\bar{n} = n \exp(j\theta_n) = N \cos\theta_n \exp(j\theta_n)    (499)

\bar{a} = a \exp(j\theta_a) = A \cos\theta_a \exp(j\theta_a)    (500)

\bar{z}_1 = z_1 \exp(j\theta_{z1}) = Z_1 \cos\theta_{z1} \exp(j\theta_{z1})    (501)

\bar{n}_1 = n_1 \exp(j\theta_{n1}) = N_1 \cos\theta_{n1} \exp(j\theta_{n1})    (502)

\bar{a}_1 = a_1 \exp(j\theta_{a1}) = A_1 \cos\theta_{a1} \exp(j\theta_{a1})    (503)

\bar{z}_2 = z_2 \exp(j\theta_{z2}) = Z_2 \cos\theta_{z2} \exp(j\theta_{z2})    (504)

\bar{n}_2 = n_2 \exp(j\theta_{n2}) = N_2 \cos\theta_{n2} \exp(j\theta_{n2})    (505)

\bar{a}_2 = a_2 \exp(j\theta_{a2}) = A_2 \cos\theta_{a2} \exp(j\theta_{a2})    (506)


The  determination  of  the  energy  released  during  the  fission  reaction  given  in 
equation  (496)  requires  that  all  of  the  nine  internal  phase  angles  that  appear 
in  equations  (498)  through  (506)  be  determined,  and  the  procedure  for  doing  this 
will  now  be  given. 

A.  Determination  of  the  Internal  Phase  Angles  of  the  Atomic  Number, 
Neutron  Number  and  Atomic  Mass  Number  for  the  Initial  and  Final 
States  of  a  Fission  Reaction  for  Subactinide  Nuclei. 


The  nuclei  involved  in  the  fission  reaction  given  by  equation  (496)  are 
subject  to  the  following  scalar  baryon  number  conservation  equations 


A = Z + N
A_1 = Z_1 + N_1
A_2 = Z_2 + N_2    (507)

A = A_1 + A_2
Z = Z_1 + Z_2
N = N_1 + N_2    (508)

In an external field the nuclei represented by equation (496) are subject to
the following complex atomic number, neutron number and atomic mass number
conservation equations similar to equation (82)

\bar{a} + W = \bar{z} + \bar{n}
\bar{a}_1 + W_1 = \bar{z}_1 + \bar{n}_1
\bar{a}_2 + W_2 = \bar{z}_2 + \bar{n}_2    (509)

\bar{a} + W_a = \bar{a}_1 + \bar{a}_2
\bar{z} + W_z = \bar{z}_1 + \bar{z}_2
\bar{n} + W_n = \bar{n}_1 + \bar{n}_2    (510)

Equations (509) and (510) show that not all of the W's are independent, and
in fact they are subject to the following equation

W - W_1 - W_2 = W_a - W_z - W_n    (511)

Equations (509) and (510) can be combined with equations (498) through (506) to
yield the following twelve equations for subactinide nuclei

A \cos^2\theta_a + W = Z \cos^2\theta_z + N \cos^2\theta_n    (512)

A \cos\theta_a \sin\theta_a = Z \cos\theta_z \sin\theta_z + N \cos\theta_n \sin\theta_n    (513)

A_1 \cos^2\theta_{a1} + W_1 = Z_1 \cos^2\theta_{z1} + N_1 \cos^2\theta_{n1}    (514)

A_1 \cos\theta_{a1} \sin\theta_{a1} = Z_1 \cos\theta_{z1} \sin\theta_{z1} + N_1 \cos\theta_{n1} \sin\theta_{n1}    (515)

A_2 \cos^2\theta_{a2} + W_2 = Z_2 \cos^2\theta_{z2} + N_2 \cos^2\theta_{n2}    (516)

A_2 \cos\theta_{a2} \sin\theta_{a2} = Z_2 \cos\theta_{z2} \sin\theta_{z2} + N_2 \cos\theta_{n2} \sin\theta_{n2}    (517)

A \cos^2\theta_a + W_a = A_1 \cos^2\theta_{a1} + A_2 \cos^2\theta_{a2}    (518)

A \cos\theta_a \sin\theta_a = A_1 \cos\theta_{a1} \sin\theta_{a1} + A_2 \cos\theta_{a2} \sin\theta_{a2}    (519)

Z \cos^2\theta_z + W_z = Z_1 \cos^2\theta_{z1} + Z_2 \cos^2\theta_{z2}    (520)

Z \cos\theta_z \sin\theta_z = Z_1 \cos\theta_{z1} \sin\theta_{z1} + Z_2 \cos\theta_{z2} \sin\theta_{z2}    (521)

N \cos^2\theta_n + W_n = N_1 \cos^2\theta_{n1} + N_2 \cos^2\theta_{n2}    (522)

N \cos\theta_n \sin\theta_n = N_1 \cos\theta_{n1} \sin\theta_{n1} + N_2 \cos\theta_{n2} \sin\theta_{n2}    (523)

The quantities $Z, N, A$; $Z_1, N_1, A_1$ and $Z_2, N_2, A_2$ are known.
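
The way each complex conservation equation splits into a real (cosine squared)
part and an imaginary (sine times cosine) part can be checked numerically. The
following is a minimal sketch under the stated form X cos(theta) exp(j theta)
of equations (498) through (506); the function names and the sample angles are
hypothetical and serve only as an illustration.

```python
import cmath
import math


def complex_number(X, theta):
    """Complex form X*cos(theta)*exp(j*theta) of equations (498)-(506)."""
    return X * math.cos(theta) * cmath.exp(1j * theta)


def w_from_angles(Z, N, theta_z, theta_n, theta_a):
    """W from the real part of the first line of equation (509), i.e. (512)."""
    A = Z + N  # scalar baryon number conservation, equation (507)
    return (Z * math.cos(theta_z) ** 2
            + N * math.cos(theta_n) ** 2
            - A * math.cos(theta_a) ** 2)


if __name__ == "__main__":
    c = complex_number(92.0, 0.3)  # sample magnitude and angle, illustrative only
    print(c.real, 92.0 * math.cos(0.3) ** 2)                # real part: X cos^2(theta)
    print(c.imag, 92.0 * math.cos(0.3) * math.sin(0.3))     # imaginary part: X cos(theta) sin(theta)
```

With all angles set to zero the function `w_from_angles` returns zero, which
matches the statement in the text that the W's vanish when the external field
is removed.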

There are fifteen unknown quantities in the problem of the fission of an
atomic nucleus in the presence of an electromagnetic field:

W , \theta_z , \theta_n , \theta_a    (524)

W_1 , \theta_{z1} , \theta_{n1} , \theta_{a1}    (525)

W_2 , \theta_{z2} , \theta_{n2} , \theta_{a2}    (526)

W_z , W_n , W_a    (527)

There are fifteen equations to determine these quantities and they are: the
twelve equations (512) through (523), the two fission instability equations
(453) and (454) which determine $\theta_z$ and $\theta_a$ in the forms of
equations (462) and (463), and finally equation (511) which relates the various
W-functions. The values of the W's are similar in form to the functions required
for the addition law of complex magnetic quantum numbers as given in equation
(37), and are written as follows for subactinide nuclei

W = -A/2[1 + (1 - 4f_W^2)^{1/2}] + Z \cos^2\theta_z + N \cos^2\theta_n    (528)

W_1 = -A_1/2[1 + (1 - 4f_{W1}^2)^{1/2}] + Z_1 \cos^2\theta_{z1} + N_1 \cos^2\theta_{n1}    (529)

W_2 = -A_2/2[1 + (1 - 4f_{W2}^2)^{1/2}] + Z_2 \cos^2\theta_{z2} + N_2 \cos^2\theta_{n2}    (530)

W_z = -Z/2[1 + (1 - 4f_{Wz}^2)^{1/2}] + Z_1 \cos^2\theta_{z1} + Z_2 \cos^2\theta_{z2}    (531)

W_n = -N/2[1 + (1 - 4f_{Wn}^2)^{1/2}] + N_1 \cos^2\theta_{n1} + N_2 \cos^2\theta_{n2}    (532)

W_a = -A/2[1 + (1 - 4f_{Wa}^2)^{1/2}] + A_1 \cos^2\theta_{a1} + A_2 \cos^2\theta_{a2}    (533)

where

f_W = A^{-1}(Z \sin\theta_z \cos\theta_z + N \sin\theta_n \cos\theta_n)    (534)

f_{W1} = A_1^{-1}(Z_1 \sin\theta_{z1} \cos\theta_{z1} + N_1 \sin\theta_{n1} \cos\theta_{n1})    (535)

f_{W2} = A_2^{-1}(Z_2 \sin\theta_{z2} \cos\theta_{z2} + N_2 \sin\theta_{n2} \cos\theta_{n2})    (536)

f_{Wz} = Z^{-1}(Z_1 \sin\theta_{z1} \cos\theta_{z1} + Z_2 \sin\theta_{z2} \cos\theta_{z2})    (537)

f_{Wn} = N^{-1}(N_1 \sin\theta_{n1} \cos\theta_{n1} + N_2 \sin\theta_{n2} \cos\theta_{n2})    (538)

f_{Wa} = A^{-1}(A_1 \sin\theta_{a1} \cos\theta_{a1} + A_2 \sin\theta_{a2} \cos\theta_{a2})    (539)

and where

\cos^2\theta_a = 1/2[1 + (1 - 4f_W^2)^{1/2}]    (540)

\cos^2\theta_{a1} = 1/2[1 + (1 - 4f_{W1}^2)^{1/2}]    (541)

\cos^2\theta_{a2} = 1/2[1 + (1 - 4f_{W2}^2)^{1/2}]    (542)

\cos^2\theta_z = 1/2[1 + (1 - 4f_{Wz}^2)^{1/2}]    (543)

\cos^2\theta_n = 1/2[1 + (1 - 4f_{Wn}^2)^{1/2}]    (544)

\cos^2\theta_a = 1/2[1 + (1 - 4f_{Wa}^2)^{1/2}]    (545)
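
The relations (540) through (545) rest on the elementary identity that links
the cosine squared of an angle to the product f = sin(theta) cos(theta). The
sketch below checks that identity numerically. It is an illustration only: the
function name is hypothetical, and the comment about the plus root holding for
angles up to pi/4 is a general trigonometric fact rather than a statement made
in the paper.

```python
import math


def cos2_from_f(f):
    """cos^2(theta) from f = sin(theta)*cos(theta), plus root as in (540)-(545).

    The plus root reproduces cos^2(theta) only while cos(2*theta) >= 0,
    i.e. for |theta| <= pi/4; beyond that it returns sin^2(theta) instead.
    """
    disc = max(0.0, 1.0 - 4.0 * f * f)  # guard against tiny negative round-off
    return 0.5 * (1.0 + math.sqrt(disc))


if __name__ == "__main__":
    for theta in (0.1, 0.4, 0.7):
        f = math.sin(theta) * math.cos(theta)
        print(theta, cos2_from_f(f), math.cos(theta) ** 2)
```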


Therefore equations (528) through (533) can be written as

W = -A \cos^2\theta_a + Z \cos^2\theta_z + N \cos^2\theta_n
  = Z(\cos^2\theta_z - \cos^2\theta_a) + N(\cos^2\theta_n - \cos^2\theta_a)    (546)

W_1 = -A_1 \cos^2\theta_{a1} + Z_1 \cos^2\theta_{z1} + N_1 \cos^2\theta_{n1}
    = Z_1(\cos^2\theta_{z1} - \cos^2\theta_{a1}) + N_1(\cos^2\theta_{n1} - \cos^2\theta_{a1})    (547)

W_2 = -A_2 \cos^2\theta_{a2} + Z_2 \cos^2\theta_{z2} + N_2 \cos^2\theta_{n2}
    = Z_2(\cos^2\theta_{z2} - \cos^2\theta_{a2}) + N_2(\cos^2\theta_{n2} - \cos^2\theta_{a2})    (548)

W_z = -Z \cos^2\theta_z + Z_1 \cos^2\theta_{z1} + Z_2 \cos^2\theta_{z2}
    = Z_1(\cos^2\theta_{z1} - \cos^2\theta_z) + Z_2(\cos^2\theta_{z2} - \cos^2\theta_z)    (549)

W_n = -N \cos^2\theta_n + N_1 \cos^2\theta_{n1} + N_2 \cos^2\theta_{n2}
    = N_1(\cos^2\theta_{n1} - \cos^2\theta_n) + N_2(\cos^2\theta_{n2} - \cos^2\theta_n)    (550)

W_a = -A \cos^2\theta_a + A_1 \cos^2\theta_{a1} + A_2 \cos^2\theta_{a2}
    = A_1(\cos^2\theta_{a1} - \cos^2\theta_a) + A_2(\cos^2\theta_{a2} - \cos^2\theta_a)    (551)

A comparison of equations (540) and (545) shows that

f_W = f_{Wa}    (552)

and equations (546) and (551) give

W - W_a = Z \cos^2\theta_z + N \cos^2\theta_n - A_1 \cos^2\theta_{a1} - A_2 \cos^2\theta_{a2}    (553)


Equation (473) shows that a nucleus in a state of incipient fission has
$\theta_a \sim 2\theta_z$ and therefore equation (546) gives W > 0. In general
for fission in an external field where the internal phase angles are relatively
large

W > 0 ,    W_1 > 0 ,    W_2 > 0    (554)

W_z < 0 ,    W_n < 0 ,    W_a < 0    (555)

When the external electromagnetic or gravitational field is shut off all of
the W's have zero values. However, in reality external fields are always
present.

B. Energy Released from Nuclear Fission in an External Field.

The Q value of a nuclear reaction is a measure of the energy released
in a nuclear fission process.$^{61}$ In this paper a complex number
generalization of the standard definition of the Q value is given by

Q/c^2 = M(A,Z) - M(A_1,Z_1) - M(A_2,Z_2)    (556)

where as in equation (361)

M(A,Z) = \bar{n} m_n + \bar{z} m_H - B(A,Z)/c^2    (557)

M(A_1,Z_1) = \bar{n}_1 m_n + \bar{z}_1 m_H - B(A_1,Z_1)/c^2    (558)

M(A_2,Z_2) = \bar{n}_2 m_n + \bar{z}_2 m_H - B(A_2,Z_2)/c^2    (559)

Then the Q value can be written as

Q = [(\bar{n} - \bar{n}_1 - \bar{n}_2) m_n + (\bar{z} - \bar{z}_1 - \bar{z}_2) m_H] c^2
    + B(A_1,Z_1) + B(A_2,Z_2) - B(A,Z)    (560)

Using equation (510) allows equation (560) to be written as

Q = Q_1 + Q_2    (561)

where

Q_1 = -(W_n m_n + W_z m_H) c^2    (562)

Q_2 = B(A_1,Z_1) + B(A_2,Z_2) - B(A,Z)    (563)

where $W_z$ and $W_n$ are given by equations (549) and (550) respectively.

Because $W_n < 0$ and $W_z < 0$ it follows that for subactinide nuclei

Q_1 > 0    (564)

The value of $Q_1$ arises from the rest mass terms in equations (557) through
(562). The actual rest mass is unchanged in a nuclear fission process because

N m_n + Z m_H - (N_1 m_n + Z_1 m_H) - (N_2 m_n + Z_2 m_H) = 0    (565)

which is always true because of the absolute validity of baryon number
conservation which for the present case is written as

Z = Z_1 + Z_2 ,    N = N_1 + N_2    (566)

A finite value of $Q_1$ results from the special form of the conservation law of
complex baryon numbers, which for the complex atomic number, neutron number
and atomic mass number are given in equations (509) and (510). The nonzero
value of $Q_1$ does not represent a violation of the law of baryon number
conservation but instead is a manifestation of the broken symmetry of the atomic
number and the neutron number in an external field. Note that the expression for
$Q_1$ can be rewritten using equations (549), (550) and (562) as

Q_1/c^2 = -m_n[N_1(\cos^2\theta_{n1} - \cos^2\theta_n) + N_2(\cos^2\theta_{n2} - \cos^2\theta_n)]
          - m_H[Z_1(\cos^2\theta_{z1} - \cos^2\theta_z) + Z_2(\cos^2\theta_{z2} - \cos^2\theta_z)]    (567)

In general $Q_1$ can be taken to be a small number and can be neglected compared
to the value of $Q_2$. For zero value of the applied external field $Q_1 = 0$
because all internal phase angles have zero values, and therefore $W_n = 0$ and
$W_z = 0$.

The value of $Q_2$ can be calculated by combining equations (292) and (563).
This is easily done for symmetric fission and under the approximation

\theta_{z1} \sim \theta_{z2} \sim \theta_z ,    \theta_{n1} \sim \theta_{n2} \sim \theta_n    (568)

For symmetric fission equation (563) becomes

Q_2 = 2B(A/2 , Z/2) - B(A,Z)    (569)

Under these assumptions the value of $Q_2$ is given by the following complex
number generalization of the standard scalar result$^{61}$

Q_2 = (1 - 2^{1/3})\bar\gamma\bar{a}^{2/3} + (1 - 2^{-2/3})\bar\delta\bar{z}^2/\bar{a}^{1/3}
    = -0.26\bar\gamma\bar{a}^{2/3} + 0.37\bar\delta\bar{z}^2/\bar{a}^{1/3}    (570)

The simple form in equation (570) results from the approximation given in
equation (568). The value of Q is then written as

Q = -(W_n m_n + W_z m_H)c^2 - 0.26\bar\gamma\bar{a}^{2/3} + 0.37\bar\delta\bar{z}^2/\bar{a}^{1/3}    (571)

The measured value of Q is given by the real part of equation (571)

Q_m = -(W_n m_n + W_z m_H)c^2 - 0.26\gamma A^{2/3}\cos^{2/3}\theta_a\cos(\theta_\gamma + 2/3\theta_a)
      + 0.37\delta Z^2 A^{-1/3}\cos^2\theta_z\cos^{-1/3}\theta_a\cos(\theta_\delta + 2\theta_z - 1/3\theta_a)    (572)

Equation (572) can be compared to the conventionally calculated value of Q
which is given by$^{61}$

Q_c = -0.26\gamma A^{2/3} + 0.37\delta Z^2 A^{-1/3}    (573)
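
The numerical coefficients in equations (570) and (573) follow directly from
the symmetric-fission factors $1 - 2^{1/3}$ and $1 - 2^{-2/3}$. The sketch
below evaluates them and the conventional Q value. It assumes the values
$\gamma$ = 17.2 MeV and $\delta$ = 0.698 MeV that the paper quotes for Table 1;
the function name and the example nucleus are illustrative assumptions.

```python
# Surface and Coulomb parameters quoted in the text for Table 1, in MeV.
GAMMA, DELTA = 17.2, 0.698

C_SURFACE = 1.0 - 2.0 ** (1.0 / 3.0)    # about -0.26
C_COULOMB = 1.0 - 2.0 ** (-2.0 / 3.0)   # about +0.37


def q_conventional(Z, A, gamma=GAMMA, delta=DELTA):
    """Zero-field symmetric-fission Q value of equation (573), in MeV."""
    return (C_SURFACE * gamma * A ** (2.0 / 3.0)
            + C_COULOMB * delta * Z * Z / A ** (1.0 / 3.0))


if __name__ == "__main__":
    # A = 236, Z = 92 is an illustrative compound nucleus, not a Table 1 entry.
    print(f"{C_SURFACE:.2f} {C_COULOMB:.2f} {q_conventional(92, 236):.0f} MeV")
```

Because the external-field result (572) only multiplies each term by cosine
factors, the same function gives the zero-field limit against which that
equation can be compared.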
For the case of zero external field equation (572) reduces to equation (573).

A condition that determines the possibility of the final fission state
to occur can be obtained from the Q value for the nuclear fission process.$^{61}$
The complex number generalization of this condition is

Q > E^*    (574)

where $E^*$ = complex number Coulomb potential energy of two spherical nuclei
(Z/2,A/2) in geometrical contact. This Coulomb energy can be written as a
simple complex number generalization of the standard scalar result$^{61}$

E^* = 1/2 e^2(\bar{z}/2)^2/[\bar{b}(\bar{a}/2)^{1/3}]
    = 2^{1/3}(1/8)(5/3)\bar\delta\bar{z}^2/\bar{a}^{1/3} = 0.262\bar\delta\bar{z}^2/\bar{a}^{1/3}    (575)

where as before in equation (329)

\bar\delta = 3/5 e^2/\bar{b} = 0.863/\bar{b} = (0.863/1.523)\bar{k}_c MeV    (576)

where $\bar{b}$ = complex number radius parameter given by equation (283). For
$k_c = 1.35$ fm$^{-1}$ as in equation (332) it follows that

\delta = 0.765 MeV = 765 keV    (577)

and $\theta_\delta$ is given by equation (331).
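
The two numerical factors used in equations (575) through (577) can be checked
with a few lines of arithmetic. This is a minimal sketch that uses only the
constants printed in those equations; the function names are illustrative and
the magnitudes are treated as real numbers.

```python
def coulomb_contact_coefficient():
    """Factor 2^(1/3) * (1/8) * (5/3) multiplying delta*z^2/a^(1/3) in (575)."""
    return 2.0 ** (1.0 / 3.0) * (1.0 / 8.0) * (5.0 / 3.0)


def delta_from_kc(k_c):
    """Coulomb parameter of equation (576) in MeV, for k_c in fm^-1."""
    return (0.863 / 1.523) * k_c


if __name__ == "__main__":
    print(f"{coulomb_contact_coefficient():.3f}")   # compare with 0.262 in (575)
    print(f"{delta_from_kc(1.35):.3f} MeV")         # compare with 0.765 MeV in (577)
```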

Combining equations (571), (574) and (575) gives the final state fission
energy condition as

(W_n m_n + W_z m_H)c^2 + 0.26\bar\gamma\bar{a}^{2/3} - 0.11\bar\delta\bar{z}^2/\bar{a}^{1/3} < 0    (578)

This equation can be used instead of the incipient fission condition given in
equation (452) to determine $\theta_z$ and $\theta_a$. However because of the
presence of the functions $W_n$ and $W_z$ the full set of thirteen equations
(511) through (523) must be solved in conjunction with the two components of
equation (578) which are

(W_n m_n + W_z m_H)c^2 + 0.26\gamma A^{2/3}\cos^{2/3}\theta_a\cos(\theta_\gamma + 2/3\theta_a)
    - 0.11\delta Z^2 A^{-1/3}\cos^2\theta_z\cos^{-1/3}\theta_a\cos(\theta_\delta + 2\theta_z - 1/3\theta_a)    (579)

0.26\gamma A^{2/3}\cos^{2/3}\theta_a\sin(\theta_\gamma + 2/3\theta_a)
    - 0.11\delta Z^2 A^{-1/3}\cos^2\theta_z\cos^{-1/3}\theta_a\sin(\theta_\delta + 2\theta_z - 1/3\theta_a)    (580)

If $W_n$ and $W_z$ are neglected in equation (578) then the final state fission
condition can be written as

\bar{z}^2/\bar{a} > \kappa_c\bar\gamma/\bar\delta ,    \kappa_c = 2.36    (581)

Equation (581) is the same form as the incipient fission condition given in
equation (451) and the same form of solution for $\theta_z^F$ and $\theta_a^F$
that appears in equations (462) and (480) can now be used to determine these
phase angles for the final state fission condition. Then the remaining thirteen
equations (511) through (523) can be used to calculate the remaining thirteen
functions listed in equations (524) through (527).

7. SUSTAINED $\gamma$ RAY CATALYZED THERMAL NEUTRON INDUCED CLEAN FISSION
NUCLEAR REACTIONS. This section presents numerical calculations of the internal
phase angle of the atomic number $\theta_z^F$ that is required for the external
field catalysis of spontaneous or thermal neutron induced fission nuclear
reactions in nuclei lighter than the actinides. The corresponding strengths of
the static magnetic field and electromagnetic wave field required to catalyze
spontaneous or thermal neutron induced fission in subactinide nuclei are also
obtained. Under ordinary conditions these relatively light nuclei are not
fissile for incident thermal neutrons. In fact, under ordinary circumstances
only some of the actinides sustain fission by thermal neutrons, for example
$^{235}$U and $^{239}$Pu are fissile for incident thermal neutrons but
$^{238}$U and $^{232}$Th are not. Higher incident neutron energies will induce
fission in all of the actinides, and in fact light elements will undergo neutron
induced fission for sufficiently high kinetic energy of the incident neutrons;
the nuclei are simply blown apart. For nuclear reactors requiring sustained
fission reactions, however, thermal neutron induced fission reactions are
required so that at present only actinide elements such as $^{235}$U and
$^{239}$Pu can be used. The fission products of these heavy nuclei are dangerous
radionuclides.

The fission product nuclei from conventional actinide element nuclear
reactors are radioactive because they have large neutron excesses and are far
removed from the valley of beta stability. The primary radiations from
conventional nuclear reactor fission products are beta decays, alpha decays and
neutron emissions which occur as the nuclei move toward the valley of beta
stability. The only way to have clean low energy fission is to have fission
product nuclei that are close to the valley of beta stability, and the only
practical way to achieve this for nuclear reactors is to have thermal neutron
induced fission in nuclei lighter than the actinides. Section 5 of this paper
showed that this is possible if the subactinide nuclei are immersed in an
electromagnetic field. Many other surprising effects occur when atoms are
immersed in an electromagnetic field.31-90 This section develops the
electromagnetic field criteria that are necessary for catalyzing the clean
fission of relatively light subactinide nuclei using thermal neutrons, and in
particular it is shown that a $\gamma$ ray field is required. The fission
products of these reactions are either not radioactive or exhibit only low level
emissions. Examples of the clean fission reactions for subactinide nuclei are
presented.

A. Numerical Values of the Internal Phase Angle $\theta_z^F$ Required for the
Low Energy Clean Fission of Subactinide Nuclei.

The general expression for $\theta_z^F$ associated with the fission of a
subactinide nucleus (A,Z) requires $\theta_\gamma$ and $\theta_\delta$ as input
parameters as shown in equation (462). These two mass formula parameters can
only be determined by fitting the broken symmetry form of the liquid drop
nuclear mass formula given by equations (347) and (369) to measured nuclear mass
data. As this was not done, the assumption $\theta_\gamma = \theta_\delta$ is
made and only the approximate expression for $\theta_z^F$ given by equation
(483) is used for numerical analysis. Table 1 gives the values of $\theta_z^F$
required for thermal neutron induced fission. The results in Table 1 are
calculated for $\kappa$ = 1.471, $\gamma$ = 17.2 MeV, $\delta$ = 0.698 MeV and
$\kappa\gamma/\delta$ = 36.25 which enters the fissility parameter calculation
in equation (445). The average kinetic energy of the thermal neutrons is
0.025 eV.

The nuclei in Table 1 are arranged by decreasing values of the fissility
parameter. Those actinide nuclei for which thermal neutron induced fission
occurs without the need of an external electromagnetic field are indicated by
zero values of $\theta_z^F$. The fact that there is no sharp boundary between
nuclei that can be fissioned with thermal neutrons and those that cannot shows
that the Bohr-Wheeler fissility parameter is a collective property of a nucleus
and does not completely describe the nuclear fission process because nuclear
shell effects play an important role. For a gross description of nuclear fission
the fissility parameter is adequate. Table 1 shows that the angle $\theta_z^F$
is relatively small for the actinide nuclei that require an external
electromagnetic field to undergo fission by thermal neutrons such as $^{234}$U,
$^{233}$Pa, $^{238}$U and $^{232}$Th, so that any experimental confirmation of
electromagnetically catalyzed clean fission would be easiest for these actinide
nuclei unless strong nuclear shell effects enter to prevent low energy fission.

B.  Static  Magnetic  Field  Required  to  Catalyze  Low  Energy  Clean  Fission 
in  Subactinide  Nuclei. 


The determination of the static magnetic field necessary to catalyze clean
fission in nuclei lighter than the actinides using thermal neutrons requires
values of the magnetic stiffness coefficients $K^B_{\theta z}$ and
$K^H_{\theta z}$ or equivalently the magnetic compliance coefficients
$C^B_{\theta z}$ and $C^H_{\theta z}$ that are defined in equations (185)
through (214). The values of these coefficients can be calculated by requiring
the equality of the potential energy density of elastic shearing in internal
space and the magnetic energy density of the applied field. This equality is
written as

1/2 G\theta_z^2 = B^2/(2\mu) = 1/2 \mu H^2    (582)

or as in equations (209) through (212)

\theta_z = C^B_{\theta z} B    (583)

\theta_z = C^H_{\theta z} H    (584)

where from equation (582)

C^B_{\theta z} = (\mu G)^{-1/2} ,    C^H_{\theta z} = (\mu/G)^{1/2}    (585)

K^B_{\theta z} = (G\mu)^{1/2} ,    K^H_{\theta z} = (G/\mu)^{1/2}    (586)

where G and $\mu$ = shear modulus and magnetic permeability of an atomic
nucleus, and B and H = magnitudes of the magnetic induction vector and magnetic
field vector respectively. The values of B and H required to fission a
subactinide nucleus are then given by equations (583), (584) and (586) as

B^F = K^B_{\theta z}\theta_z^F = (G\mu)^{1/2}\theta_z^F    (587)

H^F = K^H_{\theta z}\theta_z^F = (G/\mu)^{1/2}\theta_z^F    (588)

so that generally if equation (462) is used for $\theta_z^F$

B^F = B^F(\theta_\gamma , \theta_\delta , x , G , \mu)    (589)

or more simply if equation (483) is used to calculate $\theta_z^F$

B^F = B^F(x , G , \mu)    (590)

where x = fissility parameter given by equation (445). The units of the
relevant physical quantities used in this analysis are given by

[H] = amp/m = coul/(m sec)

[B] = T = Wb/m^2 = kg/(sec coul) = 10^4 gauss = N sec/(m coul)

[G] = N/m^2

[\mu] = kg m/coul^2 = Henry/m = Wb/(amp m) = N sec^2/coul^2

[C^H_{\theta z}] = rad m sec/coul = rad m/amp

[C^B_{\theta z}] = rad/T = rad sec coul/kg = rad m^2/Wb

[K^H_{\theta z}] = coul/(rad m sec) = amp/(rad m)

[K^B_{\theta z}] = T/rad = kg/(rad sec coul) = Wb/(rad m^2) = N sec/(rad m coul)

where T = tesla.

Equation (587) shows that the value of the magnetic induction field $B^F$
required for clean fission depends on the value of the magnetic permeability
of nuclear matter. The value of $\mu$ refers to neutrons and protons in nuclear
matter at nuclear density which corresponds to $k_c \sim 1.35$ fm$^{-1}$. The
vacuum value of $\mu$ is given by$^{91,92}$

\mu_0 = 4\pi \times 10^{-7} kg m/coul^2    (591)

The value of $\mu$ corresponding to nuclear matter is given by$^{91,92}$

\mu = \mu_0(1 + \chi_m)    (592)

where the magnetic susceptibility $\chi_m$ of nuclear matter is given by

\chi_m = n_c\beta = n_c(\beta_p + \beta_n)    (593)

where $n_c$ = central nucleon number density of an atomic nucleus which is
approximately equal to the saturation nucleon number density of infinite nuclear
matter and is given by$^{60,68}$

n_c = 2/(3\pi^2) k_c^3 \sim 0.166 fm^{-3}    (594)

and $\beta_p$ = magnetic polarizability of the proton and $\beta_n$ = magnetic
polarizability of the neutron which are given by$^{93,94}$

\beta_p \sim \beta_n = 3.34 \times 10^{-4} fm^3    (595)

Equations (593) through (595) give the magnetic susceptibility of nuclear
matter at nuclear density a value of

\chi_m = 1.11 \times 10^{-4}    (596)

so that $\chi_m \ll 1$ and equation (592) gives $\mu \sim \mu_0$, and therefore
the vacuum value of the magnetic permeability is adequate for calculating the
magnetic properties of nuclear matter at nuclear density.
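
The arithmetic behind equations (593) through (596) is short enough to check
directly. The sketch below is an illustration under the values printed in the
text ($k_c$ = 1.35 fm$^{-1}$ and polarizabilities of $3.34 \times 10^{-4}$
fm$^3$); the function names are hypothetical.

```python
import math


def nucleon_density(k_c):
    """Equation (594): n_c = 2 k_c^3 / (3 pi^2), in fm^-3 for k_c in fm^-1."""
    return 2.0 * k_c ** 3 / (3.0 * math.pi ** 2)


def susceptibility(n_c, beta_p, beta_n):
    """Equation (593): chi_m = n_c (beta_p + beta_n), polarizabilities in fm^3."""
    return n_c * (beta_p + beta_n)


if __name__ == "__main__":
    n_c = nucleon_density(1.35)
    chi_m = susceptibility(n_c, 3.34e-4, 3.34e-4)
    print(f"n_c = {n_c:.3f} fm^-3, chi_m = {chi_m:.2e}")
```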

Equation (587) shows that the value of the magnetic induction field $B^F$
required for the electromagnetic catalysis of thermal neutron induced clean
fission of elements lighter than the actinides depends on the shear modulus of
nuclear matter. The shear modulus of nuclear matter is easily obtained from
the value of the bulk modulus (incompressibility) and Poisson's ratio of nuclear
matter as follows$^{95}$

G = 3(1 - 2\nu)K[2(1 + \nu)]^{-1}    (597)

where G, $\nu$ and K = shear modulus, Poisson's ratio and bulk modulus of
nuclear matter. The values of Poisson's ratio usually have a range of values
$0 < \nu < 1/2$, so that a value $\nu = 1/4$ is adopted in this paper. The bulk
modulus is given by the following standard formula$^{95}$

K = n\,dP/dn = n(2n\,d\varepsilon/dn + k)    (598)

where n = nucleon number density, P = pressure, $\varepsilon$ = average energy
per nucleon and k = incompressibility parameter given by$^{68,96}$

k = n^2 d^2\varepsilon/dn^2    (599)

At the saturation density which occurs at the minimum average energy per nucleon
$d\varepsilon/dn = 0$, and equation (598) gives the bulk modulus as

K = nk = n^3 d^2\varepsilon/dn^2    (600)

For nuclear matter with Z/N = 1 the following are commonly used values of the
saturation nucleon number density and the incompressibility factor$^{68,96}$

n = 0.166 nucleons/fm^3    (601)

k = 250 MeV/nucleon    (602)

Equation (602) gives a relatively high value for the incompressibility factor
so that the author will not be accused of ignoring the hard facts.

Combining equations (597) and (600) through (602) gives the bulk modulus
and shear modulus of Z = N nuclear matter at nuclear density as

K = 41.5 MeV/fm^3 = 6.64 \times 10^{33} N/m^2    (603)

G = 24.9 MeV/fm^3 = 3.98 \times 10^{33} N/m^2    (604)

Also,  at  equilibrium  the  average  binding  energy  for  infinite  nuclear  matter  and 
for  an  atomic  nucleus  with  N  *  Z  are  respectively 


e  "v  15.5  MeV/nucleon 


infinite  nuclear  matter 


(605) 


e  ^  8.0  MeV/nucleon 


atomic  nucleus 


while  the  average  Coulomb  energy  per  nucleon  is68 


£  ^  0.765  MeV/nucleon 

c 


(606) 


(607) 


For  purposes  of  comparison  with  chemically  bound  systems  it  should  be  pointed 
out  that  the  shear  modulus  of  steel  is95 


G  .  %  4  x  109  N/m2 
steel 


(608) 


while  the  binding  energy  (electronegativity)  of  a  valence  electron  of  a  chem¬ 
ical  element  typically  has  a  value9 


e  .  'v  10  eV/electron 
chem 


(609) 


A comparison of equations (605), (606) and (609) shows that the average binding
energy per particle in a nuclear system is about 6 orders of magnitude larger
than the average binding energy per electron in an atomic system. However, a
comparison of equations (604) and (608) shows that the shear modulus of nuclear
matter at nuclear density is about 24 orders of magnitude larger than the shear
modulus of a chemically bound solid at its equilibrium density. The very large
difference in the values of the elastic moduli of nuclear and chemical systems is
due to the large value of the nuclear matter density, which enters directly into
the calculation of the bulk modulus as shown in equation (600). The values of
the nuclear binding energy, saturation density and bulk modulus actually depend
on the neutron excess of nuclear matter, but this is not considered in this
paper [68].
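The numbers in equations (603), (604) and the two order-of-magnitude comparisons can be checked with a short calculation. The sketch below is illustrative only: equations (597) and (600) are not reproduced in this section, so it assumes that (600) reduces to K = n k and that (597) is the standard isotropic-elasticity relation between G, K and Poisson's ratio with ν = 1/4. Both assumptions reproduce the quoted values, but they are inferences, not the author's stated formulas.

```python
import math

# Unit conversions (standard values, rounded as in the text).
MEV_TO_J = 1.602e-13      # joules per MeV
FM3_TO_M3 = 1.0e-45       # cubic metres per cubic femtometre

n_sat = 0.166             # nucleons/fm^3, equation (601)
k_inc = 250.0             # MeV/nucleon, equation (602)
nu = 0.25                 # Poisson's ratio adopted later in the paper

# ASSUMPTION: equation (600) reduces to K = n * k, and equation (597) is the
# isotropic-elasticity relation G = 3K(1 - 2nu) / [2(1 + nu)].
K_mev = n_sat * k_inc
G_mev = 3.0 * K_mev * (1.0 - 2.0 * nu) / (2.0 * (1.0 + nu))

to_si = MEV_TO_J / FM3_TO_M3          # converts MeV/fm^3 to N/m^2
K_si = K_mev * to_si
G_si = G_mev * to_si

G_steel = 4.0e9                       # N/m^2, equation (608)
orders_modulus = math.log10(G_si / G_steel)
orders_binding = math.log10(8.0e6 / 10.0)   # (606) expressed in eV, over (609)

print(f"K = {K_mev:.1f} MeV/fm^3 = {K_si:.3e} N/m^2")
print(f"G = {G_mev:.1f} MeV/fm^3 = {G_si:.3e} N/m^2")
print(f"modulus ratio: about 10^{orders_modulus:.1f}")
print(f"binding energy ratio: about 10^{orders_binding:.1f}")
```

The modulus ratio is dominated by the conversion factor from MeV/fm^3 to N/m^2, which is the numerical form of the statement that the nuclear density sets the scale of the moduli.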


The magnetic compliance coefficients for nuclear matter in a static magnetic
field can be calculated from equations (585), (591) and (604), and the
results are

$C^B_{\theta z} = 1.41 \times 10^{-14}$ rad/T $= 8.08 \times 10^{-13}$ deg/T  (610)

$C^H_{\theta z} = 1.78 \times 10^{-20}$ rad m sec/coul  (611)

where T = tesla. The magnetic shear stiffness coefficients can be calculated
from equations (586), (591) and (604) or more simply as the reciprocals of the
magnetic compliance coefficients given in equations (610) and (611) with the
result that

$K^B_{\theta z} = 7.07 \times 10^{13}$ T/rad $= 1.23 \times 10^{12}$ T/deg  (612)

$K^H_{\theta z} = 5.63 \times 10^{19}$ coul/(rad m sec)  (613)

The values of the magnetic compliance and magnetic shear stiffness coefficients
that have been calculated refer to a static magnetic field because the shear
modulus used in these calculations, and given by equation (604), is a measure
of the static deformation of nuclear matter in internal space under the
application of a static magnetic field as described by equation (582).
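The four coefficients in equations (610) through (613) follow from the shear modulus and the permeability alone. The sketch below assumes that the static stiffness has the form K = (μG)^(1/2), which is the static analogue of the dynamic definition given later in equation (625B), with μ taken as the vacuum permeability. Equations (585), (586) and (591) are not shown in this section, so this form is an inference that happens to reproduce the quoted numbers.

```python
import math

MU0 = 4.0e-7 * math.pi        # vacuum permeability, T m/A
G_STATIC = 3.98e33            # shear modulus of nuclear matter, N/m^2, equation (604)

# ASSUMPTION: static stiffness K = sqrt(mu * G), in tesla per radian.
K_B = math.sqrt(MU0 * G_STATIC)   # compare with equation (612)
K_H = K_B / MU0                   # H = B/mu, compare with equation (613)
C_B = 1.0 / K_B                   # compliance, compare with equation (610)
C_H = 1.0 / K_H                   # compliance, compare with equation (611)

# The same B-type coefficients expressed per degree instead of per radian.
K_B_per_deg = K_B * math.pi / 180.0
C_B_per_deg = C_B * 180.0 / math.pi

print(f"K_B = {K_B:.3e} T/rad = {K_B_per_deg:.3e} T/deg")
print(f"K_H = {K_H:.3e} coul/(rad m sec)")
print(f"C_B = {C_B:.3e} rad/T = {C_B_per_deg:.3e} deg/T")
print(f"C_H = {C_H:.3e} rad m sec/coul")
```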

The values of the static magnetic induction field required for clean fission
can be calculated from equations (587), (612) and the values of $\theta^F_z$ given
by equation (483). The values of $\theta^F_z$ and $B^F$ required for the clean fission of
some nuclei of interest appear in Table 1. From Table 1 it is obvious that
very large values, $B^F \approx 40$ teratesla, of the static magnetic induction field
are required to catalyze clean fission nuclear reactions in the subactinide
nuclei. These values of $B^F$ are much larger than any static magnetic field that
has been obtained in the laboratory. Typical static magnetic fields used in
particle physics experiments are about 4-6 T [98]. The highest static magnetic
field obtained in the laboratory is about 50 T [99,100]. Therefore the prediction
of static magnetically catalyzed clean fission of subactinide nuclei by
thermal neutrons cannot be verified in the laboratory with present day technology.
Explosive compression generated magnetic fields have values that exceed
several kilotesla but fall far short of the $10^{13}$ tesla field required for
practical clean fission reactions [101,104]. The huge values of the static magnetic
field required for the catalysis of clean fission of nuclei lighter than the
actinides suggest that it is impractical to design a clean fission nuclear
reactor that uses a static magnetic field as a fission catalyst. However, the
situation is not hopeless because it is shown in the following sections that
it may be possible to catalyze thermal neutron induced clean fission reactions
in the lighter elements by using γ rays.

C.  Electromagnetic  Wave  Resonance  Catalysis  of  Thermal  Neutron  Induced 
Clean  Fission  Reactions  for  Nuclei  Lighter  than  the  Actinides. 

The large static magnetic fields predicted to be required for clean fission
reactions of subactinide nuclei using thermal neutrons can be circumvented
by using high frequency time dependent electromagnetic fields. Nature is full
of surprises, and it will now be shown that the superfluid nature of nuclear
matter suggests that using electromagnetic waves (γ rays) that are tuned to a
critical frequency for each subactinide element to be fissioned requires a
magnetic vector component $B^F_\gamma$ of the order of magnitude $B^F_\gamma \approx 20$ kilotesla,
which is much smaller than the static field value $B^F \approx 40$ teratesla
and can be obtained in laboratory γ ray fluxes (Table 2).

Consider first a static field loading on an atomic nucleus. The spring
constant of an atomic nucleus can be calculated in terms of the shear modulus,
Poisson's ratio and radius of the nuclear matter in a nucleus. A cylinder of
nuclear matter of length h and radius R has a spring constant given by [95,105]

$k = E \pi R^2/h$  (614)

where k = spring constant, E = Young's modulus, R = radius of cylinder and h =
length of cylinder. The Young's modulus can be expressed in terms of the shear
modulus and Poisson's ratio as follows [95]

$E = 2(1 + \nu) G$  (615)

where ν = Poisson's ratio. Now consider a cylinder whose length is taken
according to the condition that the volume of the cylinder is equal to the volume
of a spherical nucleus of the same radius

$\pi R^2 h = \tfrac{4}{3} \pi R^3$  (616)

which gives

$h = \tfrac{4}{3} R$  (617)

Combining equations (614), (615) and (617) gives the spring constant of an atomic
nucleus as

$k = \tfrac{3}{2} \pi (1 + \nu) G R$  (618)

Combining equations (282) and (618) gives

$k = \tfrac{3}{2} \pi b (1 + \nu) G A^{1/3}$  (619)

where the effect of the internal phase angle $\theta_a$ is neglected in equation (285),
and where b = constant given in equation (282). Values of k for various nuclei
appear in Table 2.
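The chain from equation (614) to equation (619) can be checked numerically. The sketch below evaluates the cylinder formula step by step and compares it with the closed form, using the values G = 3.98 × 10^33 N/m^2, ν = 1/4 and b = 1.2 fm adopted in this paper. The function name and the mass numbers in the loop are illustrative choices and are not taken from Table 2.

```python
import math

def nuclear_spring_constant(A, G=3.98e33, nu=0.25, b=1.2e-15):
    """Return (k from the cylinder model, k from the closed form (619)) in N/m."""
    R = b * A ** (1.0 / 3.0)              # nuclear radius, R = b A^(1/3)
    h = 4.0 * R / 3.0                     # equal-volume cylinder length, equation (617)
    E = 2.0 * (1.0 + nu) * G              # Young's modulus, equation (615)
    k_cylinder = E * math.pi * R ** 2 / h # equation (614)
    k_closed = 1.5 * math.pi * b * (1.0 + nu) * G * A ** (1.0 / 3.0)  # equation (619)
    return k_cylinder, k_closed

# Illustrative mass numbers only (hypothetical sample, not the paper's table).
for A in (59, 90, 208):
    k_cyl, k_closed = nuclear_spring_constant(A)
    print(f"A = {A:3d}: k = {k_closed:.3e} N/m  (cylinder form {k_cyl:.3e})")
```

With these inputs the coefficient of A^(1/3) comes out near 28.1 × 10^18 N/m.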

For a damped vibrating spring the impedance is given by

$S = [(k - M_{\mathrm{eff}} \omega^2)^2 + C^2 \omega^2]^{1/2}$  (620)

where S = impedance, ω = circular frequency of vibration of a nucleus, $M_{\mathrm{eff}}$ =
effective dynamic mass of a nucleus, and C = damping constant. The effective
mass of a nucleus is given by

$M_{\mathrm{eff}} = \tfrac{1}{3} m_{av} A = \tfrac{1}{3}(Z m_p + N m_n - B/c^2)$  (621)

where the factor 1/3 enters equation (621) because the mass is distributed
throughout the nucleus so that the spring itself is massive [105]. The average
nucleon mass that appears in equation (621) is given by

$m_{av} = (Z/A) m_p + (N/A) m_n - \epsilon/c^2$  (622)

$\approx \tfrac{1}{2}(m_p + m_n) - \epsilon/c^2$

$\approx \tfrac{1}{2}(m_p + m_n) \approx m_p \approx m_n$  (623)


where c = light speed, ε = average binding energy per nucleon given by equation
(291), $m_p$ = proton mass and $m_n$ = neutron mass. The dynamics calculation done
in this section neglects the effects of the binding energy on the nuclear mass
and uses equation (623) for the average mass per nucleon. The approximation
that $m_{av}$ = constant makes the effective mass in equation (621) a linear function
of the atomic mass number and leads to simple approximate formulas for the
resonance frequency and energy. The damping constant that appears in equation (620)
is given by

$C = \eta R = \eta b A^{1/3}$  (624)


where η = viscosity of nuclear matter. The value of the viscosity of nuclear
matter is obtained by assuming that it has the same value as the viscosity of
superfluid $^3$He, namely $\eta \approx 2 \times 10^{-7}$ kg/(m sec) [106]. The dimensions of the
physical quantities that appear in equations (614) through (624) are given by

[R] = m
[b] = m
[G] = N/m$^2$ = kg/(m sec$^2$)
[k] = N/m = kg/sec$^2$
[S] = N/m = kg/sec$^2$
[$M_{\mathrm{eff}}$] = [$m_{av}$] = [$m_p$] = [$m_n$] = kg
[C] = N sec/m = kg/sec
[η] = kg/(m sec)
[ω] = Hz = sec$^{-1}$

The impedance function given by equation (620) has a local minimum value at a
frequency which corresponds to the maximum vibration amplitude, and satisfies
$S \to k$ when $\omega \to 0$.
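The two limiting properties of the impedance just stated can be illustrated numerically. The sketch below evaluates equation (620) as a function of the dimensionless ratio x = ω/ω0, where ω0 = (k/M_eff)^(1/2) is the undamped natural frequency of the simple spring model (no model correction factor is applied here). The rewriting in terms of x is a numerical convenience and not part of the original derivation: near resonance, k and M_eff ω^2 are both of order 10^20 N/m while Cω is of order 10^1 N/m, so forming their difference directly in floating point would lose the damping term to rounding. The sample mass number A = 59 is an illustrative choice.

```python
import math

def impedance(x, k, M_eff, C):
    """Impedance S of equation (620) at omega = x * omega0, omega0 = sqrt(k/M_eff)."""
    omega0 = math.sqrt(k / M_eff)
    reactive = k * (1.0 - x * x)     # equals k - M_eff*omega^2, without cancellation error
    resistive = C * x * omega0       # damping term C*omega
    return math.hypot(reactive, resistive)

# Illustrative inputs built from the constants used in this paper.
A = 59
k = 28.10e18 * A ** (1.0 / 3.0)           # N/m
M_eff = 1.6738e-27 * A / 3.0              # kg, equation (621) with equation (623)
C = 2.0e-7 * 1.2e-15 * A ** (1.0 / 3.0)   # kg/sec, equation (624)

for x in (0.0, 0.5, 0.99, 1.0, 1.01, 2.0):
    print(f"omega/omega0 = {x:4.2f}:  S = {impedance(x, k, M_eff, C):.3e} N/m")
```

At x = 0 the function returns k, and at x = 1 only the damping term Cω0 survives, which is why the minimum is so deep when the viscosity is small.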

The simplest way of incorporating the dynamical response of a nucleus to
electromagnetic waves into the formalism given by equation (587) for calculating
the magnetic induction field required for the clean fission of the subactinides
is to introduce an effective dynamic shear modulus so that in the presence of
electromagnetic waves the definition of a dynamic magnetic stiffness modulus
for an atomic nucleus is given in analogy to equations (582) and (586) by

$\tfrac{1}{2} G^\gamma \theta_z^2 = B'^2/(2\mu) + \tfrac{1}{2} \epsilon E'^2 = B^2/(2\mu) = (K^{B\gamma}_{\theta z})^2 \theta_z^2/(2\mu)$  (625A)

$K^{B\gamma}_{\theta z} = (\mu G^\gamma)^{1/2}$ tesla/deg  (625B)

where $G^\gamma$ = effective dynamic shear modulus which is obtained from equations
(618) and (620) as

$G^\gamma = \tfrac{2}{3\pi} (1 + \nu)^{-1} S/R$ N/m$^2$  (626)


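In the limit of $\omega \to 0$ it follows from equations (620) and (626) that $G^\gamma \to G$,
so that $G^\gamma$ is an effective dynamic shear modulus which has the correct static limit.
The ratio of interest for clean fission nuclear reactor design is therefore

$\zeta = K^{B\gamma}_{\theta z}/K^B_{\theta z} = (G^\gamma/G)^{1/2} = B^F_\gamma/B^F = [(B^{F\prime}_\gamma)^2 + \epsilon\mu (E^{F\prime}_\gamma)^2]^{1/2}/B^F$  (627)

where $G^\gamma$ is a function of several parameters, $G^\gamma = G^\gamma(G, \nu, \omega, C, b, A)$. At low
frequencies $\zeta \approx 1$. Here $B^{F\prime}_\gamma \approx B^F_\gamma/\sqrt{2}$ and $E^{F\prime}_\gamma \approx E^F_\gamma/\sqrt{2} = B^F_\gamma/(2\epsilon\mu)^{1/2}$ are the
radiation field components required for fission.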


The impedance function S for an atomic nucleus has a frequency dependence
shown in Figure 2. The ratio ζ of the magnetic stiffness coefficients given in
equation (627) also has this frequency dependence and is shown in Figure 3. The
impedance S has a minimum value at the giant dipole resonance frequency, and at
this frequency the dynamic magnetic stiffness coefficient $K^{B\gamma}_{\theta z}$ and the ratio ζ
have much smaller values than their corresponding static values. The reason for
this can be discerned from the impedance equation (620) which shows that the
giant dipole resonance frequency is given by [105]

$\omega_r = \alpha_c (k/M_{\mathrm{eff}})^{1/2}$,  $f_r = \omega_r/(2\pi)$  (628)


where $\alpha_c$ = model correction factor which is introduced to account for the fact
that a nucleus is not really described by a simple linear spring model. In
this paper the correction factor is chosen to have the value $\alpha_c = 0.5412$, but
a different choice for the value of the shear modulus of nuclear matter will
yield a different value for $\alpha_c$ in order to agree with measured giant dipole
resonance frequencies. Combining equations (619), (620) and (628) gives

$f_r = \alpha_c (2\pi)^{-1} [\tfrac{9}{2} \pi b (1 + \nu) G/m_{av}]^{1/2} A^{-1/3}$ Hz  (629)


where $m_{av}$ is given by equation (623). For atomic nuclei the giant dipole resonant
frequency is in the γ ray region of the electromagnetic spectrum. The
value of the impedance at the giant dipole resonance frequency is obtained from
equations (620), (624) and (629) to be

$S_r = C \omega_r = \alpha_c C (k/M_{\mathrm{eff}})^{1/2}$ N/m  (630)

$= \alpha_c \eta b [\tfrac{9}{2} \pi b (1 + \nu) G/m_{av}]^{1/2}$

and is seen to be essentially independent of the atomic mass number A because
equation (623) shows that $m_{av}$ is roughly independent of A. The damping constant
C given by equation (624) was used to obtain the result in equation (630).
The value of the dynamic shear modulus at the giant dipole resonance frequency
is obtained from equations (282), (626) and (630) to be


$G^\gamma_r = \tfrac{2}{3\pi} (1 + \nu)^{-1} S_r/R$ N/m$^2$  (631)

$= \tfrac{2}{3\pi} (1 + \nu)^{-1} \alpha_c \eta [\tfrac{9}{2} \pi b (1 + \nu) G/m_{av}]^{1/2} A^{-1/3}$

$= \sigma^2 G A^{-1/3}$

where the dimensionless number σ is essentially independent of A and is given by

$\sigma = (G^\gamma_r A^{1/3}/G)^{1/2}$  (632)

$= [\tfrac{2}{3\pi} (1 + \nu)^{-1} S_r/(G b)]^{1/2}$

$= \{2 b \eta^2 \alpha_c^2/[\pi m_{av} G (1 + \nu)]\}^{1/4}$

$= [4 b \eta^2 \alpha_c^2/(\pi m_{av} E)]^{1/4}$

where b is defined in equation (282). The dimensionless parameter σ has the
value $\sigma = 1.018 \times 10^{-9}$. Then equation (631) gives


$G^\gamma_r/G = \sigma^2 A^{-1/3} = S_r/k$  (633)

and combining equations (627) and (633) gives the following relationship which
is valid at the giant dipole resonance frequency

$\zeta_r = [K^{B\gamma}_{\theta z}/K^B_{\theta z}]_r = (G^\gamma_r/G)^{1/2} = (S_r/k)^{1/2} = \sigma A^{-1/6}$  (634)
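The resonance quantities of equations (629) through (634) can be evaluated directly from the constants adopted in this paper. The sketch below computes the A-independent coefficient of the resonance frequency, the resonance impedance, and σ by two of the equivalent expressions in equation (632). Variable and function names are illustrative; the numerical inputs are the paper's values of α_c, η, b, m_av, G and ν.

```python
import math

ALPHA_C = 0.5412       # model correction factor
ETA = 2.0e-7           # viscosity of nuclear matter, kg/(m sec)
B_RADIUS = 1.2e-15     # radius constant b, m
M_AV = 1.6738e-27      # average nucleon mass, kg
G = 3.98e33            # static shear modulus, N/m^2
NU = 0.25              # Poisson's ratio

# [(9/2) pi b (1 + nu) G / m_av]^(1/2), equal to (k/M_eff)^(1/2) * A^(1/3).
root = math.sqrt(4.5 * math.pi * B_RADIUS * (1.0 + NU) * G / M_AV)

f_coeff = ALPHA_C * root / (2.0 * math.pi)   # f_r = f_coeff * A^(-1/3), equation (629)
S_r = ALPHA_C * ETA * B_RADIUS * root        # equation (630), independent of A

# Two equivalent forms of sigma from equation (632).
sigma_from_S = math.sqrt(2.0 / (3.0 * math.pi) / (1.0 + NU) * S_r / (G * B_RADIUS))
sigma_closed = (2.0 * B_RADIUS * ETA ** 2 * ALPHA_C ** 2
                / (math.pi * M_AV * G * (1.0 + NU))) ** 0.25

def zeta_r(A):
    """Ratio of dynamic to static magnetic stiffness at resonance, equation (634)."""
    return sigma_closed * A ** (-1.0 / 6.0)

print(f"f_r   = {f_coeff:.4e} * A^(-1/3) Hz")
print(f"S_r   = {S_r:.2f} N/m")
print(f"sigma = {sigma_closed:.4e} (closed form), {sigma_from_S:.4e} (from S_r)")
```

Because σ scales as η^(1/2), the nine orders of magnitude between the static and resonant fields trace back to the assumed superfluid viscosity.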


The values of $\zeta_r$ for various nuclei appear in Table 2. From equation (627) it
follows that the dynamic magnetic stiffness coefficient at the giant dipole
resonance frequency is given by

$K^{B\gamma}_{\theta z} = \zeta_r K^B_{\theta z} = \sigma A^{-1/6} K^B_{\theta z}$ tesla/deg  (635)

The dimensionless number σ is still a function of G. The values of $K^{B\gamma}_{\theta z}$ at the
giant dipole resonance frequency appear in Table 2 for various atomic nuclei,
and Table 2 shows that the values of $K^{B\gamma}_{\theta z}$ are significantly smaller than the
value of $K^B_{\theta z}$ given in equation (612).

For electromagnetic waves that are tuned to the γ ray frequencies of the
giant dipole resonance frequencies of the subactinide nuclei, the required
magnetic induction field for the clean fission of subactinide nuclei using thermal
neutrons is calculated in analogy to equation (587) as

$B^F_\gamma = K^{B\gamma}_{\theta z} \theta^F_z = (\mu G^\gamma_r)^{1/2} \theta^F_z$ tesla  (636)

$= \sigma A^{-1/6} K^B_{\theta z} \theta^F_z = \sigma A^{-1/6} (\mu G)^{1/2} \theta^F_z$

$= \sigma A^{-1/6} B^F$

where $\zeta_r$ is given by equation (634). The values of $B^F_\gamma$ for selected nuclei are
given in Table 1 and are seen to be much smaller than the values of $B^F$ which
also appear in Table 1. The corresponding magnetic field strength of the resonance
electromagnetic wave required for clean fission is given by

$H^F_\gamma = B^F_\gamma/\mu = (G^\gamma_r/\mu)^{1/2} \theta^F_z = \sigma A^{-1/6} H^F$ amp/m  (637)

$= [(H^{F\prime}_\gamma)^2 + (\epsilon/\mu)(E^{F\prime}_\gamma)^2]^{1/2}$


where $\mu \approx \mu_0$ = magnetic permeability of nuclear matter. The corresponding
values of the electric field of the resonance electromagnetic waves required for
the clean fission of subactinide nuclei are given by [91,92]

$E^F_\gamma = (\mu_0/\epsilon_0)^{1/2} H^F_\gamma = 120\pi H^F_\gamma$ volts/m  (638)

where $H^{F\prime}_\gamma \approx H^F_\gamma/\sqrt{2} = B^F_\gamma/(\sqrt{2}\mu)$ and $E^{F\prime}_\gamma \approx E^F_\gamma/\sqrt{2}$ give the radiation field components.
The power density of the resonance electromagnetic waves that are required for
the clean fission of subactinide nuclei using thermal neutrons is [91,92]

$P^F_\gamma = \tfrac{1}{2} (\mu_0/\epsilon_0)^{1/2} (H^F_\gamma)^2$ W/m$^2$  (639)

$= \tfrac{1}{2} (120\pi)(B^F_\gamma/\mu_0)^2$

$= \tfrac{1}{2} (120\pi)(\sigma^2 G/\mu_0) A^{-1/3} (\theta^F_z)^2$

where $\theta^F_z$ = critical internal angle of the complex atomic number given by
equation (483) and corresponding to incipient fission of a subactinide nucleus.

The γ photon energy required to catalyze clean fission of the subactinides can
be obtained from equation (629) to be

$\epsilon^F_\gamma = h \nu^F_\gamma = h f_r$ MeV/photon  (640)

where h = Planck's constant and has the value [106]

$h = 6.626 \times 10^{-34}$ Joule sec  (641)

$= 4.136 \times 10^{-21}$ MeV sec


The γ ray flux density required to catalyze clean fission of subactinide nuclei
using thermal neutrons is obtained from the γ ray power density given in
equation (639) as follows

$\Phi^F_\gamma = P^F_\gamma/\epsilon^F_\gamma$ photons/(sec m$^2$)  (642)

The γ ray photon number density required for clean fission is obtained from the
flux density given in equation (642) by

$n^F_\gamma = \Phi^F_\gamma/c$ photons/m$^3$  (643)

where c = light speed given by [106]

$c = 2.9979 \times 10^{8}$ m/sec  (644)

The quantities $H^F_\gamma$, $E^F_\gamma$, $P^F_\gamma$, $\epsilon^F_\gamma$, $\Phi^F_\gamma$ and $n^F_\gamma$ evaluated at the giant dipole resonance
frequency appear in Table 3.
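The sequence of equations (636) through (643) is a straight chain of substitutions, so it can be sketched as a single function. The critical angle θ^F_z comes from equation (483), which is not reproduced in this section; the value 0.566 rad used below is a hypothetical input chosen only because it gives a static field near the 40 teratesla scale quoted in the text. The mass number A = 59, the function name and the dictionary keys are likewise illustrative, and the static field is assumed to follow B^F = (μG)^(1/2) θ^F_z with the angle in radians.

```python
import math

MU0 = 4.0e-7 * math.pi     # T m/A
Z0 = 120.0 * math.pi       # (mu0/eps0)^(1/2) in ohms, as used in equation (638)
G = 3.98e33                # N/m^2
SIGMA = 1.018e-9           # equation (649)
F_COEFF = 19.34e21         # Hz, coefficient of A^(-1/3) in equation (652)
H_MEV_S = 4.136e-21        # Planck's constant, MeV sec, equation (641)
MEV_TO_J = 1.602e-13       # joules per MeV
C_LIGHT = 2.9979e8         # m/sec, equation (644)

def gamma_requirements(A, theta_f):
    """Resonant gamma ray field requirements for mass number A and angle theta_f (rad)."""
    B_static = math.sqrt(MU0 * G) * theta_f           # static field B^F (assumed form)
    B_gamma = SIGMA * A ** (-1.0 / 6.0) * B_static    # equation (636)
    H_gamma = B_gamma / MU0                           # equation (637)
    E_gamma = Z0 * H_gamma                            # equation (638)
    P_gamma = 0.5 * Z0 * H_gamma ** 2                 # equation (639), W/m^2
    eps_mev = H_MEV_S * F_COEFF * A ** (-1.0 / 3.0)   # equation (640), MeV/photon
    flux = P_gamma / (eps_mev * MEV_TO_J)             # equation (642), photons/(sec m^2)
    density = flux / C_LIGHT                          # equation (643), photons/m^3
    return {"B_static": B_static, "B_gamma": B_gamma, "H_gamma": H_gamma,
            "E_gamma": E_gamma, "P_gamma": P_gamma, "eps_mev": eps_mev,
            "flux": flux, "density": density}

# Hypothetical example: A = 59 with an assumed critical angle of 0.566 rad.
for name, value in gamma_requirements(59, 0.566).items():
    print(f"{name:9s} = {value:.3e}")
```

For these inputs the resonant field comes out near 20 kilotesla against a static field near 40 teratesla, which matches the orders of magnitude stated at the start of this subsection.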


The mechanical and electromagnetic quantities that are relevant to the clean
fission of subactinide nuclei depend on the value chosen for Poisson's ratio as,
for example, in equation (632). The value of Poisson's ratio for nuclear matter
is not known, but it is generally in the range of values 0 < ν < 0.5 [95].
Negative values of Poisson's ratio are possible but only in very exotic materials
and will not be considered here. Because the physical quantities calculated in
this section are not particularly sensitive to the choice of value for Poisson's
ratio, the choice ν = 1/4 is made for the numerical calculations. Then the
elastic properties of nuclear matter are taken from equation (604) to be

$G = 3.98 \times 10^{33}$ kg/(m sec$^2$),  $\nu = 1/4$  (645)


Nuclear matter is a viscous superfluid whose viscosity is assumed to have the
same value as that of superfluid liquid $^3$He [106]. Then the following values of
nuclear properties are used to produce Tables 1 through 3:

$m_{av} = 1.6738 \times 10^{-27}$ kg  (646)

$b = 1.2$ fm  (647)

$\eta = 2.0 \times 10^{-7}$ kg m$^{-1}$ sec$^{-1}$  (648)

$\sigma = 1.018 \times 10^{-9}$  (649)

$S_r = 29.17$ N m$^{-1}$  (650)

$k = 28.10 \times 10^{18} A^{1/3}$ N m$^{-1}$  (651)

$f_r = 19.34 \times 10^{21} A^{-1/3}$ Hz  (652)

$\lambda_r = 15.50 A^{1/3}$ fm  (653)

$\epsilon^F_\gamma = 80.0 A^{-1/3}$ MeV/photon  (654)

Values of k, $f_r$ and $\lambda_r$ appear in Table 2, while $\epsilon^F_\gamma$ appears in Table 3. The
γ ray photon number density $n^F_\gamma$, flux density $\Phi^F_\gamma$ and power density $P^F_\gamma$ required
for fission depend on Z and A through the fission angle $\theta^F_z(Z,A)$ as shown in
equation (639). The value of the impedance at resonance $S_r$ is independent of
A. Note that k is 18 orders of magnitude larger than $S_r$, which corresponds to
the fact that $B^F$ is 9 orders of magnitude larger than $B^F_\gamma$ as seen from Table 1.
The relatively low values of $S_r$ and $B^F_\gamma$ are due to the fact that at the resonance
frequency it is the small value of the superfluid viscosity of nuclear matter
that determines the value of the impedance.

Table 1 gives $Z^2/A$, the fissility parameter x, the internal phase angle of
the proton number $\theta^F_z$ required for the clean fission of nuclei by thermal
neutrons, the static magnetic induction field $B^F$ required to catalyze clean fission
of nuclei with thermal neutrons, and the dynamic magnetic induction field $B^F_\gamma$ of
γ rays at the giant dipole resonance frequency which is required to catalyze
clean fission of nuclei by thermal neutrons. Table 2 gives the ratio $\zeta_r$ of the
dynamic magnetic induction field to the static magnetic induction field required
for clean fission, the dynamic magnetic stiffness coefficient $K^{B\gamma}_{\theta z}$ for γ rays at
the giant dipole resonance frequency, the nuclear spring constant k, the giant
dipole resonance frequency $f_r$, and the wavelength $\lambda_r$ of γ rays corresponding to
the giant dipole resonance frequency. Table 3 gives the dynamic magnetic field
$H^F_\gamma$ of γ rays at the giant dipole resonance frequency which is required to
catalyze clean fission in atomic nuclei using thermal neutrons, the dynamic electric
field strength $E^F_\gamma$ of γ rays at the giant dipole resonance frequency required to
catalyze clean fission in nuclei using thermal neutrons, the resonant power density $P^F_\gamma$
of γ rays required for the clean fission of nuclei utilizing low energy thermal
neutrons, the resonant γ ray energy $\epsilon^F_\gamma$ required for catalysis of clean fission,
the resonant γ ray flux $\Phi^F_\gamma$ required for clean fission catalysis, and the
resonant γ ray photon number density $n^F_\gamma$ required to catalyze clean fission in atomic
nuclei using low energy thermal neutrons.

For clean fission nuclear reactions of the subactinide nuclei using thermal
neutrons, the γ rays are used only to bring the internal phase angle of the atomic
number $\theta_z$ up to its critical value $\theta^F_z$ required for fission as described in
Section 5. The very small value of the viscosity of nuclear matter gives a small
value to the parameter σ that is defined for the resonance frequency condition
by equations (631), (632) and (649). The small value of σ when used in
equations (636) through (639) gives the relatively low values for the electric and
magnetic fields, power density and flux of the resonant γ rays that are required
for the clean fission of subactinide nuclei. Therefore immersing subactinide
nuclei in a bath of γ rays that are tuned to the giant dipole resonance frequency of
the nuclei requires only a relatively low power density of γ rays to catalyze
clean fission using thermal neutrons. The low γ ray intensity (associated
with $G^\gamma_r$ and $S_r$ of the nuclei) required for clean fission at the giant dipole
resonance frequency compared to the high values of the static magnetic field
(associated with G and k of the nuclei) required for fission in a static field
is related to the behavior of the nuclear impedance S given by equation (620)
and shown in Figure 2, which indicates that the impedance has a deep minimum at
the giant dipole resonance frequency. The γ ray intensity required for clean
fission has a minimum value when the energy of the incident γ rays corresponds
to the giant dipole resonance frequency of the subactinide element that is
chosen as a nuclear fuel. Thus thermal neutron induced fission in the
subactinides is catalyzed in the simplest way by γ rays tuned to the giant dipole
resonance frequency of nuclei as given by equations (652) and (653), and whose
intensity is determined by the critical condition $\theta_z = \theta^F_z$ as in equations (483)
and (636) through (639).

D. γ Ray Catalyzed Low Energy Clean Fission Nuclear Reactions.

Several examples of γ ray catalyzed clean fission reactions induced by
thermal neutrons are now considered. The average incident thermal neutron
energy is 0.025 eV.

Intermediate  Weight  Nuclei. 

Examples  of  clean  fission  reactions  whose  fission  products  are  low  level 


beta  emitters  are 

Y  +  Jn  +  ” Co  -  2jNe  +  J7C1  +  2  Jn  +  y  (655) 

Y  +  Jn  +  2gNi  +  170  +  ^Ca  +  2  Jn  +  y  (656) 

Y  +  Jn  +  ”Cu  -  2 gF  +  ^Ca  +  2  Jn  +  y  (657) 

y  +  Jn  +  8JZn  -  JjNe  +  “Ca  +  2  Jn  +  y  (658) 

y  +  Jn  +  l\As  +  27Mg  +  2 iSc  +  2  Jn  +  y  (659) 


These fission reactions produce low level radionuclides as waste products.
Examples of perfectly clean fission reactions using thermal neutrons and γ ray
catalysis are

Y  +  Jn  +  27C0  -  JjNe  +  JJci  +  3  Jn  +  y  (660) 

Y  +  Jn  +  88Ni  -  “O  +  l°QCa  +  3  Jn  +  y  (661) 


The  fission  products  of  these  reactions  are  not  radioactive  and  can  be  used 
for  the  development  of  clean  fission  nuclear  reactors. 

Heavy  Nuclei. 

The following are examples of γ ray catalyzed fission reactions using
thermal neutrons. The fission products can be relatively low level radionuclides,
as in the case when 87Rb and 88Sr are fissioned, but are high level
radionuclides when the heavy elements are fissioned, as in the case of
238U and 232Th


y  +  Jn  +  8  7Rb  -  ”0  +  ”Cu  +  2  Jn  +  y  (662) 

Y  +  Jn  +  3®Sr  -  28F  +  8  JCu  +  2  Jn  +  y  (663) 
y  +  0ln  +  !o°Zr  i2oNe  +  3 3 Zn  +  2  on  +  y  (664)

Y  +  Jn  +  aJJl  -  J*C1  +  ®67Kr  +  2  Jn  +  Y  (665) 

Y  +  Jn  +  Xs?La  ♦  la71  +  37Kb  +  2  Jn  +  y  (666) 

Y  +  Jn  +  1| JSm  ->  6JCo  +  J|Br  +  2  Jn  +  y  (667) 

Y  +  Jn  +  1®jHf  ♦  JjBr  +  *jRb  +  2  Jn  +  y  (668) 

Y  +  Jn  +  l8jTa  +  ®|Br  +  ®jRb  +  2  Jn  +  y  (669) 

Y  +  Jn  +  2J®Pb  -  JjKr  +  “JPd  +  2  Jn  +  y  (670) 

Y  +  Jn  +  2  J  jTh  -  J^Ru  +  1 | 3I  +  2  Jn  +  y  (671) 

Y  +  Jn  +  2J8U  ■*  JJSr  +  “Jxe  +  2  Jn  +  y  (672) 


These fission reactions would not occur using thermal neutrons without the
catalytic effects of the resonant γ rays.

Nuclear fission is a statistical process of penetration through a fission
barrier, and many other fission reactions are possible. For a sustained fission
chain reaction at least two neutrons must be produced by the fission reactions.
For both intermediate and heavy nuclei generally at least one of the fission
product nuclei of these reactions is beta unstable for clean fission reactions.
Equations (660) and (661) are exceptional in that the fission products are not
radioactive. Because the fission product nuclei of intermediate weight elements
are relatively light weight nuclei close to the valley of beta stability, the
intensity of beta, alpha and neutron radioactivity of the fission products is
low level for the γ ray catalyzed fission of intermediate weight elements using
thermal neutrons as, for example, in equations (655) through (659). Equations
(662) through (672) show that the γ ray catalyzed fission reactions of heavy
nuclei using thermal neutrons yield fission products that are relatively high
level beta, alpha and neutron emitters because the fission products generally
have large neutron excesses. Therefore, for the development of clean fission
nuclear reactors, fuel elements only slightly heavier than 26Fe must be used as
shown in equations (655) through (661).

E.  Photonuclear  Reactions  and  Clean  Fission  for  Subactinide  Nuclei. 

This section considers the form of the electric dipole sum rule for γ ray
catalyzed thermal neutron induced fission reactions in nuclei whose fissility
parameters satisfy x < 1. Photonuclear reactions are induced by γ ray photons
interacting with atomic nuclei. For low energy photons the process in its
simplest form is due to resonance fluorescence which occurs when photon absorption
by a nucleus is followed by photon emission as the nucleus decays to the ground
state. Nucleon emission and nuclear fission can occur for higher energies of
the incident γ rays. At photon energies of about 12-20 MeV the giant dipole
resonance is excited [60-67,108].

The conventional electric dipole sum rule for photonuclear reactions is
written in the standard incoherent spacetime form as follows [60-67,108]

$G_{\mathrm{inc}} = \int_0 \sigma\, d\epsilon = g ZN/A$  (673)


where σ = photonuclear reaction cross section for incoherent spacetime, Z, N
and A = atomic number, neutron number and atomic mass number of the target
nucleus, and where [108]

$g = 2\pi^2 e^2 \hbar/(m_{av} c) \approx 0.06$ MeV b  (674)

where the integral is taken over photon energies up to 30 MeV. The concept of
the broken symmetry forms of the atomic number, neutron number and atomic mass
number suggests that a complex number generalization of the photonuclear
reaction sum rule should be written as

$\bar{G} = \int \bar{\sigma}\, d\bar{\epsilon} = g \bar{z}\bar{n}/\bar{a}$  (675)


where $\bar{\sigma}$ = complex number photonuclear reaction cross section, $\bar{G}$ = complex
number integrated photonuclear cross section, and $\bar{z}$, $\bar{n}$ and $\bar{a}$ = complex number
atomic number, neutron number and atomic mass number respectively which are
given by equations (40) through (42). The complex numbers $\bar{\sigma}$ and $\bar{G}$ can be
represented as

$\bar{\sigma} = \sigma \exp(j\theta_\sigma)$,  $\bar{G} = G \exp(j\theta_G)$  (676)

which are complex numbers in an internal space. Equation (675) can be written as

$G = g z n/a = g (ZN/A) \cos\theta_z \cos\theta_n/\cos\theta_a$  (677)

$\theta_G = \theta_z + \theta_n - \theta_a$  (678)
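The constant g of equation (674) and the magnitude and phase relations of equations (677) and (678) can be evaluated numerically. The sketch below rewrites g as 2π² α (ħc)² / (m_av c²), using the Gaussian-unit identity e² = α ħ c, and takes m_av c² as the mean of the proton and neutron rest energies. The function name, the sample nucleus (Z = 27, N = 32) and the sample angle are illustrative assumptions, not values from the paper's tables.

```python
import math

ALPHA = 1.0 / 137.036     # fine-structure constant
HBAR_C = 197.327          # MeV fm
M_AV_C2 = 938.92          # MeV, mean of proton and neutron rest energies
FM2_PER_BARN = 100.0      # 1 barn = 100 fm^2

# g = 2 pi^2 e^2 hbar / (m_av c), expressed in MeV b, equation (674).
g = 2.0 * math.pi ** 2 * ALPHA * HBAR_C ** 2 / M_AV_C2 / FM2_PER_BARN

def integrated_cross_section(Z, N, theta_z, theta_n, theta_a):
    """Return (G, theta_G) from equations (677) and (678); angles in radians."""
    A = Z + N
    G_mag = g * Z * N / A * math.cos(theta_z) * math.cos(theta_n) / math.cos(theta_a)
    theta_G = theta_z + theta_n - theta_a
    return G_mag, theta_G

print(f"g = {g:.4f} MeV b")
# Incoherent limit: all internal phase angles zero reproduces equation (673).
print(integrated_cross_section(27, 32, 0.0, 0.0, 0.0))
# Equal internal phase angles (hypothetical value 0.3 rad).
print(integrated_cross_section(27, 32, 0.3, 0.3, 0.3))
```

With all three angles equal, the cosine factors reduce to a single cos θ_a and the phase θ_G equals θ_a.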


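Equation (675) can also be written as

$\bar{G} = \int_0^\infty \sigma \sec\beta_{\epsilon\epsilon} \exp[j(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})]\, d\epsilon$  (679)

$= \int_0^{\pi/6} \sigma \epsilon \csc\beta_{\epsilon\epsilon} \exp[j(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})]\, d\theta_\epsilon$  (680)

where the complex number photon energy is written as

$\bar{\epsilon} = \epsilon \exp(j\theta_\epsilon)$  (681)

and where

$\tan\beta_{\epsilon\epsilon} = \epsilon\, \partial\theta_\epsilon/\partial\epsilon$  (682)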


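The component forms of equations (679) and (680) are written as

$G \cos\theta_G = \int_0^\infty \sigma \sec\beta_{\epsilon\epsilon} \cos(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})\, d\epsilon$  (683)

$= \int_0^{\pi/6} \sigma \epsilon \csc\beta_{\epsilon\epsilon} \cos(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})\, d\theta_\epsilon$  (684)

$G \sin\theta_G = \int_0^\infty \sigma \sec\beta_{\epsilon\epsilon} \sin(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})\, d\epsilon$  (685)

$= \int_0^{\pi/6} \sigma \epsilon \csc\beta_{\epsilon\epsilon} \sin(\theta_\sigma + \theta_\epsilon + \beta_{\epsilon\epsilon})\, d\theta_\epsilon$  (686)

which are generally valid equations. From equations (677) and (678) it follows that

$G \cos\theta_G = g (ZN/A) \cos\theta_z \cos\theta_n \cos(\theta_z + \theta_n - \theta_a)/\cos\theta_a$  (687)

$G \sin\theta_G = g (ZN/A) \cos\theta_z \cos\theta_n \sin(\theta_z + \theta_n - \theta_a)/\cos\theta_a$  (688)

Generally $G \cos\theta_G > 0$ if Z > 0, N > 0 and A > 0.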

The upper integration limit of π/6 in equations (680), (684) and (686)
arises from the conservation of momentum for the photon-nucleon interaction
which can be written as

$\bar{\epsilon}/c = h\bar{\nu}/c = m\bar{v}$  (689)

where the complex number photon frequency $\bar{\nu}$ and the complex number nucleon
velocity $\bar{v}$ are written as

$\bar{\nu} = \nu \exp(j\theta_\nu)$,  $\bar{v} = v \exp(j\theta_v)$  (690)

Equations (689) and (690) can be written as

$\epsilon/c = h\nu/c = mv$,  $\theta_\epsilon = \theta_\nu = \theta_v = \theta_x - \theta_t$  (691)

which are valid for the inelastic photonuclear reaction. For coherent spacetime
$\theta_x = \pi/3$, $\theta_t = \pi/6$ and $\theta_v = \pi/6$, so that $\theta_\epsilon = \pi/6$ for inelastic
photonuclear reactions in coherent spacetime. It should be noted that for elastic
collisions of photons with particles, the phase angle relation is $\theta_\epsilon = 2\theta_v$, with
the result that $\theta_\epsilon = \pi/3$ for elastic collisions in coherent spacetime.

If the photon energy is taken to be incoherent it can be represented as a
real number with $\theta_\epsilon = 0$ and $\beta_{\epsilon\epsilon} = 0$ and equation (675) becomes

$\bar{G} = \int \bar{\sigma}\, d\epsilon = g \bar{z}\bar{n}/\bar{a}$  (692)

while equations (683) and (685) give

$G \cos\theta_G = \int_0^\infty \sigma \cos\theta_\sigma\, d\epsilon$  (693)

$G \sin\theta_G = \int_0^\infty \sigma \sin\theta_\sigma\, d\epsilon$  (694)

which are valid for incoherent photon energies. The left-hand sides of equations
(693) and (694) are given by equations (687) and (688). If the photon energy is
taken to be coherent with $d\bar{\epsilon} = j\bar{\epsilon}\, d\theta_\epsilon$, where the photon energy magnitude is given
by $\epsilon = \epsilon_c$ = constant and where $\beta_{\epsilon\epsilon} = \pi/2$, then equation (675) becomes

$\bar{G} = j \int_0^{\pi/6} \bar{\sigma}\bar{\epsilon}\, d\theta_\epsilon = g \bar{z}\bar{n}/\bar{a}$  (695)


and  equations  (684)  and  (686)  become 


tt/6 


G  cos  0„  =  -  e  fa  sin(0  +  0  )d0 

Cl  r  J  rr  <r  c 


(696) 


tt/6 


G  sin  0  =  e  fa  cos(0  +  0  )d0 

CZ  x  rr  r  '  e 


(697) 


which are valid for coherent photon energies. If σ is independent of θ_ε, then
with σ = σ_c = constant equations (696) and (697) become, for coherent photon
energies,

    G \cos\theta_G = \varepsilon_c \sigma_c [\cos(\theta_\sigma + \pi/6) - \cos\theta_\sigma]    (698)

    G \sin\theta_G = \varepsilon_c \sigma_c [\sin(\theta_\sigma + \pi/6) - \sin\theta_\sigma]    (699)

Therefore in general θ_G > π/2 for coherent photon energies, and in particular if
θ_σ = 0 then θ_G = 105°. This means that G cos θ_G < 0 for coherent photon
energies. Therefore a coherent spacetime photonuclear reaction that is described
by the sum rule in equation (695) is not possible unless Z < 0, N < 0 and A < 0
in equation (687), a case analogous to the negative values of the ordinary
magnetic quantum number m.
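The value θ_G = 105° quoted for θ_σ = 0 follows directly from the ratio of equations (699) and (698). A minimal sketch of that evaluation is given below; the function name is hypothetical, and the common positive factor ε_c σ_c is assumed to drop out of the phase.

```python
import math

def coherent_sum_rule_phase(theta_sigma, upper=math.pi / 6):
    """Return theta_G (radians) from the brackets of equations (698) and (699)."""
    # The common positive factor eps_c * sigma_c does not affect the phase.
    real = math.cos(theta_sigma + upper) - math.cos(theta_sigma)
    imag = math.sin(theta_sigma + upper) - math.sin(theta_sigma)
    return math.atan2(imag, real)

theta_G_deg = math.degrees(coherent_sum_rule_phase(0.0))
print(round(theta_G_deg, 6))
```

By the sum-to-product identities the two brackets reduce to θ_G = θ_σ + 7π/12 (modulo 2π), so the phase is shifted by 105° for any θ_σ and the real part is negative whenever θ_σ is small.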


For light nuclei in the valley of beta stability equation (178) gives

    \theta_z \sim \theta_n \sim \theta_a    (700)

and for this special case equations (677) and (678) become

    G = gZN/A \cos\theta_a    (701)

    \theta_G = \theta_a    (702)

For this case equations (693) and (694) become for incoherent photon energies

    gZN/A \cos^2\theta_a = \int \sigma \cos\theta_\sigma\, d\varepsilon    (703)

    gZN/A \cos\theta_a \sin\theta_a = \int \sigma \sin\theta_\sigma\, d\varepsilon    (704)

    \tan\theta_a = (\int \sigma \sin\theta_\sigma\, d\varepsilon)/(\int \sigma \cos\theta_\sigma\, d\varepsilon)    (704A)

from which σ and θ_σ can be obtained by inversion techniques for nuclei in the
valley of beta stability.
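An inversion procedure of this kind needs the forward evaluation of equation (704A) as its basic step. The sketch below shows only that forward step, using trapezoidal integration over sampled data; the sample arrays are hypothetical and purely illustrative, and the function names are not from the paper.

```python
import math

def trapezoid(y, x):
    """Trapezoidal integral of samples y over abscissae x."""
    return sum(0.5 * (y[i] + y[i + 1]) * (x[i + 1] - x[i])
               for i in range(len(x) - 1))

def theta_a_from_cross_section(eps, sigma, theta_sigma):
    """Phase angle theta_a (radians) from the ratio in equation (704A)."""
    num = trapezoid([s * math.sin(t) for s, t in zip(sigma, theta_sigma)], eps)
    den = trapezoid([s * math.cos(t) for s, t in zip(sigma, theta_sigma)], eps)
    return math.atan2(num, den)

# Hypothetical samples: a single resonance-like peak with a constant phase of 0.1 rad.
eps = [10.0 + 0.5 * i for i in range(21)]               # MeV, illustrative grid
sigma = [1.0 / (1.0 + (e - 15.0) ** 2) for e in eps]    # arbitrary units
theta_sigma = [0.1] * len(eps)

theta_a = theta_a_from_cross_section(eps, sigma, theta_sigma)
print(theta_a)
```

For a constant θ_σ the ratio collapses to tan θ_σ, so the routine returns θ_a = θ_σ, which is a convenient sanity check before trying energy-dependent phases.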

Consider now the case of nuclei with fissility parameters less than unity
χ < 1 that have been brought to the condition of incipient fission by the
application of an external γ ray field. For this case the internal phase angles
of the atomic number, neutron number and atomic mass number are related by
equation (473) as follows

    \theta^F_a = 2\theta^F_z = 2\theta^F_n    (705)

and therefore equations (677), (678) and (705) give

    G = gZN/A (1 - \tan^2\theta^F_z)^{-1}    (706)

    \theta_G = 0    (707)

From equations (685) and (707) it follows that for incipient fission

    \theta_\sigma = 0    \beta_{\varepsilon\varepsilon} = 0    (708)

so that the photonuclear interaction for the case of fission must be scalar
(θ_σ = 0) with incoherent photon interactions, and equations (693), (694) and
(706) through (708) become

    \int \sigma\, d\varepsilon = gZN/A (1 - \tan^2\theta^F_z)^{-1}    (709)

However, it has been shown in equation (480) that the approximate condition for
γ ray catalyzed thermal neutron induced clean fission is given by

    \tan\theta^F_z = (1 - \chi)^{1/2}    (710)

where χ ≤ 1 is the fissility parameter. Combining equations (709) and (710)
gives for incoherent photon interactions

    \int \sigma\, d\varepsilon = g\chi^{-1} ZN/A    (711)

and therefore the fissility parameter enters into the electric dipole sum rule
for γ ray catalyzed thermal neutron induced clean fission reactions.
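Equations (709) through (711) can be checked numerically. The sketch below, with hypothetical function names, evaluates the incipient fission angle of equation (710) and the factor (1 - tan^2 θ_z^F)^-1 of equation (709), using the fissility parameter χ = 0.369 listed for 63Cu in Table 1.

```python
import math

def incipient_fission_angle_deg(chi):
    """theta_z^F in degrees from equation (710); requires chi <= 1."""
    if chi > 1.0:
        raise ValueError("equation (710) needs chi <= 1")
    return math.degrees(math.atan(math.sqrt(1.0 - chi)))

def sum_rule_enhancement(chi):
    """Factor (1 - tan^2 theta_z^F)^-1 appearing in equation (709)."""
    t = math.tan(math.radians(incipient_fission_angle_deg(chi)))
    return 1.0 / (1.0 - t * t)

chi_cu63 = 0.369  # fissility parameter of 63Cu from Table 1
print(incipient_fission_angle_deg(chi_cu63), sum_rule_enhancement(chi_cu63))
```

The angle agrees with the 38.5° tabulated for 63Cu to within rounding, and the enhancement factor equals 1/χ, which is the content of equation (711).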

8. γ RAY CATALYZED CLEAN FISSION NUCLEAR REACTOR DESIGN CONCEPTS. This
section presents design concepts for γ ray catalyzed clean fission nuclear
reactor cores whose fuel consists of elements heavier than 56Fe but lighter than
the actinides. As explained in the previous sections, the subactinide elements
cannot be fissioned by thermal neutrons under ordinary circumstances, but in
the presence of γ rays tuned to the giant resonance frequency of the fuel
elements these nuclei can be fissioned by thermal neutrons. The γ ray photon
number density required to fission a subactinide nucleus is relatively low
because the γ rays are required only to bring the internal phase angle θ_z of the
atomic number up to its incipient fission value θ_z^F given by equation (483), at
which point fission by thermal neutrons is possible. When elements heavier
than 56Fe, such as 63Cu, are used as fuel in a clean fission nuclear reactor,
the fission products of the thermal neutron induced γ ray catalyzed fission
reactions are neutrons and atomic nuclei which have relatively
low level radioactivity because they have smaller neutron excesses than those
of the fission product nuclei of 235U or 239Pu, which are the fuel elements of
conventional nuclear reactors.

Relatively high energy γ rays with energies in the range of 10-20 MeV
are required for the clean fission of subactinide nuclei. Possible sources of
the relatively high energy γ rays that are required to catalyze clean fission
reactions are

a)  bremsstrahlung  of  electrons  in  matter 

b)  nuclei  excited  by  collisions  and  subsequent  giant  dipole  resonance 
decay 

c)  synchrotron  radiation  from  electrons 

d)  nuclear  reactions 

e)  nuclear  fission 

f) γ ray lasers

The possibility of using γ rays from the radioactive decay of natural and
man-made radioisotopes is excluded because they generally are in the range
0.01 < ε_γ < 10 MeV, which is too low to excite the giant dipole resonance states
in atomic nuclei.

Figures 4 and 5 show two clean fission nuclear reactor core design concepts
in which γ rays are directed to the surface of a fuel and heat exchanger
system. In this case the fuel element, such as 63Cu, is in the form of
rectangular plates or a spherical shell which are assembled adjacent to a structure
which contains a heat exchanger. Table 3 shows that for 63Cu as the fuel
element the incident γ rays must have an energy of about ε_γ = 20 MeV and a photon
number density of at least n_γ = 7 x 10^23 photons/m^3, which is a relatively small
number density. The γ rays impinge on the surfaces of the fuel element plates
and elevate the value of the internal phase angle θ_z of the atomic number of
the fuel element up to the critical value θ_z^F required for clean fission, which
for 63Cu is θ_z^F = 38.5° as shown in Table 1. Then an initial source of thermal
neutrons is introduced to start the fission reaction. The fission reaction
emits two or three high energy neutrons as a fission product and these can be
thermalized by moderators to produce a self sustaining chain reaction. The
clean fission nuclear reactions are expected to occur mainly at the surface of
the fuel element plates because γ rays attenuate rapidly in matter.

After an initial introduction of thermal neutrons and an initial input
of power to activate the γ ray sources, the output of neutrons and power from
the clean fission reactions can be used to create a self sustaining nuclear
reactor. During operation of the reactor a portion of the power developed is
used to drive the external γ ray sources. The nuclear reaction rates can be
controlled by adjusting the external γ ray flux, so that the control rods of
the standard uranium fission reactors are not required to control the neutron
flux in clean fission reactors but can be retained as a safety factor. Minimum
shielding is required for a clean fission nuclear reactor because of the
low level radioactivity of the fission product nuclei. Minimum shielding is
also required for high energy γ rays because they are rapidly attenuated in
matter by electron-positron pair production.

9. CONCLUSION. This paper suggests that in the presence of an electromagnetic
or gravitational field the atomic number, neutron number and atomic
mass number have broken internal symmetries and must be represented as complex
numbers of a special type. The magnitudes and phase angles of these complex
numbers are determined by the requirement that the nuclear wave function be
periodic in terms of the measured values of the internal angle coordinates
associated with the atomic number, neutron number and atomic mass number. In
the presence of an external field the number of atomic nuclei must also be
represented as a complex number, and this suggests the possibility of internal
phase angle radioactive decays wherein the integer numbers Z, N, A and n remain
fixed but their associated internal phase angles are decaying with time. For
all types of nuclear reactions and decays the integer baryon number is
conserved.

A complex number form of the liquid drop nuclear mass formula can be
developed and applied to the calculation of the binding energy of atomic
nuclei. Similarly, a complex number Bohr-Wheeler fission equation can be
developed that describes the condition for spontaneous and thermal neutron induced
fission of atomic nuclei located in an electromagnetic or gravitational field.
This fission condition suggests that the internal phase angles associated with
Z, N and A can be affected by the application of an electromagnetic field in
such a way that nuclear fission by thermal neutrons can be catalyzed by γ rays
in subactinide nuclei for which the fissility parameter satisfies χ < 1 and
which ordinarily would not undergo fission by thermal neutrons.

The catalysis of thermal neutron induced clean fission requires a) that
the frequency of the incident γ rays be tuned to the giant dipole resonance
frequency of the particular subactinide element that is selected to be used as
fuel in a nuclear reactor, and b) that the incident γ ray photon number density
be larger than a critical value required to make the internal phase angle of
the atomic number θ_z equal to the critical value θ_z^F that is required for fission.
This means that γ rays in the range of 12-25 MeV must be used to catalyze
thermal neutron induced fission in the subactinide elements; for example, 63Cu
requires 20 MeV γ rays. The fission of subactinide elements will produce fission
products that have low level beta, alpha and neutron emissions, so that the
radioactive wastes from clean fission nuclear reactors will be not nearly as
dangerous as the fission waste products from conventional 235U or 239Pu nuclear
reactors. By using intermediate weight elements such as 59Co, 58Ni or 63Cu as
fuel elements for γ ray catalyzed thermal neutron induced fission, it is
possible to construct clean fission nuclear reactors that have no radioactive
waste products whatsoever. Finally, the dangerously radioactive fission waste
product elements of conventional 235U and 239Pu nuclear reactors can be used
as fuel in γ ray catalyzed clean fission reactors, thereby offering a useful
way of eliminating present day stored radioactive wastes.

Table 1. Nuclear Characteristics of the γ Ray Catalyzed Clean Fission
of Actinide and Subactinide Elements by Thermal Neutrons


Nucleus   Z^2/A    χ       θ_z^F       B_f         B_γ
                           (degrees)   (10^12 T)   (10^3 T)

239Pu     36.97    1.020    0           0           0
233U      36.33    1.002    0           0           0
234U      36.17    0.998    2.7         3.27        1.34
235U      36.02    0.994    0           0           0
231Pa     35.85    0.989    6.0         7.36        3.02
227Th     35.68    0.984    0           0           0
238U      35.56    0.981    7.8         9.64        3.94
228Th     35.53    0.980    0           0           0
232Th     34.91    0.963   10.9        13.41        5.51
208Pb     32.33    0.892   18.3        22.51        9.41
181Ta     29.44    0.812   23.5        28.91       12.37
180Hf     28.80    0.795   24.4        30.01       12.86
152Sm     25.29    0.698   28.8        35.42       15.61
139La     23.37    0.645   30.8        37.88       16.94
127I      22.12    0.610   32.0        39.36       17.87
90Zr      17.78    0.490   35.5        43.67       21.00
88Sr      16.41    0.453   36.5        44.90       21.67
87Rb      15.74    0.434   37.0        45.51       22.01
75As      14.52    0.401   37.8        46.49       23.05
64Zn      14.06    0.388   38.0        46.74       23.79
58Ni      13.52    0.373   38.4        47.23       24.44
63Cu      13.35    0.369   38.5        47.36       24.17
59Co      12.36    0.341   39.0        47.97       24.75
56Fe      12.07    0.333   39.2        48.22       25.10
16O        4.0     0.110   43.3        53.26       34.16
14N        3.5     0.097   43.5        53.51       35.09
12C        3.0     0.083   43.8        53.87       36.23
4He        1.0     0.028   44.6        54.86       44.33

Table 2. Nuclear Characteristics of the γ Ray Catalyzed Clean Fission
of Actinide and Subactinide Elements by Thermal Neutrons


Nucleus   ?        ?               k             f_r          λ_r
                   (10^2 T/deg)    (10^20 N/m)   (10^21 Hz)   (fm)

239Pu     4.086    5.026           1.744          3.116       96.19
233U      4.104    5.048           1.729          3.143       95.38
234U      4.101    5.044           1.732          3.138       95.51
235U      4.098    5.041           1.734          3.134       95.65
231Pa     4.110    5.055           1.724          3.152       95.10
227Th     4.122    5.070           1.714          3.170       94.55
238U      4.089    5.029           1.741          3.121       96.06
228Th     4.119    5.066           1.717          3.166       94.69
232Th     4.107    5.052           1.727          3.147       95.24
208Pb     4.182    5.144           1.665          3.264       91.84
181Ta     4.280    5.264           1.590          3.419       87.68
180Hf     4.284    5.269           1.587          3.425       87.52
152Sm     4.407    5.421           1.500          3.624       82.72
139La     4.473    5.502           1.456          3.734       80.29
127I      4.541    5.585           1.412          3.848       77.91
90Zr      4.809    5.915           1.259          4.316       69.46
88Sr      4.827    5.937           1.250          4.348       68.94
87Rb      4.836    5.948           1.245          4.365       68.68
75As      4.957    6.097           1.185          4.586       65.37
64Zn      5.090    6.261           1.124          4.835       62.00
58Ni      5.174    6.364           1.088          4.996       60.00
63Cu      5.103    6.277           1.118          4.860       61.68
59Co      5.159    6.346           1.094          4.968       60.34
56Fe      5.205    6.402           1.075          5.055       59.30
16O       6.413    7.888           0.708          7.675       39.06
14N       6.557    8.065           0.677          8.024       37.36
12C       6.778    8.275           0.643          8.448       35.49
4He       8.080    9.938           0.446         12.183       24.60

(A "?" in a column heading marks a symbol that is illegible in the source.)
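The last two columns of Table 2 can be cross-checked against each other. The sketch below assumes (this is an interpretation of the partly illegible headings, not a statement from the paper) that they list a resonance frequency in units of 10^21 Hz and the corresponding free-space wavelength in femtometres, related by wavelength = c/frequency.

```python
C_LIGHT = 2.99792458e8  # speed of light in m/s

def wavelength_fm(freq_1e21_hz):
    """Wavelength in femtometres for a frequency given in units of 10^21 Hz."""
    return C_LIGHT / (freq_1e21_hz * 1e21) * 1e15

# (frequency, tabulated wavelength) pairs copied from Table 2.
rows = {"239Pu": (3.116, 96.19), "63Cu": (4.860, 61.68), "56Fe": (5.055, 59.30)}

for name, (freq, lam) in rows.items():
    print(name, round(wavelength_fm(freq), 2), lam)
```

Under that assumption the computed and tabulated wavelengths agree to within a few hundredths of a femtometre for the three sampled rows.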

Table 3. Nuclear Characteristics of the γ Ray Catalyzed Clean Fission
of Actinide and Subactinide Elements by Thermal Neutrons


Nucleus   H_?            ?        ?               ε_γ      ?                     n_γ
          (10^? amp/m)            (10^22 ?/m^2)   (MeV)    (10^3? m^-2 sec^-1)   (10^23 m^-3)

239Pu      0              0        0               0       0                     0
233U       0              0        0               0       0                     0
234U       1.07           0.40     0.02           12.98    0.01                  0.03
235U       0              0        0               0       0                     0
231Pa      2.41           0.91     0.11           13.04    0.05                  0.18
227Th      0              0        0               0       0                     0
238U       3.14           1.18     0.19           12.91    0.09                  0.30
228Th      0              0        0               0       0                     0
232Th      4.39           1.65     0.36           13.02    0.17                  0.58
208Pb      7.49           2.82     1.06           13.50    0.49                  1.63
181Ta      9.85           3.71     1.83           14.14    0.81                  2.70
180Hf     10.23           3.86     1.97           14.17    0.87                  2.90
152Sm     12.42           4.68     2.91           14.99    1.21                  4.04
139La     13.48           5.08     3.43           15.44    1.39                  4.62
127I      14.23           5.36     3.82           15.92    1.50                  4.99
90Zr      16.72           6.30     5.27           17.85    1.84                  6.14
88Sr      17.24           6.50     5.61           17.98    1.95                  6.49
87Rb      17.52           6.60     5.78           18.05    2.00                  6.67
75As      18.34           6.91     6.34           18.97    2.09                  6.96
64Zn      18.94           7.14     6.76           20.00    2.11                  7.04
58Ni      19.45           7.33     7.13           20.67    2.15                  7.18
63Cu      19.23           7.25     6.97           20.11    2.16                  7.22
59Co      19.69           7.42     7.31           20.55    2.22                  7.40
56Fe      19.97           7.53     7.51           20.91    2.24                  7.48
16O       27.18          10.25    13.92           31.75    2.74                  9.13
14N       27.92          10.53    14.70           33.19    2.76                  9.22
12C       28.83          10.87    15.67           34.94    2.80                  9.34
4He       35.27          13.30    23.45           50.40    2.90                  9.69

(A "?" in a column heading marks a symbol or digit that is illegible in the source.)


Fig. 1. Internal phase angle of the atomic number that is required for the
clean fission of subactinide nuclei versus fissility parameter.

Fig. 2. Frequency dependence of the impedance of an atomic nucleus
(not to scale).

Fig. 3. Ratio of dynamic to static magnetic fields required for fission
versus frequency (not to scale).

Figure 4. Design concept for a γ ray catalyzed thermal neutron
induced clean fission nuclear reactor core.

Figure 5. Design concept for γ ray catalyzed thermal neutron
clean fission nuclear reactor core.


THERMONUCLEAR REACTIONS IN STRONG GRAVITATIONAL FIELDS


Richard  A.  Weiss 

U.S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT. This paper calculates the rates of thermonuclear reactions in
gases  that  are  located  in  gravitational  fields  or  magnetic  fields  which  are 
sufficiently  strong  as  to  produce  a  partially  coherent  or  totally  coherent 
spacetime  state  in  the  gases.  Within  broken  symmetry  spacetime  the  space  and 
time  coordinates  are  represented  by  complex  numbers  in  an  internal  space,  and 
therefore  the  single  particle  momentum  and  energy  must  also  be  represented  as 
complex  numbers  in  an  internal  space.  The  variation  of  the  space  and  time 
coordinates  and  the  kinematical  variables  such  as  single  particle  momentum  and 
energy  can  occur  in  two  limiting  ways:  a)  an  incoherent  variation  in  which  the 
magnitudes  of  these  quantities  change,  and  b)  a  coherent  variation  wherein  these 
quantities  rotate  in  internal  space  with  fixed  magnitudes.  These  two  extremes 
in  the  variation  of  the  spacetime  coordinates  and  the  kinematical  variables 
determine  the  limiting  forms  of  the  internal  energy,  pressure  and  chemical 
potential  of  a  thermodynamic  system  because  these  physical  quantities  are  derived 
from  integrals  of  distribution  functions  with  respect  to  complex  number  single 
particle  momenta  or  energies.  The  equations  of  state  for  the  noninteracting 
Boltzmann,  Fermi  and  Bose  gases  are  derived  for  the  broken  spacetime  symmetry 
state  that  is  induced  by  a  gravitational  field.  The  complex  number  phase  space 
integrals  for  the  particle  number,  internal  energy  and  pressure  are  developed  for 
the  partially  coherent  spacetime  state  of  classical  and  quantum  gases  and  are 
then  specialized  to  the  limiting  case  of  incoherent  and  coherent  spacetime.  The 
thermonuclear  reaction  rates  for  the  Boltzmann,  Fermi  and  Bose  gases  are 
calculated for the general case of partially coherent spacetime, and then
specialized  to  the  cases  of  coherent  and  incoherent  spacetime  states.  It  is 
suggested  that  the  measured  neutrino  deficit  for  the  sun  can  be  explained  by  the 
combined effects of the lowered nuclear reaction rates due to the effects of the
time  renormalization  group  equation  and  by  a  reduction  in  the  thermonuclear 
reaction  rates  due  to  a  coherent  spacetime  state  which  may  exist  in  some  regions 
of  the  interior  of  the  sun. 

1. INTRODUCTION. Element formation in stars is due to a series of
thermonuclear reactions.1-8 The same reactions are responsible for energy
generation in stars whose light is observed and analyzed by scientists on the
earth in an attempt to understand nucleosynthesis. The types and rates of
nuclear reactions depend on the central temperature and density of stars, and
therefore the initial mass of a star ultimately determines the relative abundance
of the elements that are synthesized.1-8 Only objects with masses greater than
0.08 M☉ are stars with thermonuclear reactions in their interiors. Low mass
stars with M < 1 M☉ remain main sequence stars that undergo hydrogen burning and
convert hydrogen to helium.1-8 For stars with M > 1 M☉ helium burning and heavier
element burning occur, and the evolutionary outcome depends on the initial
stellar mass. For initial masses varying from 1 to 8 M☉ the stars can lose
enough mass to reach a white dwarf end state with a mass of about 0.6 M☉ and an
interior consisting mainly of 12C and degenerate electrons.1-8 The maximum mass
of a white dwarf is M = 1.4 M☉, the Chandrasekhar limit. For stars with initial
masses greater than 8 M☉ it is not possible to lose sufficient mass during their
lifetimes, and the cores of these stars undergo a series of contractions to higher
temperatures and densities which lead to the thermonuclear production of elements
heavier than helium, resulting finally in a core consisting of 56Fe and
degenerate electrons.1-8 Since 56Fe has the greatest binding energy per nucleon it
cannot undergo subsequent fusion reactions, and the mass of the iron core begins
to grow. When this core of iron and degenerate electrons exceeds the Chandrasekhar
limit a collapse occurs with a rapid electron capture by the protons of the nuclei,
which leads to neutron rich nuclei, neutron drip and ultimately to a supernova
explosion and the possible formation of a neutron star if the remnant stellar
nucleus has a mass M < 3 M☉.1-8 Supernova explosions are responsible for the
heavy element distribution in the universe.4,9-13 The final evolutionary
development for supernova remnants with M > 3 M☉ results in perhaps a black hole
if classical general relativity is valid at high densities. However, quantum
gravity effects may lead to different final collapsed states, but quantum gravity
is not yet properly understood. Light elements such as deuterium, lithium and
helium were formed mainly during the big bang nucleosynthesis.14,15

Supernova  nucleosynthesis  in  stars  results  in  a  distribution  of  heavy 
elements  throughout  the  universe  in  the  form  of  stars  and  planets.  The  measured 
element  abundances  can  be  used  to  develop  models  of  the  thermonuclear  reactions 
in  stars  and  these  can  be  used  in  conjunction  with  laboratory  measured  fusion 
cross  sections  to  understand  the  temperature,  pressure  and  composition  of  stellar 
interiors. There is good agreement between measured abundances of heavy elements
and  the  theoretically  predicted  abundances  based  on  a  standard  model  of 
nucleosynthesis.1-10 Energy production in stars appears ultimately to an observer
on  the  earth  as  electromagnetic  radiation  and  as  particle  radiations  such  as 
neutrinos.  From  an  analysis  of  these  emanations  the  astrophysicist  attempts  to 
develop  physical  models  of  stellar  interiors  that  predict  the  luminosity  and 
spectrum  of  the  photons  and  neutrinos  emitted  from  the  interior  regions.  The 
observed  electromagnetic  emissions  from  stellar  surfaces  agree  very  well  with  the 
theoretical  model  predictions  of  the  stellar  surface  temperatures.2  However,  the 
agreement  between  measured  and  predicted  neutrino  production  is  not  good.  Over 
the  past  decade  it  has  become  clear  that  the  predicted  neutrino  flux  generated 
by  the  standard  model  of  nucleosynthesis  is  about  three  times  larger  than  the 
neutrino flux measured in detectors on the earth.16-24 Several explanations of
this  discrepancy  have  been  suggested,  including  oscillation  of  supposed  massive 
neutrinos.16-29 It is not yet clear whether the discrepancy between the measured
and  predicted  neutrino  fluxes  is  due  to  errors  in  the  standard  nucleosynthesis 
model  or  to  some  other  process  such  as  neutrino  oscillations  during  their  transit 
from the sun's interior to the earth.16-29

Several explanations of the experimental neutrino deficit can be given in
terms of the internal structure of stars. One idea that has been suggested is
that the calculation of the rates of processes that occur in a medium must depend
on the state equation of the bulk matter within which the processes occur.30,31
Thus the thermonuclear reaction rates within the interior of stars must depend
not only on the values of the temperature and density of the stellar gases but
also on the state equation of the stellar material.30,31 The renormalized
relativistic time constant t' of a process such as a thermonuclear reaction
occurring in a real gas must be related to the unrenormalized time constant t^a'
for the process, as given by a standard calculation, by the following gauge
invariant time renormalization group equation30,31

    t' - \beta_E\, \partial t'/\partial E' + 3\beta_P\, \partial t'/\partial P' = t^{a\prime} - \beta^a_E\, \partial t^{a\prime}/\partial E^{a\prime}    (1A)

where the gauge parameters are given by

    \beta_E = T/V' (dU'/dT)_{P'V'}    \beta^a_E = T/V' (dU^{a\prime}/dT)_{P^{a\prime}V'}    (2)

    \beta_P = d/dV' (P'V')_{U'}    \beta^a_P = d/dV' (P^{a\prime}V')_{U^{a\prime}}    (3)

where T = absolute temperature, V' = volume of space in broken symmetry
spacetime, U' and U^a' = renormalized and unrenormalized values of the internal energy
in broken symmetry spacetime, E' and E^a' = renormalized and unrenormalized
values of the energy density in broken symmetry spacetime, and where P' and P^a' =
renormalized and unrenormalized values of the pressure in broken symmetry
spacetime. The renormalized and unrenormalized (standard) rates of a process are
given by R' = 1/t' and R^a' = 1/t^a' respectively. The time renormalization group
equation (1A) is formulated to include the Minkowski metric, and generally
predicts that processes occurring within a real medium run slower (t' > t^a') than
the standard calculations would predict.30 For an ideal gas β_E = 0 and β_P = 0
so that t' = t^a'. Quantized versions of the time equation (1A) have been
formulated which suggest that energy and pressure fields can have a time structure,
and that the rates of processes such as thermonuclear reactions can be depressed
or enhanced according to how structures of time can form within the matter in the
interior of stars.31 The quantized version of the time equation is written as31

    (1 - \omega')(t' - \beta_E\, \partial t'/\partial E') + 3\beta_P\, \partial t'/\partial P' = 0    (1B)

where ω' can be a discrete or continuous eigenvalue. For an ideal gas ω' = 1,
β_E = 0 and β_P = 0.

Probably more than one effect is responsible for the measured neutrino
deficit, and equations (1A) and (1B) are only part of the solution. This paper
suggests an additional effect that causes another portion of the neutrino
deficit, namely that the discrepancy between the experimental and predicted
neutrino emission rates from the sun is also due to an overestimation of the
neutrino generation rate R^a' in the standard stellar models because of the
assumption that the stellar interior can be described by an incoherent spacetime
Maxwell-Boltzmann gas.2 It is suggested in this paper that in fact stellar
interiors should be described by a Maxwell-Boltzmann gas in partially coherent or
totally coherent spacetime, and that this, combined with the time equations (1A)
and (1B) if interactions are considered, can describe the measured neutrino
deficit. For ideal gases the time renormalization group equations (1A) and (1B)
give R' = R^a', and only the effects of broken symmetry spacetime contribute to
the measured neutrino deficit.

A material existing in coherent spacetime is a special case of a material
embedded in broken symmetry spacetime having partial coherence. The broken
symmetries of spacetime can be described by writing the spacetime coordinates
as complex numbers in an internal space in the following manner32

    \bar{t} = t \exp(j\theta_t)    (4)

    \bar{x} = x \exp(j\theta_x)    \bar{y} = y \exp(j\theta_y)    \bar{z} = z \exp(j\theta_z)    (5)

where θ_t, θ_x, θ_y and θ_z = internal phase angles of time and space. The volume
of space is written for the general case of broken symmetry spacetime as30-32

    V' = \int |d\bar{V}| = \int \sec\beta_{VV}\, dV = \int \csc\beta_{VV}\, V\, d\theta_V    (6)

where

    \tan\beta_{VV} = V\, \partial\theta_V/\partial V    (7)

For incoherent spacetime30-32

    \beta_{VV} = 0    V' = V    (8)

while for coherent spacetime30-32

    \beta_{VV} = \pi/2    V' = V\theta_V    (9)

    \theta_x = \theta_y = \theta_z = \pi/3    \theta_t = \pi/6    (10)
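Equation (6) is the arc length traced by the complex volume in the internal plane, so its two limiting forms (8) and (9) can be checked by summing small chords numerically. The following is a minimal sketch under that reading; the path parametrization, the step count and the sample values of V and θ_V are illustrative assumptions, not quantities from the paper.

```python
import cmath
import math

def broken_symmetry_volume(path, steps=20000):
    """Approximate V' = integral of |dV_bar| along a path s -> (V, theta_V), s in [0, 1]."""
    total = 0.0
    v0, th0 = path(0.0)
    prev = cmath.rect(v0, th0)
    for i in range(1, steps + 1):
        v, th = path(i / steps)
        cur = cmath.rect(v, th)
        total += abs(cur - prev)   # chord length approximates |dV_bar|
        prev = cur
    return total

V, theta_V = 2.0, math.pi / 3   # hypothetical magnitude and phase

# Incoherent limit: the magnitude grows from 0 to V at fixed phase, equation (8).
incoherent = broken_symmetry_volume(lambda s: (s * V, theta_V))
# Coherent limit: the phase sweeps from 0 to theta_V at fixed magnitude, equation (9).
coherent = broken_symmetry_volume(lambda s: (V, s * theta_V))

print(incoherent, coherent, V * theta_V)
```

The incoherent path returns V and the coherent path returns V θ_V to within the discretization error, matching the two limits stated above.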

Exhaustive treatments of the statistical mechanics of matter for incoherent
spacetime can be found in the literature.33-35 This paper extends this
literature by treating the statistical mechanics of ideal gases for partial and total
coherence of spacetime.

The calculation of the measured pressure and internal energy of a
thermodynamic system requires the solution of a renormalization group equation which
relates the renormalized relativistic internal energy and pressure to the
unrenormalized (standard) values of the internal energy and pressure by the
following relativistic trace equation30-32,36

    \bar{U}' + T(d\bar{U}'/dT)_{P'V'} - 3V'\, d/dV' (\bar{P}'V')_{U'} = \bar{U}^{a\prime} + T(d\bar{U}^{a\prime}/dT)_{P^{a\prime}V'}    (11A)

where \bar{P}' and \bar{U}' = renormalized complex number pressure and internal energy in
broken symmetry spacetime, and \bar{P}^a' and \bar{U}^a' = unrenormalized pressure and
internal energy in broken symmetry spacetime. If the complex number renormalized
pressure and internal energy are written as32

    \bar{P}' = P' \exp(j\theta_P)    \bar{U}' = U' \exp(j\theta_U)    (12)

then the measured pressure and internal energy are given by32

    P'_m = P' \cos\theta_P    U'_m = U' \cos\theta_U    (13)

For ideal gases (for which \bar{P}'V' = 2/3 \bar{U}') it follows from the relativistic trace
equation (11A) that36

    \bar{P}' = \bar{P}^{a\prime}    \bar{U}' = \bar{U}^{a\prime}    (14)

The quantum version of equation (11A) is written as31

    (1 - \mu')[\bar{U}' + T(d\bar{U}'/dT)_{P'V'}] - 3V'\, d/dV' (\bar{P}'V')_{U'} = 0    (11B)

where μ' = continuous or discrete eigenvalue. For interacting systems the
unrenormalized quantities U^a' and P^a' must first be calculated, and then the
relativistic trace equation (11A) must be solved to determine U' and P' in order
to obtain the measured values of the pressure and internal energy given by
equation (13). Equations (11A) and (11B) are renormalization group equations that
are analogous to the time equations (1A) and (1B) respectively. For a complete
analysis equations (1A) and (11A) or equations (1B) and (11B) must be solved
jointly to determine the thermonuclear reaction rates in stars whose gases have
real state equations. The effects of broken symmetry spacetime appear in all
four equations (1A), (1B), (11A) and (11B) through the variable V'. However,
in this paper ideal gases are treated and none of these four equations need be
considered because t' = t^a' and U' = U^a', and the superscript "a" will be dropped
throughout this paper. Only the effects of broken spacetime symmetry on ideal
gas state equations and thermonuclear reaction rates are considered in this paper.


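Consider now the velocity and acceleration of a particle located in
spacetime with broken internal symmetries. For simplicity only the radial
coordinate is considered but similar expressions hold for cartesian coordinates.$^{32}$
For spacetime with broken internal symmetries the change in the complex number
radial coordinate $\bar\rho$ and the change in the complex number time $\bar t$ are written as$^{32}$

$$d\bar\rho = \sec\beta_{\rho\rho}\, d\rho\, \exp[j(\theta_\rho + \beta_{\rho\rho})] \tag{15}$$

$$\quad = \csc\beta_{\rho\rho}\, \rho\, d\theta_\rho\, \exp[j(\theta_\rho + \beta_{\rho\rho})] \tag{16}$$

$$d\bar t = \sec\beta_{tt}^{\rho}\, dt\, \exp[j(\theta_t^{\rho} + \beta_{tt}^{\rho})] \tag{17}$$

$$\quad = \csc\beta_{tt}^{\rho}\, t\, d\theta_t^{\rho}\, \exp[j(\theta_t^{\rho} + \beta_{tt}^{\rho})] \tag{18}$$

where

$$\tan\beta_{tt}^{\rho} = t\,\partial\theta_t^{\rho}/\partial t, \qquad \tan\beta_{\rho\rho} = \rho\,\partial\theta_\rho/\partial\rho \tag{19}$$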

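The velocity can be written as

$$\bar v = v\exp(j\theta_v) = d\bar\rho/d\bar t = d/d\bar t\,[\rho\exp(j\theta_\rho)] \tag{20}$$

where

$$v = \cos\beta_{tt}^{\rho}\,\sec\beta_{\rho\rho}\; d\rho/dt \tag{21}$$

$$\quad = \sin\beta_{tt}^{\rho}\,\csc\beta_{\rho\rho}\; (\rho/t)\; d\theta_\rho/d\theta_t^{\rho} \tag{22}$$

$$\theta_v = \theta_\rho + \beta_{\rho\rho} - \theta_t^{\rho} - \beta_{tt}^{\rho} \tag{23}$$

For coherent spacetime

$$\beta_{\rho\rho} = \pi/2, \qquad \beta_{tt}^{\rho} = \pi/2 \tag{24}$$

so that

$$d\bar\rho = j\bar\rho\, d\theta_\rho, \qquad d\bar t = j\bar t\, d\theta_t^{\rho} \tag{25}$$

For the coherent spacetime state it follows that

$$v^{coh} = (\rho/t)\; d\theta_\rho/d\theta_t^{\rho}, \qquad \theta_v^{coh} = \theta_\rho - \theta_t^{\rho} \tag{26}$$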


where $\theta_t^{\rho}$ = internal phase angle of time that is associated with the time
variation of the radial coordinate. In general each coordinate $x, y, z$ or $r, \phi, z$
or $\rho, \phi, \psi$ has its own internal phase angle of time $\theta_t^x, \theta_t^y, \theta_t^z$ or $\theta_t^r, \theta_t^\phi, \theta_t^z$ or
$\theta_t^\rho, \theta_t^\phi, \theta_t^\psi$.


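The change in velocity is written as$^{32}$

$$d\bar v = \sec\beta_{vv}\, dv\, \exp[j(\theta_v + \beta_{vv})] \tag{27}$$

$$\quad = \csc\beta_{vv}\, v\, d\theta_v\, \exp[j(\theta_v + \beta_{vv})] \tag{28}$$

where

$$\tan\beta_{vv} = v\,\partial\theta_v/\partial v \tag{29}$$

and therefore the acceleration is given by

$$\bar a = a\exp(j\theta_a) = d\bar v/d\bar t \tag{30}$$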


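where

$$a = \cos\beta_{tt}^{\rho}\,\sec\beta_{vv}\; dv/dt \tag{31}$$

$$\quad = \cos\beta_{tt}^{\rho}\,\sec\beta_{vv}\; \frac{d}{dt}\left(\cos\beta_{tt}^{\rho}\,\sec\beta_{\rho\rho}\; d\rho/dt\right) \tag{32}$$

$$\quad = \sin\beta_{tt}^{\rho}\,\csc\beta_{vv}\; (v/t)\; d\theta_v/d\theta_t^{\rho} \tag{33}$$

$$\quad = \sin^2\beta_{tt}^{\rho}\,\csc\beta_{vv}\,\csc\beta_{\rho\rho}\; (\rho/t^2)\; \frac{d\theta_v}{d\theta_t^{\rho}}\,\frac{d\theta_\rho}{d\theta_t^{\rho}} \tag{34}$$

$$\theta_a = \theta_v + \beta_{vv} - \theta_t^{\rho} - \beta_{tt}^{\rho} \tag{35}$$

$$\quad = \theta_\rho + \beta_{vv} + \beta_{\rho\rho} - 2(\theta_t^{\rho} + \beta_{tt}^{\rho}) \tag{36}$$

Equation (34) can also be written as

$$a = \sin^2\beta_{tt}^{\rho}\,\csc\beta_{\rho\rho}\,\csc\beta_{vv}\; (\rho/t^2)\left[\frac{d\theta_\rho}{d\theta_t^{\rho}} - 1 + \frac{d}{d\theta_t^{\rho}}\left(\beta_{\rho\rho} - \beta_{tt}^{\rho}\right)\right]\frac{d\theta_\rho}{d\theta_t^{\rho}} \tag{37}$$

For coherent spacetime equation (24) is valid and equation (29) becomes

$$\tan\beta_{vv}^{coh} = \frac{d\theta_\rho}{d\theta_t^{\rho}}\left(\frac{d\theta_\rho}{d\theta_t^{\rho}} - 1\right)\Big/\frac{d^2\theta_\rho}{d\theta_t^{\rho\,2}} \tag{38}$$

while from equations (24) and (35) through (38) it follows that

$$a^{coh} = (\rho/t^2)\left[\left(\frac{d\theta_\rho}{d\theta_t^{\rho}}\right)^2\left(\frac{d\theta_\rho}{d\theta_t^{\rho}} - 1\right)^2 + \left(\frac{d^2\theta_\rho}{d\theta_t^{\rho\,2}}\right)^2\right]^{1/2} \tag{39}$$

$$\theta_a^{coh} = \theta_\rho - 2\theta_t^{\rho} + \beta_{vv}^{coh} - \pi/2 \tag{40}$$

These equations simplify further if $\theta_\rho = b\,\theta_t^{\rho} + e$ with $b \neq 1$

$$\beta_{vv}^{coh} = -\pi/2, \qquad \frac{d^2\theta_\rho}{d\theta_t^{\rho\,2}} = 0, \qquad \frac{d\theta_\rho}{d\theta_t^{\rho}} = b \tag{41}$$

and equation (28) becomes for coherent spacetime

$$d\bar v = j\bar v\, d\theta_v \tag{42}$$

while equations (39) and (40) become

$$a^{coh} = (\rho/t^2)\,|b(b - 1)| = (\rho/t^2)\, b(1 - b) \tag{43}$$

$$\theta_a^{coh} = \theta_\rho - 2\theta_t^{\rho} - \pi \tag{44}$$

because in general $b < 1$.

The single particle momentum is written as

$$\bar p = p\exp(j\theta_p) = m\bar v \tag{45}$$

$$p = mv, \qquad \theta_p = \theta_v \tag{46}$$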


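so that equations (27) and (28) give

$$d\bar p = \sec\beta_{pp}\, dp\, \exp[j(\theta_p + \beta_{pp})] \tag{47}$$

$$\quad = \csc\beta_{pp}\, p\, d\theta_p\, \exp[j(\theta_p + \beta_{pp})] \tag{48}$$

where

$$\tan\beta_{pp} = p\,\partial\theta_p/\partial p \tag{49}$$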

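The single particle energy is written as

$$\bar\epsilon = \epsilon\exp(j\theta_\epsilon) = \bar p^{\,2}/(2m) \tag{50}$$

so that

$$\epsilon = p^2/(2m), \qquad \theta_\epsilon = 2\theta_p = 2\theta_v \tag{51}$$

where $\theta_v$ is given by equation (23). The variation of the complex number single
particle energy is then written as

$$d\bar\epsilon = \sec\beta_{\epsilon\epsilon}\, d\epsilon\, \exp[j(\theta_\epsilon + \beta_{\epsilon\epsilon})] \tag{52}$$

$$\quad = \csc\beta_{\epsilon\epsilon}\, \epsilon\, d\theta_\epsilon\, \exp[j(\theta_\epsilon + \beta_{\epsilon\epsilon})] \tag{53}$$

where

$$\tan\beta_{\epsilon\epsilon} = \epsilon\,\partial\theta_\epsilon/\partial\epsilon \tag{54}$$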

For a coherent spacetime state described by equations (24) and (41) it follows
that

$$\beta_{pp} = \pi/2, \qquad \beta_{\epsilon\epsilon} = \pi/2 \tag{55}$$

and

$$d\bar p = j\bar p\, d\theta_p, \qquad d\bar\epsilon = j\bar\epsilon\, d\theta_\epsilon \tag{56}$$

where the magnitudes of the single particle momentum and energy are constants,
$p = p_c$ and $\epsilon = \epsilon_c$. Equation (56) will be used in Sections 2 through 5 to
evaluate the basic complex number integrals of the statistical mechanics of
Boltzmann, Fermi and Bose gases for the case of coherent spacetime.

The state equations and thermonuclear reaction rates associated with the
ideal Boltzmann, Fermi and Bose gases in incoherent spacetime are given by well
known standard formalisms.$^{1-3}$ This paper extends these calculations to the cases
of partially coherent and totally coherent spacetime that are associated with
matter in strong gravitational fields. Specifically, Section 2 considers the
determination of the chemical potential, internal energy and pressure of an ideal
Boltzmann gas that is located in spacetime with broken internal symmetries.
Section 3 studies the state equation of a noninteracting Fermi gas in a spacetime
with partial or total coherence. Section 4 treats the statistical mechanics of
an ideal Bose gas in broken symmetry spacetime, and finally Section 5 evaluates
the thermonuclear reaction rates that are expected to occur in ideal Boltzmann,
Fermi and Bose gases that are located in the partially coherent or totally
coherent spacetime that exists in the presence of strong gravitational fields.

2.  STATISTICAL  MECHANICS  OF  A  BOLTZMANN  GAS  IN  A  STRONG  GRAVITATIONAL 
FIELD.  This  section  examines  the  statistical  thermodynamics  of  a  noninteracting 
Boltzmann  gas  that  is  located  in  a  gravitational  field  or  other  external  field, 
such  as  an  electromagnetic  field,  that  induces  a  broken  symmetry  in  the  local 
space  and  time  coordinates.  The  Boltzmann  gas  is  important  for  the  study  of 
thermonuclear  reactions  in  ordinary  stars  where  the  nuclei  can  be  described  by 
an  ideal  classical  gas.  The  statistical  mechanics  of  an  ideal  nonrelativistic 
Boltzmann  gas  that  is  located  in  incoherent  spacetime  is  exhaustively  treated 
in the literature.$^{33-35}$ This section calculates the chemical potential, internal
energy and pressure of an ideal Boltzmann gas for the case of a partially
coherent  spacetime  and  for  a  totally  coherent  spacetime. 

A.  Particle  Number  and  Chemical  Potential. 

It is assumed that the number of particles $\bar n$ is a complex number in internal
space, like the complex magnetic quantum number M' of Reference 32, because
it represents a quantum number for a wave function that is expressed in terms
of a complex number coordinate $\bar\zeta$ in internal space as follows

$$\bar\psi = A\exp(j\bar n\bar\zeta), \qquad \bar n = n\exp(j\theta_n), \qquad \bar\zeta = \zeta\exp(j\theta_\zeta) \tag{57}$$

where $\bar\psi$, $\bar n$ and $\bar\zeta$ = complex number wave function, particle number and internal
space coordinate respectively. Then by the same argument given in Reference 32
for the complex magnetic quantum number M it follows that the requirement for
periodicity of the wave function gives

$$\bar n\bar\zeta = n\zeta, \qquad \theta_n = -\theta_\zeta \tag{58}$$

The measured internal space coordinate is given by$^{32}$

$$\zeta_m = \zeta\cos\theta_\zeta \tag{59}$$

The requirement that the wave function be periodic in terms of the measured
internal space coordinate gives the following condition$^{32}$

$$n\zeta = (n/\cos\theta_\zeta)\,\zeta_m = N\zeta_m \tag{60}$$

where $N$ = integer, so that the complex particle number can be written

$$\bar n = n\exp(j\theta_n) \tag{61}$$

$$\quad = N\cos\theta_n\exp(j\theta_n) = N\cos\theta_\zeta\exp(-j\theta_\zeta) \tag{62}$$

where $N$ = integer number of particles. Therefore

$$n = N\cos\theta_n = N\cos\theta_\zeta \tag{63}$$

and the real and imaginary parts of the particle number are written as

$$n_R = N\cos^2\theta_n \tag{64}$$

$$n_I = N\cos\theta_n\sin\theta_n \tag{65}$$

The measured particle number is then given by

$$n_m = N\cos^2\theta_n = n\cos\theta_n \tag{66}$$


The quantities $N$ and $\theta_n$ are assumed to be known, and equations (61) through (66)
immediately determine $\bar n$, $n$, $n_R$, $n_I$ and $n_m$. These equations are formally
identical to the equations for the complex magnetic quantum number M and the integer
magnetic quantum number m with the correspondence $\bar n$ ↔ M and N ↔ m if N is
allowed to have positive and negative integer values, or if N is allowed to have
only positive integer values these equations are homologous to the equations for
the complex magnetic quantum number M' and the integer quantum number |m| with
the correspondence $\bar n$ ↔ M' and N ↔ |m|.$^{32}$
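The bookkeeping of equations (61) through (66) can be illustrated with a short numerical sketch. It assumes only that $N$ and $\theta_n$ are known; the function name and the sample inputs are hypothetical and are not taken from the paper.

```python
import cmath
import math


def complex_particle_number(N, theta_n):
    """Return (n_bar, n, n_R, n_I, n_m) as defined in equations (61)-(66)."""
    n = N * math.cos(theta_n)                        # magnitude, equation (63)
    n_bar = cmath.rect(n, theta_n)                   # n exp(j theta_n), equation (61)
    n_R = N * math.cos(theta_n) ** 2                 # real part, equation (64)
    n_I = N * math.cos(theta_n) * math.sin(theta_n)  # imaginary part, equation (65)
    n_m = n * math.cos(theta_n)                      # measured value, equation (66)
    return n_bar, n, n_R, n_I, n_m


# Illustrative inputs only: N = 1000 particles, theta_n = 0.3 rad.
n_bar, n, n_R, n_I, n_m = complex_particle_number(1000, 0.3)
```

The measured particle number coincides with the real part of the complex particle number, and for $\theta_n = 0$ all quantities reduce to the integer $N$.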

The total particle number can also be written as an integral of the Boltzmann
distribution function over the space of single particle momenta and energies.$^{33-35}$
The generalization to the case of complex number momenta and energies is written

$$\bar n = \sum_p \bar z\exp(-\beta\bar\epsilon_p) \tag{67}$$

$$\quad = \frac{4\pi V'\bar z}{h^3}\int \bar p^{\,2}\exp[-\beta\bar p^{\,2}/(2m)]\,d\bar p \tag{68}$$

$$\quad = \frac{V'\,2\pi(2m)^{3/2}\bar z}{h^3}\int \bar\epsilon^{1/2}\exp(-\beta\bar\epsilon)\,d\bar\epsilon \tag{69}$$

where $V'$ = broken symmetry volume given by equation (6), $\bar z$ = complex number
fugacity, $h$ = Planck's constant, $\bar p$ = complex number momentum integration
variable, $\beta = 1/(kT)$, $T$ = absolute temperature, $m$ = mass of particles and
$\bar\epsilon_p = \bar\epsilon = \bar p^{\,2}/(2m)$ = complex number kinetic energy per particle which in equation
(69) appears as an integration variable. The complex number fugacity is related
to the complex number chemical potential by the following generalization of
the standard result$^{33-35}$

$$\bar z = z\exp(j\theta_z) = \exp(\beta\bar\mu) \tag{70}$$

where $\bar\mu = \mu\exp(j\theta_\mu)$ = complex number chemical potential. From equation (70) it follows that

$$z = \exp(\beta\mu\cos\theta_\mu), \qquad \ln z = \beta\mu\cos\theta_\mu \tag{71}$$

$$\theta_z = \beta\mu\sin\theta_\mu \tag{72}$$

Inverting equations (71) and (72) gives

$$\tan\theta_\mu = \theta_z\,(\ln z)^{-1} \tag{73}$$

$$\mu = \beta^{-1}\left(\theta_z^2 + \ln^2 z\right)^{1/2} \tag{74}$$

which gives the chemical potential in terms of the fugacity.
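The inversion in equations (71) through (74) can be checked with a round trip. The sketch below assumes the fugacity is the complex exponential of $\beta$ times the complex chemical potential, which is what equations (71) and (72) imply; the function names and sample values are hypothetical, and `atan2` is used to pick the quadrant of $\theta_\mu$, a choice equation (73) by itself leaves open.

```python
import cmath
import math


def fugacity_from_mu(mu, theta_mu, beta):
    """Return (z, theta_z) from equations (71) and (72)."""
    z = math.exp(beta * mu * math.cos(theta_mu))
    theta_z = beta * mu * math.sin(theta_mu)
    return z, theta_z


def mu_from_fugacity(z, theta_z, beta):
    """Return (mu, theta_mu) from equations (73) and (74)."""
    mu = math.hypot(theta_z, math.log(z)) / beta
    theta_mu = math.atan2(theta_z, math.log(z))  # quadrant-aware form of (73)
    return mu, theta_mu


# Illustrative values only.
z, theta_z = fugacity_from_mu(mu=2.0, theta_mu=0.4, beta=1.5)
mu_back, theta_mu_back = mu_from_fugacity(z, theta_z, beta=1.5)
```

Recovering the starting values of the magnitude and phase of the chemical potential confirms that (73) and (74) invert (71) and (72).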


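Equation (69) can be written as

$$\bar n = V'f\bar z\bar A = V'fzA\exp[j(\theta_z + \theta_A)] \tag{75}$$

$$n = V'fzA, \qquad \theta_n = \theta_z + \theta_A \tag{76}$$

where

$$f = 2\pi(2m)^{3/2}/h^3 \tag{77}$$

and where

$$\bar A = A_R + jA_I = A\exp(j\theta_A) = \int \bar\epsilon^{1/2}\exp(-\beta\bar\epsilon)\,d\bar\epsilon \tag{78}$$

$$A = (A_R^2 + A_I^2)^{1/2}, \qquad \tan\theta_A = A_I/A_R \tag{79}$$

The single particle kinetic energy integration variable is written as

$$\bar\epsilon = \epsilon\exp(j\theta_\epsilon) \tag{80}$$

which gives

$$d\bar\epsilon = \sec\beta_{\epsilon\epsilon}\exp[j(\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\epsilon \tag{81}$$

$$\quad = \epsilon\csc\beta_{\epsilon\epsilon}\exp[j(\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\theta_\epsilon \tag{82}$$

where

$$\tan\beta_{\epsilon\epsilon} = \epsilon\,\partial\theta_\epsilon/\partial\epsilon \tag{83}$$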

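Then equation (78) can be written as

$$\bar A = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\epsilon \tag{84}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\theta_\epsilon \tag{85}$$

so that the component integrals become

$$A_R = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{86}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{87}$$

$$A_I = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{88}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac32\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{89}$$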


The values of $A$ and $\theta_A$ are then obtained from equation (79). Equations (63),
(76) and (79) can be used to determine $z$ and $\theta_z$ in the following manner

$$z = n/(V'fA) \tag{90}$$

$$\quad = (N\cos\theta_n)/(V'fA)$$

$$\quad = (n_m/\cos\theta_n)/(V'fA)$$

$$\theta_z = \theta_n - \theta_A \tag{91}$$

where $f$ is defined in equation (77), and where $\bar n$ and $n$ are related to the
integer particle number $N$ by equations (61) through (63). Finally, equations (73)
and (74) can be used to determine $\theta_\mu$ and $\mu$ respectively. The measured particle
number is given by equation (66). Note that for $\theta_n = 0$ equation (76) shows that
the fugacity can still have an internal phase angle.

All internal phase angles are set equal to zero for the case of incoherent
spacetime. For this case equation (75) becomes

$$N = Vfz_{inc}A^{inc} \tag{92}$$

where from equation (78)

$$A^{inc} = \int_0^\infty \epsilon^{1/2}\exp(-\beta\epsilon)\,d\epsilon = \tfrac12\sqrt{\pi}\,\beta^{-3/2} = \tfrac12\sqrt{\pi}\,(kT)^{3/2} \tag{93}$$

Then equations (92) and (93) can be used to determine $z_{inc}$. These results are
the standard equations for the ideal Boltzmann gas.$^{33-35}$




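Consider now the case of incoherent spacetime when the internal phase angle of the
single particle energy has the constant value $\theta_\epsilon = \theta_\epsilon^i$ = constant; equations (86) and (88)
are then written as

$$A_R^{inc} = \int_0^\infty \epsilon^{1/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\cos(\tfrac32\theta_\epsilon^i - \beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{94}$$

$$\quad = I_1\cos(\tfrac32\theta_\epsilon^i) + I_2\sin(\tfrac32\theta_\epsilon^i)$$

$$A_I^{inc} = \int_0^\infty \epsilon^{1/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\sin(\tfrac32\theta_\epsilon^i - \beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{95}$$

$$\quad = I_1\sin(\tfrac32\theta_\epsilon^i) - I_2\cos(\tfrac32\theta_\epsilon^i)$$

where

$$I_1 = \int_0^\infty \epsilon^{1/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\cos(\beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{96}$$

$$I_2 = \int_0^\infty \epsilon^{1/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\sin(\beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{97}$$

Setting $\epsilon = x^2$ allows these integrals to be written as

$$I_1 = 2\int_0^\infty x^2\exp(-\beta x^2\cos\theta_\epsilon^i)\cos(\beta x^2\sin\theta_\epsilon^i)\,dx \tag{98}$$

$$I_2 = 2\int_0^\infty x^2\exp(-\beta x^2\cos\theta_\epsilon^i)\sin(\beta x^2\sin\theta_\epsilon^i)\,dx \tag{99}$$

These integrals are evaluated in tables of integrals with the result3

$$I_1 = \tfrac12\sqrt{\pi}\,\beta^{-3/2}\cos(\tfrac32\theta_\epsilon^i) \tag{100}$$

$$I_2 = \tfrac12\sqrt{\pi}\,\beta^{-3/2}\sin(\tfrac32\theta_\epsilon^i) \tag{101}$$

Combining equations (94), (95), (100) and (101) gives

$$A_R^{inc} = \tfrac12\sqrt{\pi}\,\beta^{-3/2} \tag{102}$$

$$A_I^{inc} = 0 \tag{103}$$

so that within the approximation $\theta_\epsilon = \theta_\epsilon^i$ = constant the internal phase angle $\theta_\epsilon^i$
has no effect on the values of the integrals which are now just equivalent to
equation (93) for the case when all internal phase angles are set equal to zero.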
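The independence of the result from the constant phase angle can be checked numerically. The following sketch integrates the complex integrand behind equations (94) and (95) with a hand-written composite Simpson rule after the substitution $\epsilon = x^2$; the helper names, the truncation point of the integral and the sample values of $\beta$ and the angle are assumptions made for illustration only.

```python
import cmath
import math


def simpson(func, a, b, steps=4000):
    """Composite Simpson rule; steps must be even. Works for complex integrands."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3


def a_inc(beta, theta):
    """A_R + j A_I of equations (94) and (95) for a constant energy phase angle."""
    def integrand(x):
        eps = x * x  # eps = x**2, so eps**0.5 d(eps) = 2 x**2 dx
        phase = 1.5 * theta - beta * eps * math.sin(theta)
        damping = math.exp(-beta * eps * math.cos(theta))
        return 2 * x * x * damping * cmath.exp(1j * phase)

    # The damping factor is below exp(-60) beyond x_max when cos(theta) > 0.
    x_max = math.sqrt(60.0 / (beta * math.cos(theta)))
    return simpson(integrand, 0.0, x_max)


value = a_inc(beta=2.0, theta=math.pi / 6)
expected = 0.5 * math.sqrt(math.pi) * 2.0 ** -1.5  # equation (102)
```

For these inputs the real part of `value` agrees with `expected` and the imaginary part vanishes to within the quadrature error, in line with equations (102) and (103).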

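Consider the representation of the particle number and the calculation of
the chemical potential for a Boltzmann gas in a coherent spacetime state. For
a coherent spacetime state $\beta_{\epsilon\epsilon} = \pi/2$ and equation (85) becomes

$$\bar A^{coh} = A^{coh}\exp(j\theta_A^{coh}) \tag{104}$$

$$\quad = \epsilon_c^{3/2}\bar W = \epsilon_c^{3/2}W\exp(j\theta_W) = \epsilon_c^{3/2}(W_R + jW_I)$$

$$A^{coh} = \epsilon_c^{3/2}W, \qquad \theta_A^{coh} = \theta_W \tag{105}$$

where $\epsilon = \epsilon_c$ = constant magnitude of the single particle kinetic energy, and
where

$$\bar W = j\int_0^{\pi/3}\exp(j\tfrac32\theta_\epsilon)\exp(-\beta\bar\epsilon)\,d\theta_\epsilon \tag{106}$$

$$\quad = j\int_0^{\pi/3}\exp(-\beta\epsilon_c\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \beta\epsilon_c\sin\theta_\epsilon)]\,d\theta_\epsilon$$

$$W_R = \int_0^{\pi/3}\exp(-\beta\epsilon_c\cos\theta_\epsilon)\sin(\beta\epsilon_c\sin\theta_\epsilon - \tfrac32\theta_\epsilon)\,d\theta_\epsilon \tag{107}$$

$$W_I = \int_0^{\pi/3}\exp(-\beta\epsilon_c\cos\theta_\epsilon)\cos(\beta\epsilon_c\sin\theta_\epsilon - \tfrac32\theta_\epsilon)\,d\theta_\epsilon \tag{108}$$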


where the $j$ that appears in equation (106) results from $\beta_{\epsilon\epsilon} = \pi/2$ in equation
(85) and agrees with the results in equation (56). The particle number for the
coherent spacetime state is then obtained from equations (75) and (104) to be

$$\bar n = n\exp(j\theta_n) = V\theta_V f\bar z_{coh}\bar A^{coh} = V\theta_V f\epsilon_c^{3/2}\bar z_{coh}\bar W = V\theta_V f\epsilon_c^{3/2}z_{coh}W\exp[j(\theta_z^{coh} + \theta_W)] \tag{109}$$

where $\bar z_{coh} = z_{coh}\exp(j\theta_z^{coh})$ and

$$W = (W_R^2 + W_I^2)^{1/2}, \qquad \tan\theta_W = W_I/W_R \tag{110}$$

$$n = V\theta_V f\epsilon_c^{3/2}z_{coh}W, \qquad \theta_n = \theta_z^{coh} + \theta_W \tag{111}$$

From equations (63) and (111) it follows that the components of the complex
number fugacity are given by

$$z_{coh} = (N\cos\theta_n)/(V\theta_V fA^{coh}) \tag{112}$$

$$\quad = (N\cos\theta_n)/(V\theta_V f\epsilon_c^{3/2}W)$$

$$\theta_z^{coh} = \theta_n - \theta_W \tag{113}$$

Then equations (73) and (74) can be used to determine the magnitude and the
internal phase angle of the chemical potential in coherent spacetime, $\mu_{coh}$ and
$\theta_\mu^{coh}$, which gives $\bar\mu_{coh} = \mu_{coh}\exp(j\theta_\mu^{coh})$. The upper integration limit of $\pi/3$
in equations (106) through (108) is obtained from equations (10), (26) and (51)
to be $\theta_\epsilon = 2\theta_v = 2(\theta_\rho - \theta_t^{\rho}) = 2(\pi/3 - \pi/6) = \pi/3$.
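The coherent spacetime integral of equation (106) has no closed form for general values of the product of $\beta$ and $\epsilon_c$, but it is easy to evaluate numerically. The sketch below computes it both as a complex integral and through the component integrals (107) and (108), and then applies equations (112) and (113). All function and parameter names are hypothetical, the Simpson helper is a generic quadrature rather than anything prescribed by the paper, and `cmath.phase` is used as a quadrature-aware reading of equation (110).

```python
import cmath
import math


def simpson(func, a, b, steps=2000):
    """Composite Simpson rule; steps must be even."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3


def w_bar(beta_eps_c, upper=math.pi / 3):
    """Complex W of equation (106); beta_eps_c is the product beta * eps_c."""
    def integrand(theta):
        beta_eps_bar = beta_eps_c * cmath.exp(1j * theta)
        return cmath.exp(1.5j * theta) * cmath.exp(-beta_eps_bar)
    return 1j * simpson(integrand, 0.0, upper)


def w_components(beta_eps_c, upper=math.pi / 3):
    """(W_R, W_I) from the real integrals of equations (107) and (108)."""
    def damp(theta):
        return math.exp(-beta_eps_c * math.cos(theta))

    def arg(theta):
        return beta_eps_c * math.sin(theta) - 1.5 * theta

    w_r = simpson(lambda th: damp(th) * math.sin(arg(th)), 0.0, upper)
    w_i = simpson(lambda th: damp(th) * math.cos(arg(th)), 0.0, upper)
    return w_r, w_i


def coherent_fugacity(N, theta_n, V, theta_V, f, eps_c, W):
    """(z_coh, theta_z_coh) from equations (112) and (113); W is the complex W."""
    z = N * math.cos(theta_n) / (V * theta_V * f * eps_c ** 1.5 * abs(W))
    theta_z = theta_n - cmath.phase(W)
    return z, theta_z


W = w_bar(0.8)                 # illustrative value of beta * eps_c
W_R, W_I = w_components(0.8)
```

In the limit where the product of $\beta$ and $\epsilon_c$ goes to zero the integral can be done by hand and equals $(j - 1)/1.5$, which gives a convenient check on the quadrature.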

B.  Internal  Energy  of  Boltzmann  Gas. 

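In the presence of an external field which breaks the spacetime coordinate
symmetry of the vacuum, the internal energy of a Boltzmann gas can be written as
the following complex number generalization of the standard scalar result$^{33-35}$

$$\bar U = U\exp(j\theta_U) = \frac{4\pi V'\bar z}{h^3}\int \bar p^{\,2}\,\frac{\bar p^{\,2}}{2m}\exp[-\beta\bar p^{\,2}/(2m)]\,d\bar p \tag{114}$$

$$\quad = V'f\bar z\bar B$$

where

$$\bar B = B_R + jB_I = B\exp(j\theta_B) = \int \bar\epsilon^{3/2}\exp(-\beta\bar\epsilon)\,d\bar\epsilon \tag{115}$$

where the broken symmetry volume $V'$ is given by equation (6), $f$ is given by
equation (77), and where $\bar z$ is given by equation (70). For partial coherence
the integral in equation (115) can be written as

$$\bar B = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\epsilon \tag{116}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\theta_\epsilon \tag{117}$$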


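and therefore

$$B_R = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{118}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{119}$$

$$B_I = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{120}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac52\theta_\epsilon - \beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{121}$$

Equation (114) can be written as

$$U = V'fzB, \qquad \theta_U = \theta_z + \theta_B \tag{122}$$

where

$$B = (B_R^2 + B_I^2)^{1/2}, \qquad \tan\theta_B = B_I/B_R \tag{123}$$

The average kinetic energy per particle is then given by equations (75) and
(114) to be

$$\bar\epsilon_{av} = \bar U/\bar n = \bar B/\bar A = (B/A)\exp[j(\theta_B - \theta_A)] \tag{124}$$

$$\epsilon_{av} = U/n = B/A, \qquad \theta_\epsilon^{av} = \theta_B - \theta_A \tag{125}$$

Then the real and imaginary parts of $\bar\epsilon_{av}$ are given by

$$\epsilon_{avR} = \epsilon_{av}\cos\theta_\epsilon^{av} = (A_RB_R + A_IB_I)/(A_R^2 + A_I^2) \tag{126}$$

$$\epsilon_{avI} = \epsilon_{av}\sin\theta_\epsilon^{av} = (A_RB_I - A_IB_R)/(A_R^2 + A_I^2) \tag{127}$$

where $A_R$ and $A_I$ are given by equations (86) through (89).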
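Equations (126) and (127) are the real and imaginary parts of a complex quotient, which can be confirmed with a few lines of arithmetic. The component values below are arbitrary illustrative numbers, not results from the paper, and the function name is hypothetical.

```python
def average_energy_components(A_R, A_I, B_R, B_I):
    """Real and imaginary parts of B_bar / A_bar, equations (126) and (127)."""
    denom = A_R ** 2 + A_I ** 2
    eps_R = (A_R * B_R + A_I * B_I) / denom
    eps_I = (A_R * B_I - A_I * B_R) / denom
    return eps_R, eps_I


# Arbitrary component values chosen only to exercise the formulas.
A_R, A_I, B_R, B_I = 0.8, 0.3, 1.1, -0.2
eps_R, eps_I = average_energy_components(A_R, A_I, B_R, B_I)

# Direct complex division, corresponding to equation (124).
ratio = complex(B_R, B_I) / complex(A_R, A_I)
```

The pair `(eps_R, eps_I)` matches `ratio.real` and `ratio.imag`, so either form can be used once the component integrals are known.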

For the case of incoherent spacetime all internal phase angles are set
equal to zero and equation (122) gives

$$U^{inc} = Vfz_{inc}B^{inc} \tag{128}$$

where $f$ is given by equation (77), and where equation (115) becomes

$$B^{inc} = \int_0^\infty \epsilon^{3/2}\exp(-\beta\epsilon)\,d\epsilon = \beta^{-5/2}\,\Gamma(5/2) = \tfrac34\sqrt{\pi}\,\beta^{-5/2} = \tfrac34\sqrt{\pi}\,(kT)^{5/2} \tag{129}$$

which is the standard result.$^{33-35}$


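If the internal phase angle of the single particle energy is assumed to
have a constant value $\theta_\epsilon = \theta_\epsilon^i$ = constant, the integrals in equations (118) and
(120) become

$$B_R^{inc} = \int_0^\infty \epsilon^{3/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\cos(\tfrac52\theta_\epsilon^i - \beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{130}$$

$$\quad = I_3\cos(\tfrac52\theta_\epsilon^i) + I_4\sin(\tfrac52\theta_\epsilon^i) \tag{131}$$

$$B_I^{inc} = \int_0^\infty \epsilon^{3/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\sin(\tfrac52\theta_\epsilon^i - \beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{132}$$

$$\quad = I_3\sin(\tfrac52\theta_\epsilon^i) - I_4\cos(\tfrac52\theta_\epsilon^i) \tag{133}$$

where

$$I_3 = \int_0^\infty \epsilon^{3/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\cos(\beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{134}$$

$$I_4 = \int_0^\infty \epsilon^{3/2}\exp(-\beta\epsilon\cos\theta_\epsilon^i)\sin(\beta\epsilon\sin\theta_\epsilon^i)\,d\epsilon \tag{135}$$

By integration by parts it is easy to show that

$$I_3 = \tfrac32\beta^{-1}(I_1\cos\theta_\epsilon^i - I_2\sin\theta_\epsilon^i) \tag{136}$$

$$\quad = \tfrac34\sqrt{\pi}\,\beta^{-5/2}\cos(\tfrac52\theta_\epsilon^i) \tag{137}$$

$$I_4 = \tfrac32\beta^{-1}(I_1\sin\theta_\epsilon^i + I_2\cos\theta_\epsilon^i) \tag{138}$$

$$\quad = \tfrac34\sqrt{\pi}\,\beta^{-5/2}\sin(\tfrac52\theta_\epsilon^i) \tag{139}$$

where $I_1$ and $I_2$ are given by equations (100) and (101). Combining equations
(131), (133), (137) and (139) gives

$$B_R^{inc} = \tfrac34\sqrt{\pi}\,\beta^{-5/2} \tag{140}$$

$$B_I^{inc} = 0 \tag{141}$$

and it is seen that within the approximation $\theta_\epsilon = \theta_\epsilon^i$ = constant the phase angle
$\theta_\epsilon^i$ does not enter the final expression for the internal energy of a Boltzmann
gas, and the result in equation (140) is identical to that in equation (129)
for the case when all internal phase angles are set equal to zero.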

For the case of coherent spacetime $\beta_{\epsilon\epsilon} = \pi/2$ and $V' = V\theta_V$ and equations
(114) and (117) become

$$\bar U^{coh} = V\theta_V f\bar z_{coh}\bar B^{coh} = V\theta_V f\epsilon_c^{5/2}\bar z_{coh}\bar Q = V\theta_V f\epsilon_c^{5/2}z_{coh}Q\exp[j(\theta_z^{coh} + \theta_Q)] \tag{142}$$

$$U^{coh} = V\theta_V f\epsilon_c^{5/2}z_{coh}Q \tag{143}$$

$$\theta_U^{coh} = \theta_z^{coh} + \theta_Q \tag{144}$$

where $\epsilon_c$ = constant value of the magnitude of the single particle kinetic
energy, and where

(145)

$$\epsilon_{avI} = \epsilon_c\,(Q/W)\sin(\theta_Q - \theta_W) \tag{153}$$

$$\quad = \epsilon_c\,(W_RQ_I - W_IQ_R)/(W_R^2 + W_I^2)$$

where $\bar W$ is given by equation (106), $W_R$ and $W_I$ are given by equations (107) and
(108), and where $W$ and $\theta_W$ are given in equation (110).

C.  Pressure  of  a  Broken  Symmetry  Boltzmann  Gas. 

The pressure of a Boltzmann gas that is located in a broken symmetry
spacetime, such as may be induced by a strong gravitational field, can be
calculated from the following generalization of a standard result$^{33-35}$

$$\bar n = V'\bar z\,\frac{\partial}{\partial\bar z}\left[\bar P/(kT)\right] = V'\bar z\,\frac{\partial}{\partial\bar z}(\beta\bar P) \tag{154}$$

Combining equations (75) and (154) gives immediately

$$\bar P = (\bar n/V')\,kT \tag{155}$$

where $V'$ is given by equation (6). Combining equations (61) and (155) gives

$$\bar P = (N/V')\,kT\cos\theta_n\exp(j\theta_n) \tag{156}$$

If  is  constant  it  follows  from  equation  (6)  that 

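$$V' = V\sec\beta_{VV} = V\theta_V\csc\beta_{VV} \tag{157}$$

so that for this special case

$$\bar P = (N/V)\,kT\cos\beta_{VV}\cos\theta_n\exp(j\theta_n) \tag{158}$$

$$\quad = [N/(V\theta_V)]\,kT\sin\beta_{VV}\cos\theta_n\exp(j\theta_n) \tag{159}$$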


where $N$ = integer number of particles in a container. The factors $\cos\beta_{VV}$ or
$\sin\beta_{VV}$ arise from the broken symmetry of spacetime.$^{32}$ The factor $\cos\theta_n$ arises
from the assumed periodicity of the quantum mechanical wave function in the
measured internal space coordinates. The real and imaginary parts of the complex
number pressure are given by

$$P_R = (N/V)\,kT\cos\beta_{VV}\cos^2\theta_n \tag{160}$$

$$\quad = [N/(V\theta_V)]\,kT\sin\beta_{VV}\cos^2\theta_n \tag{161}$$

$$P_I = (N/V)\,kT\cos\beta_{VV}\cos\theta_n\sin\theta_n \tag{162}$$

$$\quad = [N/(V\theta_V)]\,kT\sin\beta_{VV}\cos\theta_n\sin\theta_n \tag{163}$$

The measured pressure is given by $P_m = P_R$.
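As a numerical illustration of equations (158), (160) and (162), the sketch below evaluates the complex pressure for the special case to which equation (157) applies. The function name and the sample inputs are hypothetical; SI units and the exact SI value of the Boltzmann constant are assumed.

```python
import cmath
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)


def complex_pressure(N, V, T, beta_VV, theta_n):
    """Complex pressure of equation (158) for N particles in volume V at temperature T."""
    magnitude = N / V * K_B * T * math.cos(beta_VV) * math.cos(theta_n)
    return cmath.rect(magnitude, theta_n)


# Illustrative inputs only: 1e23 particles in one litre at 300 K.
p_bar = complex_pressure(N=1e23, V=1e-3, T=300.0, beta_VV=0.4, theta_n=0.2)
p_measured = p_bar.real  # real part, equation (160)
```

Setting both angles to zero in this function recovers the familiar ideal gas value $NkT/V$.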

The incoherent spacetime pressure of a Boltzmann gas is obtained from
equation (160) with $\beta_{VV} = 0$ and $\theta_n = 0$ which gives the standard result$^{33-35}$

$$P_m = (N/V)\,kT \tag{164}$$

The coherent spacetime form of the pressure of an ideal Boltzmann gas is
obtained from equation (161) with $\beta_{VV} = \pi/2$ as

$$P_m = [N/(V\theta_V)]\,kT\cos^2\theta_n \tag{165}$$

where $\theta_n = -\theta_\zeta$ and $\theta_\zeta$ = phase angle of the internal space coordinate that is
associated with particle number. As a first approximation $\theta_n \sim 0$ so that
equation (165) becomes

$$P_m \sim [N/(V\theta_V)]\,kT \tag{166}$$

for the Boltzmann gas in coherent spacetime.

3.  IDEAL  FERMI  GAS  IN  BROKEN  SYMMETRY  SPACETIME.  This  section  calculates 
the  chemical  potential,  energy  density  and  pressure  of  an  ideal  Fermi  gas  that 
is  located  in  a  gravitational  field  or  other  external  field  which  breaks  the 
local  symmetry  of  the  spacetime  coordinates.  The  cases  of  partially  coherent 
and  totally  coherent  spacetime  are  considered.  These  calculations  may  be  of  value 
for  the  study  of  the  thermonuclear  reactions  in  stars  where  spin  1/2  nuclei  are 
the  dominant  reacting  particle  species  as  suggested  in  Section  5. 

A.  Complex  Number  Chemical  Potential  for  Fermi  Gas. 

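The chemical potential is determined by writing an expression for the
total number of particles in terms of a sum over the complex number single
particle momenta or single particle kinetic energies. The complex number
generalization of the standard scalar result is easily written as$^{33-35}$

$$\bar n = \sum_p\left[\bar z^{-1}\exp(\beta\bar\epsilon_p) + 1\right]^{-1} \tag{167}$$

$$\quad = \frac{4\pi V'}{h^3}\int \bar p^{\,2}\left\{\bar z^{-1}\exp[\beta\bar p^{\,2}/(2m)] + 1\right\}^{-1}d\bar p$$

$$\quad = fV'\int \bar\epsilon^{1/2}\left[\bar z^{-1}\exp(\beta\bar\epsilon) + 1\right]^{-1}d\bar\epsilon$$

$$\quad = fV'\int \bar\epsilon^{1/2}\left[\exp(\bar\xi) + 1\right]^{-1}d\bar\epsilon$$

where $f$ is given by equation (77) and where

$$\bar\xi = \beta(\bar\epsilon - \bar\mu) \tag{168}$$

where $\bar z$ and $\bar\mu$ are related by equation (70). The following series expansion is
used to evaluate the integral in equation (167)

$$[\exp(\bar\xi) + 1]^{-1} = \exp(-\bar\xi)[1 + \exp(-\bar\xi)]^{-1} \tag{169}$$

$$\quad = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\exp(-\nu\bar\xi)$$

$$\quad = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z^{\,\nu}\exp(-\nu\beta\bar\epsilon)$$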
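The series in equation (169) is a geometric series in $\exp(-\bar\xi)$, so it converges when the real part of $\bar\xi$ is positive. A quick numerical comparison for a complex argument is sketched below; the function names, the number of terms and the sample value are chosen only for illustration.

```python
import cmath


def fermi_factor(xi):
    """Closed form on the left side of equation (169)."""
    return 1 / (cmath.exp(xi) + 1)


def fermi_series(xi, terms=200):
    """Partial sum of the alternating series on the right side of equation (169)."""
    return sum((-1) ** (nu - 1) * cmath.exp(-nu * xi) for nu in range(1, terms + 1))


xi = complex(0.7, 0.4)  # sample complex argument with positive real part
difference = abs(fermi_series(xi) - fermi_factor(xi))
```

For this argument the partial sum agrees with the closed form to rounding error; arguments with a non-positive real part fall outside the radius of convergence and would need a different treatment.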

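Then equation (167) can be written as

$$\bar n = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z^{\,\nu}\bar H_\nu \tag{170}$$

where

$$\bar H_\nu = H_\nu\exp(j\theta_{H\nu}) = H_{\nu R} + jH_{\nu I} = \int \bar\epsilon^{1/2}\exp(-\nu\beta\bar\epsilon)\,d\bar\epsilon \tag{171}$$

and where $f$ is given by equation (77).

The integral in equation (171) can be rewritten for partially coherent
spacetime as

$$\bar H_\nu = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\epsilon \tag{172}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\theta_\epsilon \tag{173}$$

or in component form as

$$H_{\nu R} = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{174}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{175}$$

$$H_{\nu I} = \int_0^\infty \epsilon^{1/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{176}$$

$$\quad = \int_0^{\pi/3} \epsilon^{3/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac32\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{177}$$

These component terms give

$$H_\nu = (H_{\nu R}^2 + H_{\nu I}^2)^{1/2}, \qquad \tan\theta_{H\nu} = H_{\nu I}/H_{\nu R} \tag{178}$$

and then equation (170) can be written as

$$\bar n = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu H_\nu\exp[j(\nu\theta_z + \theta_{H\nu})] \tag{179}$$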


Taking the real and imaginary parts of equation (179) and using equations (64)
and (65) gives

$$N\cos^2\theta_n = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu H_\nu\cos(\nu\theta_z + \theta_{H\nu}) \tag{180}$$

$$N\cos\theta_n\sin\theta_n = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu H_\nu\sin(\nu\theta_z + \theta_{H\nu}) \tag{181}$$

where $N$ = integer number of fermions. Equations (180) and (181) are two
equations which can be used to determine $z$ and $\theta_z$, from which $\mu$ and $\theta_\mu$ can be
obtained by using equations (73) and (74). Equation (180) gives the measured
particle number as shown in equation (66).


The case of a Fermi gas in incoherent spacetime is obtained by taking all
internal phase angles to be equal to zero. Then equation (170) gives

$$N = fV\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{inc}^{\nu}H_\nu^{inc} \tag{182}$$

where $\epsilon_c$ = constant magnitude of the single particle kinetic energy, and where

$$\bar E_\nu = j\int_0^{\pi/3}\exp(j\tfrac32\theta_\epsilon)\exp(-\nu\beta\bar\epsilon)\,d\theta_\epsilon \tag{190}$$

or

$$\bar E_\nu = j\int_0^{\pi/3}\exp(-\nu\beta\epsilon_c\cos\theta_\epsilon)\exp[j(\tfrac32\theta_\epsilon - \nu\beta\epsilon_c\sin\theta_\epsilon)]\,d\theta_\epsilon \tag{191}$$

which can be written in component form as

$$E_{\nu R} = \int_0^{\pi/3}\exp(-\nu\beta\epsilon_c\cos\theta_\epsilon)\sin(\nu\beta\epsilon_c\sin\theta_\epsilon - \tfrac32\theta_\epsilon)\,d\theta_\epsilon \tag{192}$$

$$E_{\nu I} = \int_0^{\pi/3}\exp(-\nu\beta\epsilon_c\cos\theta_\epsilon)\cos(\nu\beta\epsilon_c\sin\theta_\epsilon - \tfrac32\theta_\epsilon)\,d\theta_\epsilon \tag{193}$$

$$E_\nu = (E_{\nu R}^2 + E_{\nu I}^2)^{1/2}, \qquad \tan\theta_{E\nu} = E_{\nu I}/E_{\nu R} \tag{194}$$


The values of $\bar z_{coh} = (z_{coh}, \theta_z^{coh})$ can be obtained from equations (186) through
(189) by writing

$$\bar n = fV\theta_V\epsilon_c^{3/2}\sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z_{coh}^{\,\nu}\bar E_\nu \tag{195}$$

or equivalently as

$$\bar n = fV\theta_V\epsilon_c^{3/2}\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}E_\nu\exp[j(\nu\theta_z^{coh} + \theta_{E\nu})] \tag{196}$$

The real and imaginary parts of equation (196) give

$$N\cos^2\theta_n = fV\theta_V\epsilon_c^{3/2}\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}E_\nu\cos(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{197}$$

$$N\cos\theta_n\sin\theta_n = fV\theta_V\epsilon_c^{3/2}\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}E_\nu\sin(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{198}$$

Equations (197) and (198) are the two equations that determine $z_{coh}$ and $\theta_z^{coh}$ for
coherent spacetime. Equations (73) and (74) can be used to determine $\mu_{coh}$ and
$\theta_\mu^{coh}$. Equation (197) is the expression for the measured fermion number for
coherent spacetime in accordance with equation (66).

B.  Internal  Energy  of  Fermi  Gas. 


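For a Fermi gas located in a strong gravitational or magnetic field the
spacetime symmetry is broken and the internal energy is written as a
generalization of the standard scalar form as follows$^{33-35}$

$$\bar U = U\exp(j\theta_U) = fV'\int \bar\epsilon^{3/2}\left[\exp(\bar\xi) + 1\right]^{-1}d\bar\epsilon \tag{199}$$

where $f$ and $\bar\xi$ are given by equations (77) and (168) respectively. Using the
series expansion in equation (169) gives

$$\bar U = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z^{\,\nu}\bar J_\nu \tag{200}$$

$$\quad = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu J_\nu\exp[j(\nu\theta_z + \theta_{J\nu})] \tag{201}$$

where

$$\bar J_\nu = J_\nu\exp(j\theta_{J\nu}) = J_{\nu R} + jJ_{\nu I} = \int \bar\epsilon^{3/2}\exp(-\nu\beta\bar\epsilon)\,d\bar\epsilon \tag{202}$$

$$J_\nu = (J_{\nu R}^2 + J_{\nu I}^2)^{1/2}, \qquad \tan\theta_{J\nu} = J_{\nu I}/J_{\nu R} \tag{203}$$

The problem then is to determine $J_{\nu R}$ and $J_{\nu I}$.
For partially coherent spacetime the integral in equation (202) can be
written as

$$\bar J_\nu = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\epsilon \tag{204}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\exp[j(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})]\,d\theta_\epsilon \tag{205}$$

from which the real and imaginary parts are obtained as

$$J_{\nu R} = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{206}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\cos(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{207}$$

$$J_{\nu I} = \int_0^\infty \epsilon^{3/2}\sec\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\epsilon \tag{208}$$

$$\quad = \int_0^{\pi/3} \epsilon^{5/2}\csc\beta_{\epsilon\epsilon}\exp(-\nu\beta\epsilon\cos\theta_\epsilon)\sin(\tfrac52\theta_\epsilon - \nu\beta\epsilon\sin\theta_\epsilon + \beta_{\epsilon\epsilon})\,d\theta_\epsilon \tag{209}$$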

These component values can be used to determine $J_\nu$ and $\theta_{J\nu}$ by equation (203),
and then $\bar U$ is calculated from equation (201) in terms of $z$ and $\theta_z$ which are
determined from equations (180) and (181). The real and imaginary components
of the internal energy are obtained from equation (201) as

$$U_R = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu J_\nu\cos(\nu\theta_z + \theta_{J\nu}) \tag{210}$$

$$U_I = fV'\sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^\nu J_\nu\sin(\nu\theta_z + \theta_{J\nu}) \tag{211}$$

The measured value of the internal energy is given by $U_m = U_R$. The average
complex number energy per fermion is obtained from equations (170) and (200)
to be

$$\bar\epsilon_{av} = \bar U/\bar n = \left[\sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z^{\,\nu}\bar J_\nu\right]\Big/\left[\sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar z^{\,\nu}\bar H_\nu\right] \tag{212}$$

whose real part gives the measured average energy per fermion as $\epsilon_{avm} = \epsilon_{avR}$.

The internal energy for the case of incoherent spacetime is obtained by
setting all internal phase angles equal to zero and equation (201) becomes

$$ U^{inc} = f V \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{inc}^{\nu} J_{\nu}^{inc} \tag{213} $$

where from equation (202)

$$ J_{\nu}^{inc} = \int_0^{\infty} \varepsilon^{3/2} \exp(-\nu\beta\varepsilon)\, d\varepsilon = (\nu\beta)^{-5/2}\, \Gamma(5/2) = \tfrac{3}{4}\sqrt{\pi}\, \nu^{-5/2} (kT)^{5/2} \tag{214} $$


Equations (213) and (214) represent the standard case for the internal energy
of an ideal Fermi gas.33-35 A comparison of equations (183) and (214) shows that

$$ H_{\nu}^{inc} = \tfrac{2}{3}\, \beta\nu\, J_{\nu}^{inc} \tag{215} $$
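
The closed forms in equations (214) and (215) can be checked numerically. The following is only a minimal sketch: the function names are hypothetical, and the form used for $H_{\nu}^{inc}$, namely $\Gamma(3/2)(\nu\beta)^{-3/2}$, is an assumption standing in for equation (183), which is not reproduced here.

```python
import math


def j_inc(nu, kT):
    """J_nu^inc = Gamma(5/2) * (nu*beta)**(-5/2), the middle form of equation (214)."""
    beta = 1.0 / kT
    return math.gamma(2.5) * (nu * beta) ** -2.5


def h_inc(nu, kT):
    """Assumed form of H_nu^inc: the integral of e^(1/2) exp(-nu*beta*e) de."""
    beta = 1.0 / kT
    return math.gamma(1.5) * (nu * beta) ** -1.5


def check_inc_relations(kT=0.8, orders=(1, 2, 5)):
    """Return True if equations (214) and (215) hold for the given orders."""
    beta = 1.0 / kT
    for nu in orders:
        closed = 0.75 * math.sqrt(math.pi) * nu ** -2.5 * kT ** 2.5
        if not math.isclose(j_inc(nu, kT), closed, rel_tol=1e-12):
            return False
        if not math.isclose(h_inc(nu, kT), (2.0 / 3.0) * beta * nu * j_inc(nu, kT), rel_tol=1e-12):
            return False
    return True
```

The check relies only on the identity $\Gamma(5/2) = \tfrac{3}{2}\Gamma(3/2) = \tfrac{3}{4}\sqrt{\pi}$.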


For the incoherent spacetime case with $\theta_\varepsilon$ held constant it follows by the same
arguments given in equations (130) through (141) that equations (206) and (208)
become

$$ J_{\nu R}^{inc} = \tfrac{3}{4}\sqrt{\pi}\, (\nu\beta)^{-5/2} \tag{216} $$

$$ J_{\nu I}^{inc} = 0 \tag{217} $$

which is identical to the case in equation (214) that was obtained by taking
$\theta_\varepsilon = 0$.


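The case of coherent spacetime can be obtained by taking $\beta_{\varepsilon\varepsilon} = \pi/2$,
$V' = V\theta_V$ and $\varepsilon = \varepsilon_c$ = constant so that equation (201) becomes

$$ \bar{U}^{coh} = f V\theta_V \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \bar{z}_{coh}^{\nu} \bar{J}_{\nu}^{coh} \tag{218} $$

$$ = f V\theta_V \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} J_{\nu}^{coh} \exp[j(\nu\theta_z^{coh} + \theta_{J\nu}^{coh})] \tag{219} $$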


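where $f$ is given by equation (77), $z_{coh}$ and $\theta_z^{coh}$ are given by equations (197)
and (198), and where for coherent spacetime equations (205), (207) and (209)
become

$$ \bar{J}_{\nu}^{coh} = \varepsilon_c^{5/2} \bar{S}_{\nu} = \varepsilon_c^{5/2} S_{\nu} \exp(j\theta_{S\nu}) = \varepsilon_c^{5/2} (S_{\nu R} + jS_{\nu I}) \tag{220} $$

$$ J_{\nu}^{coh} = \varepsilon_c^{5/2} S_{\nu} \qquad \theta_{J\nu}^{coh} = \theta_{S\nu} \tag{221} $$

where

$$ \bar{S}_{\nu} = j \int_0^{\pi/3} \exp(j\tfrac{5}{2}\theta_\varepsilon) \exp(-\nu\beta\bar{\varepsilon})\, d\theta_\varepsilon \tag{222} $$

$$ = j \int_0^{\pi/3} \exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon) \exp[j(\tfrac{5}{2}\theta_\varepsilon - \nu\beta\varepsilon_c\sin\theta_\varepsilon)]\, d\theta_\varepsilon \tag{223} $$

$$ S_{\nu R} = \int_0^{\pi/3} \exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon) \sin(\nu\beta\varepsilon_c\sin\theta_\varepsilon - \tfrac{5}{2}\theta_\varepsilon)\, d\theta_\varepsilon \tag{224} $$

$$ S_{\nu I} = \int_0^{\pi/3} \exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon) \cos(\nu\beta\varepsilon_c\sin\theta_\varepsilon - \tfrac{5}{2}\theta_\varepsilon)\, d\theta_\varepsilon \tag{225} $$

and

$$ S_{\nu} = (S_{\nu R}^2 + S_{\nu I}^2)^{1/2} \qquad \tan\theta_{S\nu} = S_{\nu I}/S_{\nu R} \tag{226} $$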


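Therefore the internal energy for fermions in coherent spacetime can be written as

$$ \bar{U}^{coh} = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \bar{z}_{coh}^{\nu} \bar{S}_{\nu} \tag{227} $$

so that

$$ U_R^{coh} = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} S_{\nu} \cos(\nu\theta_z^{coh} + \theta_{S\nu}) \tag{228} $$

$$ U_I^{coh} = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} S_{\nu} \sin(\nu\theta_z^{coh} + \theta_{S\nu}) \tag{229} $$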


The measured value of the internal energy in coherent spacetime is given by
equation (228), and can be a negative quantity. The average value of the
internal energy per fermion in the coherent spacetime state is obtained from
equations (195) and (227) to be

$$ \bar{\varepsilon}_{av}^{coh} = \bar{U}^{coh}/\bar{n}^{coh} = \varepsilon_c \Big[ \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \bar{z}_{coh}^{\nu} \bar{S}_{\nu} \Big] \Big/ \Big[ \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \bar{z}_{coh}^{\nu} \bar{E}_{\nu} \Big] \tag{230} $$

where $\bar{S}_\nu$ and $\bar{E}_\nu$ are given by equations (222) and (190) respectively.

C.  Pressure  of  a  Fermi  Gas  in  a  Gravitational  Field. 


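The expression for the pressure of a Fermi gas located in broken symmetry
spacetime is given by the following complex number generalization of the
standard statistical mechanics expression33-35

$$ \bar{P}/(kT) = \frac{1}{V'} \sum_{p} \ln[1 + \bar{z} \exp(-\beta\bar{\varepsilon}_p)] \tag{231} $$

$$ = \frac{4\pi}{h^3} \int \bar{p}^2 \ln[1 + \bar{z} \exp(-\beta\bar{\varepsilon}_p)]\, d\bar{p} \tag{232} $$

$$ = \frac{2\pi(2m)^{3/2}}{h^3} \int \bar{\varepsilon}^{1/2} \ln[1 + \bar{z} \exp(-\beta\bar{\varepsilon})]\, d\bar{\varepsilon} \tag{233} $$

It is easy to verify the following series expansion

$$ \ln[1 + \bar{z} \exp(-\beta\bar{\varepsilon})] = \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{\bar{z}^{\nu}}{\nu} \exp(-\nu\beta\bar{\varepsilon}) \tag{234} $$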
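
As an illustration of the series in equation (234), the following minimal sketch compares a truncated sum against the complex logarithm. The function name and the sample values of $\bar{z}$, $\beta$ and $\bar{\varepsilon}$ are assumptions chosen only so that $|\bar{z}\exp(-\beta\bar{\varepsilon})| < 1$, where the series converges.

```python
import cmath


def log1p_series(x, terms=200):
    """Truncated series sum_{nu>=1} (-1)**(nu-1) * x**nu / nu for ln(1 + x)."""
    total = 0j
    power = 1 + 0j
    for nu in range(1, terms + 1):
        power *= x
        total += (-1) ** (nu - 1) * power / nu
    return total


# Hypothetical sample values: complex fugacity and complex single particle energy.
z_bar = 0.6 * cmath.exp(0.4j)
beta = 1.0
eps_bar = 0.5 * cmath.exp(0.3j)

x = z_bar * cmath.exp(-beta * eps_bar)
series_value = log1p_series(x)
exact_value = cmath.log(1 + x)
```

The two values agree to rounding error for these inputs. For $|\bar{z}\exp(-\beta\bar{\varepsilon})| \geq 1$ the truncated sum is not expected to converge.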


Then the pressure of a Fermi gas in partially coherent spacetime is given by

$$ \bar{P}/(kT) = f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{\bar{z}^{\nu}}{\nu} \bar{H}_{\nu} \tag{235} $$

where $f$ and $\bar{H}_\nu$ are given by equations (77) and (171) respectively. The result
for the pressure in equation (235) can be obtained directly by combining
equation (170) with the completely general equation (154). From the general
relation for the ideal gases

$$ \bar{P} = P \exp(j\theta_P) = \tfrac{2}{3}\, \bar{U}/V' \tag{236} $$

it follows from equations (200) and (235) that

$$ \bar{H}_{\nu} = \tfrac{2}{3}\, \beta\nu\, \bar{J}_{\nu} \tag{237} $$


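Equation (235) can also be written as

$$ \bar{P}/(kT) = f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z^{\nu}}{\nu} H_{\nu} \exp[j(\nu\theta_z + \theta_{H\nu})] \tag{238} $$

so that

$$ P_R/(kT) = f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z^{\nu}}{\nu} H_{\nu} \cos(\nu\theta_z + \theta_{H\nu}) \tag{239} $$

$$ P_I/(kT) = f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z^{\nu}}{\nu} H_{\nu} \sin(\nu\theta_z + \theta_{H\nu}) \tag{240} $$

$$ \tan\theta_P = P_I/P_R \tag{241} $$

The measured pressure is given by equation (239).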

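For the incoherent spacetime case the pressure of a Fermi gas is obtained
from equation (235) as

$$ P_{inc} = f kT \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z_{inc}^{\nu}}{\nu} H_{\nu}^{inc} = \tfrac{2}{3} f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{inc}^{\nu} J_{\nu}^{inc} \tag{242} $$

where $H_{\nu}^{inc}$ is given by equations (183) and (215), and

$$ J_{\nu}^{inc} = H_{\nu}^{inc}/(\tfrac{2}{3}\beta\nu) = \tfrac{3}{4}\sqrt{\pi}\, \nu^{-5/2} (kT)^{5/2} \tag{243} $$

which is the standard textbook result.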

For coherent spacetime the pressure of a Fermi gas can be obtained from
equations (188) and (235) to be

$$ \bar{P}^{coh}/(kT) = f \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{\bar{z}_{coh}^{\nu}}{\nu} \bar{H}_{\nu}^{coh} \tag{244} $$

$$ = f \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{\bar{z}_{coh}^{\nu}}{\nu} \bar{E}_{\nu} \tag{245} $$

$$ = f \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z_{coh}^{\nu}}{\nu} E_{\nu} \exp[j(\nu\theta_z^{coh} + \theta_{E\nu})] \tag{246} $$


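For a coherent spacetime the pressure is given by

$$ \bar{P}^{coh} = \tfrac{2}{3}\, \bar{U}^{coh}/(V\theta_V) \tag{247} $$

and a comparison of equations (218), (220), (227), (245) and (247) gives

$$ \bar{H}_{\nu}^{coh} = \tfrac{2}{3}\, \beta\nu\, \bar{J}_{\nu}^{coh} \tag{248} $$

or

$$ \bar{E}_{\nu} = \tfrac{2}{3}\, \beta\nu\, \varepsilon_c \bar{S}_{\nu} \tag{249} $$

or equivalently

$$ E_{\nu} = \tfrac{2}{3}\, \beta\nu\, \varepsilon_c S_{\nu} \qquad \theta_{E\nu} = \theta_{S\nu} \tag{250} $$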


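Equivalently the coherent spacetime Fermi gas pressure is written as

$$ \bar{P}^{coh} = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \bar{z}_{coh}^{\nu} \bar{S}_{\nu} \tag{251} $$

$$ = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} S_{\nu} \exp[j(\nu\theta_z^{coh} + \theta_{S\nu})] \tag{252} $$

Finally, the real and imaginary parts of the Fermi gas pressure in coherent
spacetime are given by

$$ P_R^{coh} = f kT \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z_{coh}^{\nu}}{\nu} E_{\nu} \cos(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{253} $$

$$ = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} S_{\nu} \cos(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{254} $$

$$ P_I^{coh} = f kT \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} \frac{z_{coh}^{\nu}}{\nu} E_{\nu} \sin(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{255} $$

$$ = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} (-1)^{\nu-1} z_{coh}^{\nu} S_{\nu} \sin(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{256} $$

where $S_\nu$ and $\theta_{S\nu}$ are given by equation (226), and $E_\nu$ and $\theta_{E\nu}$ are given by
equation (194).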
D.  Zero  Temperature  Fermi  Gas  in  the  Presence  of  Gravity. 


Consider now the $T = 0$ state of an ideal Fermi gas in spacetime with
broken internal symmetries. The complex number generalization of the standard
scalar expression for the particle number is given by33-35

$$ \bar{n} = \frac{4\pi V'}{h^3} \int \bar{p}^2\, d\bar{p} \tag{257} $$

$$ = \frac{4\pi V'}{h^3} \int_0^{\infty} p^2 \sec\beta_{pp} \exp[j(3\theta_p + \beta_{pp})]\, dp \tag{258} $$

$$ = \frac{4\pi V'}{h^3} \int_0^{\pi/6} p^3 \csc\beta_{pp} \exp[j(3\theta_p + \beta_{pp})]\, d\theta_p \tag{259} $$

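where $\beta_{pp}$ is given by equation (49). Equivalently these equations can be written
in terms of single particle energy as

$$ \bar{n} = f V' \int \bar{\varepsilon}^{1/2}\, d\bar{\varepsilon} \tag{260} $$

$$ = f V' \int_0^{\infty} \varepsilon^{1/2} \sec\beta_{\varepsilon\varepsilon} \exp[j(\tfrac{3}{2}\theta_\varepsilon + \beta_{\varepsilon\varepsilon})]\, d\varepsilon \tag{261} $$

$$ = f V' \int_0^{\pi/3} \varepsilon^{3/2} \csc\beta_{\varepsilon\varepsilon} \exp[j(\tfrac{3}{2}\theta_\varepsilon + \beta_{\varepsilon\varepsilon})]\, d\theta_\varepsilon \tag{262} $$

where $V'$, $\beta_{\varepsilon\varepsilon}$ and $f$ are given by equations (6), (54) and (77). For incoherent
spacetime these integrals reduce to the standard expressions for the $T = 0$ Fermi
gas. For coherent spacetime with $\beta_{pp} = \pi/2$, $\beta_{\varepsilon\varepsilon} = \pi/2$, $V' = V\theta_V$, $p = p_c$ =
constant, and $\varepsilon = \varepsilon_c$ = constant it follows from equations (257) and (260) or
(259) and (262) that for internal space rotations of the momentum and single
particle energy

$$ \bar{n} = \frac{4\pi V\theta_V}{3h^3}\, \bar{p}^3 \Big|_{p_c}^{\bar{p}_c} = \frac{4\pi V\theta_V}{3h^3} (\bar{p}_c^3 - p_c^3) \tag{263} $$

$$ = \tfrac{2}{3} f V\theta_V\, \bar{\varepsilon}^{3/2} \Big|_{\varepsilon_c}^{\bar{\varepsilon}_c} = \tfrac{2}{3} f V\theta_V (\bar{\varepsilon}_c^{3/2} - \varepsilon_c^{3/2}) \tag{264} $$

where

$$ \bar{p}_c = p_c \exp(j\pi/6) \tag{265} $$

$$ \bar{\varepsilon}_c = \varepsilon_c \exp(j\pi/3) \tag{266} $$

Combining equations (263) through (266) gives for the $T = 0$ Fermi gas in
coherent spacetime

$$ \bar{n} = \frac{4\pi V\theta_V}{3h^3}\, p_c^3 (j - 1) \tag{267} $$

$$ = \frac{4\pi V\theta_V}{3h^3}\, (2m\varepsilon_c)^{3/2} (j - 1) \tag{268} $$

$$ = \tfrac{2}{3} f V\theta_V\, \varepsilon_c^{3/2} (j - 1) \tag{269} $$

Note that for this case $n_R < 0$. For particles with spin the above results must
be multiplied by $(2s + 1)$ where $s$ = spin.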

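The internal energy of a $T = 0$ Fermi gas in the presence of gravity is
written as the following complex number generalization of the standard scalar
result33-35

$$ \bar{U} = \frac{4\pi V'}{2mh^3} \int \bar{p}^4\, d\bar{p} \tag{270} $$

$$ = \frac{4\pi V'}{2mh^3} \int_0^{\infty} p^4 \sec\beta_{pp} \exp[j(5\theta_p + \beta_{pp})]\, dp \tag{271} $$

$$ = \frac{4\pi V'}{2mh^3} \int_0^{\pi/6} p^5 \csc\beta_{pp} \exp[j(5\theta_p + \beta_{pp})]\, d\theta_p \tag{272} $$

These equations can be written equivalently in terms of the single particle
energy as

$$ \bar{U} = f V' \int \bar{\varepsilon}^{3/2}\, d\bar{\varepsilon} \tag{273} $$

$$ = f V' \int_0^{\infty} \varepsilon^{3/2} \sec\beta_{\varepsilon\varepsilon} \exp[j(\tfrac{5}{2}\theta_\varepsilon + \beta_{\varepsilon\varepsilon})]\, d\varepsilon \tag{274} $$

$$ = f V' \int_0^{\pi/3} \varepsilon^{5/2} \csc\beta_{\varepsilon\varepsilon} \exp[j(\tfrac{5}{2}\theta_\varepsilon + \beta_{\varepsilon\varepsilon})]\, d\theta_\varepsilon \tag{275} $$
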

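For incoherent spacetime all internal phase angles are set equal to zero and
equations (270), (271), (273) and (274) reduce to the standard scalar
expressions. For coherent spacetime with $\beta_{pp} = \pi/2$, $\beta_{\varepsilon\varepsilon} = \pi/2$, $V' = V\theta_V$, $p = p_c$ =
constant, and $\varepsilon = \varepsilon_c$ = constant it follows from equations (270) and (273) or
equations (272) and (275) that for the case of internal spacetime rotations of
the single particle momentum and energy

$$ \bar{U} = \frac{2\pi V\theta_V}{5mh^3}\, \bar{p}^5 \Big|_{p_c}^{\bar{p}_c} = \frac{2\pi V\theta_V}{5mh^3} (\bar{p}_c^5 - p_c^5) \tag{276} $$

$$ = \tfrac{2}{5} f V\theta_V\, \bar{\varepsilon}^{5/2} \Big|_{\varepsilon_c}^{\bar{\varepsilon}_c} = \tfrac{2}{5} f V\theta_V (\bar{\varepsilon}_c^{5/2} - \varepsilon_c^{5/2}) \tag{277} $$

Combining equations (265), (266), (276) and (277) gives

$$ \bar{U} = \frac{\pi V\theta_V}{5mh^3}\, p_c^5 (j - \sqrt{3} - 2) \tag{278} $$

$$ = \tfrac{1}{5} f V\theta_V\, \varepsilon_c^{5/2} (j - \sqrt{3} - 2) \tag{279} $$

For this case $U_R < 0$. For particles with spin these results must be multiplied
by $(2s + 1)$ where $s$ = spin. The average energy per particle is obtained from
equations (267), (269), (278) and (279) to be

$$ \bar{\varepsilon}_{av} = \bar{U}/\bar{n} \tag{280} $$

$$ = \tfrac{3}{10}\, \varepsilon_c\, (j - \sqrt{3} - 2)/(j - 1) \tag{281} $$

$$ = \tfrac{3}{10} (1 + \sqrt{3})\, \varepsilon_c \exp(j\pi/6) \tag{282} $$

The real part of equation (282) is given by

$$ \varepsilon_{avR} = \frac{3\sqrt{3}\,(1 + \sqrt{3})}{20}\, \varepsilon_c \approx 0.71\, \varepsilon_c \tag{283} $$

The result in equation (283) can be compared with the corresponding result for
incoherent spacetime which is $\varepsilon_{av} = \tfrac{3}{5}\varepsilon_F = 0.6\,\varepsilon_F$ where $\varepsilon_F$ = Fermi energy.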
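
The complex arithmetic leading from equation (281) to equations (282) and (283) can be reproduced with a few lines of code. This is only a sketch in units where $\varepsilon_c = 1$; the variable names are hypothetical.

```python
import cmath
import math

SQRT3 = math.sqrt(3.0)

# Equation (281) in units of eps_c: (3/10) (j - sqrt(3) - 2) / (j - 1)
eps_av_ratio = 0.3 * (1j - SQRT3 - 2.0) / (1j - 1.0)

# Equation (282): (3/10) (1 + sqrt(3)) exp(j pi/6)
eps_av_polar = 0.3 * (1.0 + SQRT3) * cmath.exp(1j * math.pi / 6.0)

# Equation (283): real part, 3 sqrt(3) (1 + sqrt(3)) / 20
eps_av_real = 3.0 * SQRT3 * (1.0 + SQRT3) / 20.0

print(eps_av_ratio, eps_av_polar, eps_av_real)
```

The ratio form and the polar form agree, and the real part rounds to 0.71, which may be compared with the incoherent value 0.6.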

4.  BOSE  GAS  IN  BROKEN  SYMMETRY  SPACETIME.  This  section  calculates  the 
chemical  potential,  internal  energy  and  pressure  of  a  Bose  gas  which  is  located 
in  an  external  field  such  as  gravity  or  an  electromagnetic  field  which  breaks 
the  symmetry  of  spacetime  and  induces  complex  number  values  for  the  spacetime 
coordinates,  particle  momenta  and  single  particle  energies.  This  requires  that 
the  total  particle  number,  chemical  potential,  internal  energy  and  pressure  must 
be  represented  as  complex  numbers. 

A.  Chemical  Potential  of  Bose  Gas. 

For a Bose gas in a gravitational field the complex number generalization
of the standard expression for the total particle number in a Bose gas is given
by

$$ \bar{n} = \sum_{p} [\bar{z}^{-1} \exp(\beta\bar{\varepsilon}_p) - 1]^{-1} \tag{284} $$

$$ = \sum_{p \neq 0} \sum_{\nu=1}^{\infty} \bar{z}^{\nu} \exp(-\nu\beta\bar{\varepsilon}_p) + \bar{z}/(1 - \bar{z}) \tag{285} $$

$$ = f V' \sum_{\nu=1}^{\infty} \bar{z}^{\nu} \bar{H}_{\nu} + \bar{z}/(1 - \bar{z}) \tag{286} $$

where $V'$, $f$ and $\bar{H}_\nu$ are given by equations (6), (77) and (171) respectively.
Equation (286) can be rewritten as

$$ \bar{n} = f V' \sum_{\nu=1}^{\infty} z^{\nu} H_{\nu} \exp[j(\nu\theta_z + \theta_{H\nu})] + \bar{z}/(1 - \bar{z}) \tag{287} $$

where the fugacity $\bar{z}$ is defined in equation (70). The real and imaginary parts
of equation (287) are obtained using equations (64) and (65) as

$$ N \cos^2\theta_n = f V' \sum_{\nu=1}^{\infty} z^{\nu} H_{\nu} \cos(\nu\theta_z + \theta_{H\nu}) + \frac{z(\cos\theta_z - z)}{1 + z^2 - 2z\cos\theta_z} \tag{288} $$

$$ N \cos\theta_n \sin\theta_n = f V' \sum_{\nu=1}^{\infty} z^{\nu} H_{\nu} \sin(\nu\theta_z + \theta_{H\nu}) + \frac{z\sin\theta_z}{1 + z^2 - 2z\cos\theta_z} \tag{289} $$

where $N$ = integer number of bosons. Equations (288) and (289) can be used to
determine $z$ and $\theta_z$ from which the chemical potential components $\mu$ and $\theta_\mu$ are
obtained from equations (73) and (74). From equation (66) it follows that
equation (288) gives the measured particle number.
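
The real and imaginary parts of the ground state term $\bar{z}/(1 - \bar{z})$ that appear in equations (288) and (289) can be checked against direct complex division. The sketch below is illustrative only; the function name and the sample fugacity values are assumptions.

```python
import cmath
import math


def ground_state_term_parts(z, theta_z):
    """Real and imaginary parts of z_bar/(1 - z_bar) for z_bar = z exp(j theta_z)."""
    denom = 1.0 + z * z - 2.0 * z * math.cos(theta_z)
    real_part = z * (math.cos(theta_z) - z) / denom
    imag_part = z * math.sin(theta_z) / denom
    return real_part, imag_part


# Hypothetical sample fugacity magnitude and internal phase angle.
z, theta_z = 0.7, 0.35
z_bar = z * cmath.exp(1j * theta_z)
direct = z_bar / (1.0 - z_bar)
real_part, imag_part = ground_state_term_parts(z, theta_z)
```

For $\theta_z = 0$ the imaginary part vanishes and the real part reduces to the familiar $z/(1 - z)$.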


For incoherent spacetime all internal phase angles are set equal to zero
and equation (288) becomes

$$ N = f V \sum_{\nu=1}^{\infty} z_{inc}^{\nu} H_{\nu}^{inc} + z_{inc}/(1 - z_{inc}) \tag{290} $$

where $H_{\nu}^{inc}$ is given by equation (183), and $z_{inc}$ is obtained as a solution of
equation (290). Equation (290) is the standard textbook expression for the
particle number for the case of an ideal Bose gas.33-35


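For the case of a Bose gas in coherent spacetime with $V' = V\theta_V$, $\beta_{\varepsilon\varepsilon} = \pi/2$
and $\varepsilon = \varepsilon_c$ = constant, it is easy to show that

$$ \bar{n} = f V\theta_V \sum_{\nu=1}^{\infty} \bar{z}_{coh}^{\nu} \bar{H}_{\nu}^{coh} + \bar{z}_{coh}/(1 - \bar{z}_{coh}) \tag{291} $$

$$ = f V\theta_V\, \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} \bar{z}_{coh}^{\nu} \bar{E}_{\nu} + \bar{z}_{coh}/(1 - \bar{z}_{coh}) \tag{292} $$

where $\bar{E}_\nu$ is given by equation (191). Then it follows that

$$ N \cos^2\theta_n^{coh} = f V\theta_V\, \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} E_{\nu} \cos(\nu\theta_z^{coh} + \theta_{E\nu}) + \frac{z_{coh}(\cos\theta_z^{coh} - z_{coh})}{1 + z_{coh}^2 - 2z_{coh}\cos\theta_z^{coh}} \tag{293} $$

$$ N \cos\theta_n^{coh} \sin\theta_n^{coh} = f V\theta_V\, \varepsilon_c^{3/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} E_{\nu} \sin(\nu\theta_z^{coh} + \theta_{E\nu}) + \frac{z_{coh}\sin\theta_z^{coh}}{1 + z_{coh}^2 - 2z_{coh}\cos\theta_z^{coh}} \tag{294} $$

where $E_\nu$ and $\theta_{E\nu}$ are given by equation (194). Equations (293) and (294) can be
used to determine $z_{coh}$ and $\theta_z^{coh}$ and subsequently $\mu_{coh}$ and $\theta_\mu^{coh}$.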

B.  Internal  Energy  of  Bose  Gas  in  a  Gravity  Field. 

The internal energy of a Bose gas in broken symmetry spacetime is written
as the following complex number generalization of the scalar result33-35

$$ \bar{U} = f V' \int \bar{\varepsilon}^{3/2} [\exp(\bar{\xi}) - 1]^{-1}\, d\bar{\varepsilon} - \tfrac{3}{2} kT \ln(1 - \bar{z}) \tag{295} $$

where $\bar{\xi}$ is given by equation (168). Expanding the integrand by an infinite
series gives

$$ \bar{U} = f V' \sum_{\nu=1}^{\infty} \bar{z}^{\nu} \bar{J}_{\nu} - \tfrac{3}{2} kT \ln(1 - \bar{z}) \tag{296} $$

where $\bar{J}_\nu$ is given by equation (202). The real and imaginary parts of equation
(296) are obtained to be

$$ U_R = f V' \sum_{\nu=1}^{\infty} z^{\nu} J_{\nu} \cos(\nu\theta_z + \theta_{J\nu}) - \tfrac{3}{2} kT \ln Y \tag{297} $$

$$ U_I = f V' \sum_{\nu=1}^{\infty} z^{\nu} J_{\nu} \sin(\nu\theta_z + \theta_{J\nu}) + \tfrac{3}{2} kT\, \theta_Y \tag{298} $$

where $J_\nu$ and $\theta_{J\nu}$ are given by equations (203), and where

$$ 1 - \bar{z} = Y \exp(-j\theta_Y) \tag{299} $$

$$ Y = (1 - 2z\cos\theta_z + z^2)^{1/2} \tag{300} $$

$$ \tan\theta_Y = (z\sin\theta_z)/(1 - z\cos\theta_z) \tag{301} $$

The measured internal energy is given by $U_m = U_R$. The average energy per
particle for the Bose gas in a gravity field is given by

$$ \bar{\varepsilon}_{av} = \bar{U}/\bar{n} \tag{302} $$

where $\bar{U}$ is given by equation (296) and $\bar{n}$ is given by equation (286).
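
The polar decomposition in equations (299) through (301), and the resulting split of $\ln(1 - \bar{z})$ into $\ln Y - j\theta_Y$ used in equations (297) and (298), can be verified numerically. The following is a minimal sketch; the function name and sample values are assumptions, and `atan2` is used so that $\theta_Y$ lands in the correct quadrant.

```python
import cmath
import math


def y_and_theta_y(z, theta_z):
    """Return (Y, theta_Y) with 1 - z exp(j theta_z) = Y exp(-j theta_Y)."""
    y = math.sqrt(1.0 - 2.0 * z * math.cos(theta_z) + z * z)
    theta_y = math.atan2(z * math.sin(theta_z), 1.0 - z * math.cos(theta_z))
    return y, theta_y


# Hypothetical sample fugacity magnitude and internal phase angle.
z, theta_z = 0.6, 0.5
z_bar = z * cmath.exp(1j * theta_z)
Y, theta_Y = y_and_theta_y(z, theta_z)

one_minus_z = 1.0 - z_bar
log_term = cmath.log(one_minus_z)  # expected to equal ln Y - j theta_Y
```

With these values the complex logarithm has real part $\ln Y$ and imaginary part $-\theta_Y$, which is the origin of the sign difference between the last terms of equations (297) and (298).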


For incoherent spacetime all internal phase angles have zero values and
equations (297) and (299) give the internal energy of an ideal Bose gas as

$$ U_{inc} = f V \sum_{\nu=1}^{\infty} z_{inc}^{\nu} J_{\nu}^{inc} - \tfrac{3}{2} kT \ln(1 - z_{inc}) \tag{303} $$

where $J_{\nu}^{inc}$ is given by equation (214), and $z_{inc}$ is determined as a solution to
equation (290). Equation (303) represents the standard expression for the
internal energy of an ideal Bose gas.33-35


For coherent spacetime the internal energy is written as

$$ \bar{U}^{coh} = f V\theta_V \sum_{\nu=1}^{\infty} \bar{z}_{coh}^{\nu} \bar{J}_{\nu}^{coh} - \tfrac{3}{2} kT \ln(1 - \bar{z}_{coh}) \tag{304} $$

$$ = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} \bar{z}_{coh}^{\nu} \bar{S}_{\nu} - \tfrac{3}{2} kT \ln(1 - \bar{z}_{coh}) \tag{305} $$

where $\bar{J}_{\nu}^{coh}$ and $\bar{S}_\nu$ are defined by equations (220) through (222). The real and
imaginary parts of the internal energy of a Bose gas located in a strong
gravitational field are given by

$$ U_R^{coh} = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} S_{\nu} \cos(\nu\theta_z^{coh} + \theta_{S\nu}) - \tfrac{3}{2} kT \ln Y_{coh} \tag{306} $$

$$ U_I^{coh} = f V\theta_V\, \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} S_{\nu} \sin(\nu\theta_z^{coh} + \theta_{S\nu}) + \tfrac{3}{2} kT\, \theta_Y^{coh} \tag{307} $$

where from equations (299) through (301)

$$ 1 - \bar{z}_{coh} = Y_{coh} \exp(-j\theta_Y^{coh}) \tag{308} $$

$$ Y_{coh} = (1 - 2z_{coh}\cos\theta_z^{coh} + z_{coh}^2)^{1/2} \tag{309} $$

$$ \tan\theta_Y^{coh} = (z_{coh}\sin\theta_z^{coh})/(1 - z_{coh}\cos\theta_z^{coh}) \tag{310} $$

The measured internal energy is given by $U_m^{coh} = U_R^{coh}$.

C.  Pressure  of  a  Bose  Gas  in  a  Gravity  Field. 

The pressure of an ideal Bose gas in a gravitational field can be immediately
obtained from equations (236) and (296) to be

$$ \bar{P} = \tfrac{2}{3} f \sum_{\nu=1}^{\infty} \bar{z}^{\nu} \bar{J}_{\nu} - \frac{1}{V'} kT \ln(1 - \bar{z}) \tag{311} $$

This expression can also be obtained from the following complex number
generalization of the standard scalar result for the pressure of an ideal Bose gas33-35

$$ \bar{P}/(kT) = -f \int \bar{\varepsilon}^{1/2} \ln[1 - \bar{z} \exp(-\beta\bar{\varepsilon})]\, d\bar{\varepsilon} - \frac{1}{V'} \ln(1 - \bar{z}) \tag{312} $$

Then using the expression

$$ \ln(1 - \bar{x}) = \sum_{\nu=1}^{\infty} (-1)^{2\nu-1}\, \bar{x}^{\nu}/\nu \tag{313} $$

gives

$$ \bar{P}/(kT) = f \sum_{\nu=1}^{\infty} \frac{\bar{z}^{\nu}}{\nu} \bar{H}_{\nu} - \frac{1}{V'} \ln(1 - \bar{z}) \tag{314} $$

where $\bar{H}_\nu$ is given by equation (171). A comparison of equations (311) and (314)
shows that equation (237) is true also for the ideal Bose gas. The real and
imaginary components of the pressure are obtained from equation (311) to be

$$ P_R = \tfrac{2}{3} f \sum_{\nu=1}^{\infty} z^{\nu} J_{\nu} \cos(\nu\theta_z + \theta_{J\nu}) - \frac{1}{V'} kT \ln Y \tag{315} $$

$$ P_I = \tfrac{2}{3} f \sum_{\nu=1}^{\infty} z^{\nu} J_{\nu} \sin(\nu\theta_z + \theta_{J\nu}) + \frac{1}{V'} kT\, \theta_Y \tag{316} $$

The measured pressure is given by $P_m = P_R$.

For an ideal Bose gas in incoherent spacetime all of the internal phase
angles are set equal to zero and equation (315) becomes

$$ P_{inc} = \tfrac{2}{3} f \sum_{\nu=1}^{\infty} z_{inc}^{\nu} J_{\nu}^{inc} - V^{-1} kT \ln(1 - z_{inc}) \tag{317} $$

where $J_{\nu}^{inc}$ is given by equation (243). Equation (317) is the standard expression
for the pressure and in fact from equation (303) it follows that33-35

$$ P_{inc} = \tfrac{2}{3}\, U_{inc}/V \tag{318} $$

which agrees with equation (236).


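The pressure of an ideal Bose gas in coherent spacetime is obtained from
equation (236) to be

$$ \bar{P}^{coh} = \tfrac{2}{3}\, \bar{U}^{coh}/(V\theta_V) \tag{319} $$

$$ = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} \bar{z}_{coh}^{\nu} \bar{S}_{\nu} - (V\theta_V)^{-1} kT \ln(1 - \bar{z}_{coh}) \tag{320} $$

or from equations (306) and (307)

$$ P_R^{coh} = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} S_{\nu} \cos(\nu\theta_z^{coh} + \theta_{S\nu}) - (V\theta_V)^{-1} kT \ln Y_{coh} \tag{321} $$

$$ P_I^{coh} = \tfrac{2}{3} f \varepsilon_c^{5/2} \sum_{\nu=1}^{\infty} z_{coh}^{\nu} S_{\nu} \sin(\nu\theta_z^{coh} + \theta_{S\nu}) + (V\theta_V)^{-1} kT\, \theta_Y^{coh} \tag{322} $$

where $\bar{S}_\nu$, $S_\nu$ and $\theta_{S\nu}$ are given in equations (222) and (226), and where $Y_{coh}$ and
$\theta_Y^{coh}$ are given by equations (309) and (310). The calculations in this section
can be applied to the Bose gas of Cooper electron pairs in a high-Tc
superconductor. The totally coherent spacetime state describes the superconducting
state of a high-Tc material, whereas the partially coherent spacetime state can
be used to describe the normal state of a high-Tc superconductor.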

5.  THERMONUCLEAR  REACTIONS  IN  THE  PRESENCE  OF  GRAVITY.  This section
calculates the rate of thermonuclear reactions of particles that exist in a spacetime
whose symmetries are broken by the presence of an external field such as
gravity. The calculations will be of value for predicting the thermonuclear
reaction rates in stars where the gravitational field is sufficiently strong
to make spacetime partially coherent or even totally coherent. The conventional
calculation of the nuclear reaction rates in stars assumes that spacetime is
incoherent. These conventional fusion rate calculations are treated extensively
in the literature, so only a brief summary of the results is given here.

The thermonuclear reaction rate $R_{inc}$ is written as1-6

$$ R_{inc} = n_1 n_2 \langle\sigma v\rangle = (n_1 n_2/m) \langle\sigma p\rangle = n_1 n_2 (2/m)^{1/2} \langle\sigma\varepsilon^{1/2}\rangle \tag{323} $$

where $\sigma$ = fusion reaction cross section, $v$ = relative collision speed, $m$ =
reduced mass of the two reacting species of particles, $\varepsilon = \tfrac{1}{2}mv^2$ = kinetic
energy of the two reacting particles in the center of mass system, and where $n_1$ and
$n_2$ = nuclear species particle number densities given by

$$ n_1 = N_1/V \qquad n_2 = N_2/V \tag{324} $$

By way of introduction to the problem, the thermonuclear reaction rate is
evaluated for an ideal Boltzmann gas in incoherent spacetime with a zero external
gravitational field. For a Maxwell-Boltzmann distribution in incoherent
spacetime the average value that appears in equation (323) is evaluated using
equations (92) and (93) as follows1-6

$$ R_{inc} = n_1 n_2 (2/m)^{1/2}\, F_{inc}/A_{inc} $$

$$ = n_1 n_2 [8/(\pi m)]^{1/2} (kT)^{-3/2}\, F_{inc} $$

$$ = 4\pi n_1 n_2 [m/(2\pi kT)]^{3/2} \int_0^{\infty} \exp(-\tfrac{1}{2}\beta m v^2)\, \sigma v^3\, dv \tag{325} $$

where $A_{inc}$ is given by equation (93), and $F_{inc}$ is given by

$$ F_{inc} = \int_0^{\infty} \sigma\varepsilon \exp(-\beta\varepsilon)\, d\varepsilon \tag{326} $$

where $\varepsilon = \tfrac{1}{2}mv^2$.
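
The equivalence of the energy form and the speed form in equation (325) can be illustrated numerically. The sketch below computes the rate coefficient $R_{inc}/(n_1 n_2)$ both ways with a simple Simpson rule. The helper names, the cutoff at 60 kT, and the constant test cross section are assumptions made for the illustration and are not taken from the text.

```python
import math


def simpson(func, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of intervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def rate_coeff_energy_form(sigma, m, kT, cutoff=60.0):
    """[8/(pi m)]^(1/2) (kT)^(-3/2) F_inc, with F_inc from equation (326)."""
    beta = 1.0 / kT
    f_inc = simpson(lambda e: sigma(e) * e * math.exp(-beta * e), 0.0, cutoff * kT)
    return math.sqrt(8.0 / (math.pi * m)) * kT ** -1.5 * f_inc


def rate_coeff_speed_form(sigma, m, kT, cutoff=60.0):
    """4 pi [m/(2 pi kT)]^(3/2) times the speed integral in equation (325)."""
    beta = 1.0 / kT
    v_max = math.sqrt(2.0 * cutoff * kT / m)  # same energy cutoff as above

    def integrand(v):
        energy = 0.5 * m * v * v
        return math.exp(-beta * energy) * sigma(energy) * v ** 3

    return 4.0 * math.pi * (m / (2.0 * math.pi * kT)) ** 1.5 * simpson(integrand, 0.0, v_max)


def constant_sigma(energy):
    """Hypothetical constant cross section, used only as a test case."""
    return 1.0
```

For a constant cross section both forms reduce to the mean relative speed $[8kT/(\pi m)]^{1/2}$ of the Maxwell-Boltzmann distribution, which provides a simple check of the prefactors.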

The generalization of the standard result in equation (323) to the case
of broken symmetry spacetime that is induced by an external gravity field is
written as the following complex number equation

$$ \bar{R} = \bar{n}_1 \bar{n}_2 \langle\bar{\sigma}\bar{v}\rangle = (\bar{n}_1 \bar{n}_2/m) \langle\bar{\sigma}\bar{p}\rangle = \bar{n}_1 \bar{n}_2 (2/m)^{1/2} \langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle \tag{327} $$

where $\bar{R}$ = complex number thermonuclear reaction rate, $\bar{\sigma}$ = complex number
thermonuclear reaction cross section which is written as

$$ \bar{\sigma} = \sigma \exp(j\theta_\sigma) \tag{328} $$

and where $\bar{v}$, $\bar{p}$ and $\bar{\varepsilon}$ = complex number relative particle speed, relative particle
momentum and total particle kinetic energy in the center of mass system
respectively. The complex species particle number densities $\bar{n}_1$ and $\bar{n}_2$ are written in
analogy to equations (61) and (62) as

$$ \bar{n}_1 = n_1 \exp(j\theta_{n1}) = \bar{N}_1/V' \tag{329} $$

$$ \bar{n}_2 = n_2 \exp(j\theta_{n2}) = \bar{N}_2/V' \tag{330} $$

where

$$ n_1 = (N_1/V') \cos\theta_{N1} \qquad \theta_{n1} = \theta_{N1} \tag{331} $$

$$ n_2 = (N_2/V') \cos\theta_{N2} \qquad \theta_{n2} = \theta_{N2} \tag{332} $$

where $\theta_{N1}$ and $\theta_{N2}$ = internal phase angles of species particle numbers, $N_1$ and
$N_2$ = integer number of species particles in the volume $V'$, and $V'$ = broken
symmetry space volume given by equation (6). For the case of coherent
spacetime it follows from equation (9) that $V' = V\theta_V$ so that equations (331) and
(332) give the species particle number densities as

$$ n_1^{coh} = [N_1/(V\theta_V)] \cos\theta_{N1}^{coh} \qquad \theta_{n1}^{coh} = \theta_{N1}^{coh} \tag{333} $$

$$ n_2^{coh} = [N_2/(V\theta_V)] \cos\theta_{N2}^{coh} \qquad \theta_{n2}^{coh} = \theta_{N2}^{coh} \tag{334} $$

where $\theta_{N1}^{coh}$ and $\theta_{N2}^{coh}$ = internal phase angles for the species particle numbers in
coherent spacetime. For incoherent spacetime all internal phase angles are
equal to zero and $V' = V$ so that equations (331) and (332) reduce to equation
(324). The thermonuclear reaction rates for classical particles, fermions and
bosons in the presence of an external gravity field will now be calculated.

A.  Thermonuclear  Reactions  in  the  Broken  Symmetry  Spacetime  of  a 
Boltzmann  Gas  in  a  Gravitational  Field. 

For a Boltzmann gas in a spacetime with broken internal symmetries due to
gravity the complex number thermonuclear reaction rate is given by equation
(327) with

$$ \langle\bar{\sigma}\bar{p}\rangle = \bar{F}_p/\bar{A}_p = (F_p/A_p) \exp[j(\theta_{Fp} - \theta_{Ap})] \tag{335} $$

$$ \langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle = \bar{F}/\bar{A} = (F/A) \exp[j(\theta_F - \theta_A)] \tag{336} $$

where

$$ \bar{F}_p = \int \bar{\sigma}\bar{p}^3 \exp[-\beta\bar{p}^2/(2m)]\, d\bar{p} \tag{337} $$

$$ \bar{A}_p = \int \bar{p}^2 \exp[-\beta\bar{p}^2/(2m)]\, d\bar{p} \tag{338} $$

$$ \bar{F} = \int \bar{\sigma}\bar{\varepsilon} \exp(-\beta\bar{\varepsilon})\, d\bar{\varepsilon} \tag{339} $$

$$ \bar{A} = \int \bar{\varepsilon}^{1/2} \exp(-\beta\bar{\varepsilon})\, d\bar{\varepsilon} \tag{340} $$

where $\bar{p}$ and $\bar{\varepsilon}$ are represented by equations (45) and (50) respectively and where
$d\bar{p}$ and $d\bar{\varepsilon}$ are given by equations (47), (48) and (52), (53) respectively for the
general case of partially coherent broken spacetime symmetry and by equation
(56) for the case of coherent spacetime. Only the single particle energy
integrals appearing in equations (336), (339) and (340) are considered in this
paper. A phase space factor $fV'$ given by equations (6) and (77) has been cancelled
from the numerator and denominator of equation (336). The integral $\bar{A}$ that
appears in equations (336) and (340) has already appeared in equations (78) and
(84) through (89). Using equations (52) and (53) allows the integral $\bar{F}$ that
appears in equation (339) to be written as

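$$ \bar{F} = \int_0^{\infty} \sigma\varepsilon \sec\beta_{\varepsilon\varepsilon} \exp[j(2\theta_\varepsilon + \theta_\sigma + \beta_{\varepsilon\varepsilon})] \exp(-\beta\bar{\varepsilon})\, d\varepsilon \tag{341} $$

$$ = \int_0^{\infty} \sigma\varepsilon \sec\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \exp(j\Phi_{\sigma\varepsilon})\, d\varepsilon \tag{342} $$

$$ = \int_0^{\pi/3} \sigma\varepsilon^2 \csc\beta_{\varepsilon\varepsilon} \exp[j(2\theta_\varepsilon + \theta_\sigma + \beta_{\varepsilon\varepsilon})] \exp(-\beta\bar{\varepsilon})\, d\theta_\varepsilon \tag{343} $$

$$ = \int_0^{\pi/3} \sigma\varepsilon^2 \csc\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \exp(j\Phi_{\sigma\varepsilon})\, d\theta_\varepsilon \tag{344} $$

where

$$ \Phi_{\sigma\varepsilon} = 2\theta_\varepsilon + \theta_\sigma - \beta\varepsilon\sin\theta_\varepsilon + \beta_{\varepsilon\varepsilon} \tag{345} $$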

The real and imaginary parts of $\bar{F}$ are then obtained from equations (342) and
(344) as

$$ F_R = \int_0^{\infty} \sigma\varepsilon \sec\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \cos\Phi_{\sigma\varepsilon}\, d\varepsilon \tag{346} $$

$$ = \int_0^{\pi/3} \sigma\varepsilon^2 \csc\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \cos\Phi_{\sigma\varepsilon}\, d\theta_\varepsilon \tag{347} $$

$$ F_I = \int_0^{\infty} \sigma\varepsilon \sec\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \sin\Phi_{\sigma\varepsilon}\, d\varepsilon \tag{348} $$

$$ = \int_0^{\pi/3} \sigma\varepsilon^2 \csc\beta_{\varepsilon\varepsilon} \exp(-\beta\varepsilon\cos\theta_\varepsilon) \sin\Phi_{\sigma\varepsilon}\, d\theta_\varepsilon \tag{349} $$

from which $F$ and $\theta_F$ are determined to be

$$ F = (F_R^2 + F_I^2)^{1/2} \qquad \tan\theta_F = F_I/F_R \tag{350} $$

which are required for equation (336). The internal phase angle of the single
particle kinetic energy is distributed in the range $0 < \theta_\varepsilon < \pi/3$ because for a
coherent spacetime state with $\beta_{pp} = \pi/2$ and $\beta_{tt} = \pi/2$ it follows from equations
(23) and (51) that $\theta_\varepsilon = 2(\theta_p - \theta_t)$ with the upper limits of $\theta_p$ and $\theta_t$ being
$\theta_p = \pi/3$ and $\theta_t = \pi/6$.32

The thermonuclear reaction rate in a Boltzmann gas for the general case
of partially coherent spacetime is then obtained from equations (327) and (336)
as

$$ \bar{R} = \bar{n}_1 \bar{n}_2 (2/m)^{1/2}\, \bar{F}/\bar{A} = n_1 n_2 (2/m)^{1/2} (F/A) \exp[j(\theta_{n1} + \theta_{n2} + \theta_F - \theta_A)] \tag{351} $$

where $A$ and $\theta_A$ are given by equation (79) and $F$ and $\theta_F$ are given by equation
(350). The real and imaginary parts of equation (351) are given by

$$ R_R = n_1 n_2 (2/m)^{1/2} (F/A) \cos(\theta_{n1} + \theta_{n2} + \theta_F - \theta_A) \tag{352} $$

$$ R_I = n_1 n_2 (2/m)^{1/2} (F/A) \sin(\theta_{n1} + \theta_{n2} + \theta_F - \theta_A) \tag{353} $$

The measured nuclear reaction rate is given by $R_m = R_R$. For the case when all
internal phase angles are set equal to zero the measured thermonuclear reaction
rate given by equation (352) reduces to the standard incoherent spacetime
reaction rate given by equation (325).

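In the presence of a very strong gravity field the spacetime can become
coherent, and following equation (351) the thermonuclear reaction rate in a
Boltzmann gas is written as

$$ \bar{R}^{coh} = \bar{n}_1^{coh} \bar{n}_2^{coh} (2/m)^{1/2}\, \bar{F}^{coh}/\bar{A}^{coh} \tag{354} $$

where $\bar{A}^{coh}$ is given by equations (104) through (108), and where for coherent
spacetime with $\beta_{\varepsilon\varepsilon} = \pi/2$ and $\varepsilon = \varepsilon_c$ = constant it follows from equations (343)
and (344) that

$$ \bar{F}^{coh} = \varepsilon_c^2 \bar{C} = \varepsilon_c^2 (C_R + jC_I) = \varepsilon_c^2 C \exp(j\theta_C) \tag{355} $$

where

$$ \bar{C} = j \int_0^{\pi/3} \sigma \exp[j(2\theta_\varepsilon + \theta_\sigma)] \exp(-\beta\bar{\varepsilon})\, d\theta_\varepsilon \tag{356} $$

$$ = j \int_0^{\pi/3} \sigma \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \exp[j(2\theta_\varepsilon + \theta_\sigma - \beta\varepsilon_c\sin\theta_\varepsilon)]\, d\theta_\varepsilon \tag{357} $$

where $\varepsilon_c$ = constant, $\sigma = \sigma(\theta_\varepsilon)$ and $\theta_\sigma = \theta_\sigma(\theta_\varepsilon)$. The real and imaginary parts
of equation (357) are given by

$$ C_R = \int_0^{\pi/3} \sigma \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \sin(\beta\varepsilon_c\sin\theta_\varepsilon - \theta_\sigma - 2\theta_\varepsilon)\, d\theta_\varepsilon \tag{358} $$

$$ C_I = \int_0^{\pi/3} \sigma \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \cos(\beta\varepsilon_c\sin\theta_\varepsilon - \theta_\sigma - 2\theta_\varepsilon)\, d\theta_\varepsilon \tag{359} $$

and therefore

$$ C = (C_R^2 + C_I^2)^{1/2} \qquad \tan\theta_C = C_I/C_R \tag{360} $$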

The expression for the thermonuclear reaction rate in a Boltzmann gas that is
located in coherent spacetime is then obtained from equations (105), (354) and
(355) as

$$ \bar{R}^{coh} = \bar{n}_1^{coh} \bar{n}_2^{coh} (2/m)^{1/2} \varepsilon_c^{1/2}\, \bar{C}/\bar{W} = n_1^{coh} n_2^{coh} (2/m)^{1/2} \varepsilon_c^{1/2} (C/W) \exp[j(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_C - \theta_W)] \tag{361} $$

where $\bar{W}$, $W$ and $\theta_W$ are given by equations (104) through (110), and $\bar{C}$, $C$ and $\theta_C$
are given by equations (356) through (360). The real and imaginary parts of the
thermonuclear reaction rate given in equation (361) are written as

$$ R_R^{coh} = n_1^{coh} n_2^{coh} (2/m)^{1/2} \varepsilon_c^{1/2} (C/W) \cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_C - \theta_W) \tag{362} $$

$$ R_I^{coh} = n_1^{coh} n_2^{coh} (2/m)^{1/2} \varepsilon_c^{1/2} (C/W) \sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_C - \theta_W) \tag{363} $$

The measured thermonuclear reaction rate is then given by $R_m^{coh} = R_R^{coh}$.

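If the fusion cross section is independent of $\theta_\varepsilon$, it follows from
equations (355) through (360) that for coherent spacetime

$$ \bar{F}^{coh} = \bar{\sigma}_c \varepsilon_c^2 \bar{D} = \bar{\sigma}_c \varepsilon_c^2 (D_R + jD_I) = \sigma_c \varepsilon_c^2 D \exp[j(\theta_{\sigma c} + \theta_D)] \tag{364} $$

where the constant fusion cross section is written as

$$ \bar{\sigma}_c = \sigma_c \exp(j\theta_{\sigma c}) \tag{365} $$

and where

$$ \bar{D} = j \int_0^{\pi/3} \exp(j2\theta_\varepsilon) \exp(-\beta\bar{\varepsilon})\, d\theta_\varepsilon = j \int_0^{\pi/3} \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \exp[j(2\theta_\varepsilon - \beta\varepsilon_c\sin\theta_\varepsilon)]\, d\theta_\varepsilon \tag{366} $$

The real and imaginary components of equation (366) are given by

$$ D_R = \int_0^{\pi/3} \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \sin(\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon)\, d\theta_\varepsilon \tag{367} $$

$$ D_I = \int_0^{\pi/3} \exp(-\beta\varepsilon_c\cos\theta_\varepsilon) \cos(\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon)\, d\theta_\varepsilon \tag{368} $$

so that

$$ D = (D_R^2 + D_I^2)^{1/2} \qquad \tan\theta_D = D_I/D_R \tag{369} $$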
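
The angular integrals in equations (366) through (369) have no simple closed form for general $\beta\varepsilon_c$, but they are easy to evaluate by quadrature. The following minimal sketch uses a Simpson rule and hypothetical function names; it compares the complex integral in equation (366) with the real and imaginary parts in equations (367) and (368), taking the dimensionless product $\beta\varepsilon_c$ as the only input.

```python
import cmath
import math


def simpson(func, a, b, n=400):
    """Composite Simpson rule on [a, b]; func may return complex values."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def d_bar(beta_eps_c):
    """Complex integral of equation (366) with eps_bar = eps_c exp(j theta)."""
    def integrand(theta):
        return cmath.exp(2j * theta) * cmath.exp(-beta_eps_c * cmath.exp(1j * theta))
    return 1j * simpson(integrand, 0.0, math.pi / 3.0)


def d_parts(beta_eps_c):
    """Real and imaginary parts D_R and D_I from equations (367) and (368)."""
    def weight(theta):
        return math.exp(-beta_eps_c * math.cos(theta))

    def phase(theta):
        return beta_eps_c * math.sin(theta) - 2.0 * theta

    d_r = simpson(lambda t: weight(t) * math.sin(phase(t)), 0.0, math.pi / 3.0)
    d_i = simpson(lambda t: weight(t) * math.cos(phase(t)), 0.0, math.pi / 3.0)
    return d_r, d_i


def d_modulus_and_angle(beta_eps_c):
    """D and theta_D of equation (369), using atan2 for the quadrant."""
    d_r, d_i = d_parts(beta_eps_c)
    return math.hypot(d_r, d_i), math.atan2(d_i, d_r)
```

In the limit $\beta\varepsilon_c \to 0$ the integral can be done by hand and gives $\bar{D} = -3/4 + j\sqrt{3}/4$, which serves as a check on the signs in equations (367) and (368).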

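Then equations (104), (354) and (364) give the thermonuclear reaction rate in a
Boltzmann gas that is located in a gravity field that is sufficiently strong
to make spacetime coherent

$$ \bar{R}^{coh} = \bar{n}_1^{coh} \bar{n}_2^{coh} (2/m)^{1/2} \bar{\sigma}_c \varepsilon_c^{1/2}\, \bar{D}/\bar{W} \tag{370} $$

$$ = \bar{n}_1^{coh} \bar{n}_2^{coh} \bar{\sigma}_c v_c\, \bar{D}/\bar{W} \tag{371} $$

where $\bar{D}$ is given by equation (366), $\bar{W}$ is given by equations (104) and (106),
and where $v_c$ = coherent particle velocity which is given by

$$ \tfrac{1}{2} m v_c^2 = \tfrac{3}{2} kT_c \qquad v_c = (3kT_c/m)^{1/2} \tag{372} $$

where $T_c$ = coherent spacetime temperature for a Boltzmann gas. The quantities
$\bar{\sigma}_c$, $\bar{D}$ and $\bar{W}$ are functions of $T$ and $T_c$

$$ \bar{\sigma}_c = \bar{\sigma}_c(T,T_c) \qquad \bar{D} = \bar{D}(T,T_c) \qquad \bar{W} = \bar{W}(T,T_c) \tag{373} $$

Equation (371) can be rewritten as

$$ \bar{R}^{coh} = \bar{n}_1^{coh} \bar{n}_2^{coh} \bar{\sigma}_c (3kT_c/m)^{1/2}\, \bar{D}/\bar{W} = n_1^{coh} n_2^{coh} \sigma_c (3kT_c/m)^{1/2} (D/W) \exp[j(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_D - \theta_W)] \tag{374} $$
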


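where $W$ and $\theta_W$ are given in equation (110). The real and imaginary parts of the
coherent spacetime thermonuclear reaction rate for a Boltzmann gas are obtained
from equation (374) as

$$ R_R^{coh} = n_1^{coh} n_2^{coh} \sigma_c (3kT_c/m)^{1/2} (D/W) \cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_D - \theta_W) \tag{375} $$

$$ R_I^{coh} = n_1^{coh} n_2^{coh} \sigma_c (3kT_c/m)^{1/2} (D/W) \sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_D - \theta_W) \tag{376} $$

The measured thermonuclear reaction rate for this case is given by $R_m^{coh} = R_R^{coh}$.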


The conventional incoherent spacetime calculation of the thermonuclear
reaction rate given in equations (325) and (326) should be compared to the
coherent spacetime prediction of the thermonuclear reaction rate that is given in
equation (375). The coherent spacetime fusion reaction rate is expected to be
more sensitive to ambient density and temperature because the average of the
cross section over a full range of energies, which appears in the conventional
calculation in equations (325) and (326), does not appear in equation (375)
for the coherent spacetime case. The cross section $\sigma_c(T,T_c)$ is
sharply peaked at those temperatures that correspond to resonance energies.
Therefore for the coherent spacetime case the energy generation rates in stars
will be high only in selected temperature zones that correspond to definite
radial distances from the center of a star. Elsewhere in the star the nuclear
reaction rates will be low.


B.  Thermonuclear  Reactions  in  Fermi  Gases  in  the  Presence  of  Gravity. 

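For a Fermi gas in broken symmetry spacetime the average quantity that
appears in the thermonuclear reaction rate formula given in equation (327) is
evaluated using equation (167) for the complex particle number as follows

$$\langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle = \bar{K}_N/\bar{K}_D \tag{377}$$

$$\bar{K}_N = \int \bar{\sigma}\bar{\varepsilon}\,[\exp(\bar{\xi}) + 1]^{-1}\,d\bar{\varepsilon} \tag{378}$$

$$\bar{K}_D = \int \bar{\varepsilon}^{1/2}[\exp(\bar{\xi}) + 1]^{-1}\,d\bar{\varepsilon} \tag{379}$$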

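where $\bar{\xi}$ is given by equation (168). A factor $fV'$ has been cancelled from the
numerator and denominator of equation (377). The series expansion given in
equation (169) can be applied to equations (378) and (379) with the result

$$\bar{K}_N = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}^{\nu}\bar{M}_\nu = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}M_\nu\exp[j(\nu\theta_z + \theta_{M\nu})] \tag{380}$$

$$\bar{K}_D = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}^{\nu}\bar{H}_\nu = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}H_\nu\exp[j(\nu\theta_z + \theta_{H\nu})] \tag{381}$$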
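The series used in equations (380) and (381) rests on the geometric expansion of the Fermi factor, which holds for a complex fugacity as long as the modulus of z exp(-beta eps) is less than one. A minimal sketch of that check follows; the numerical values of the fugacity and of x = beta eps, and the function names, are assumptions for illustration.

```python
import cmath


def fermi_factor(z, x):
    """Closed form [z**-1 * exp(x) + 1]**-1 for complex fugacity z and x = beta*eps."""
    return 1.0 / (cmath.exp(x) / z + 1.0)


def fermi_series(z, x, terms=200):
    """Partial sum of sum_{v>=1} (-1)**(v-1) * z**v * exp(-v*x)."""
    q = z * cmath.exp(-x)  # expansion parameter; the series needs |q| < 1
    return sum((-1) ** (v - 1) * q ** v for v in range(1, terms + 1))


z = 0.6 * cmath.exp(0.3j)  # hypothetical fugacity: magnitude 0.6, internal phase 0.3 rad
x = 0.8                    # hypothetical value of beta * eps
closed = fermi_factor(z, x)
series = fermi_series(z, x)
print(closed, series)
```

Each term of the sum carries the phase factor exp(j nu theta_z), which is the origin of the angles nu theta_z in the component forms.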


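where $\bar{H}_\nu$, $H_\nu$ and $\theta_{H\nu}$ are given by equations (171) through (178) and $\bar{M}_\nu$ is
given as follows

$$\bar{M}_\nu = \int \bar{\sigma}\bar{\varepsilon}\exp(-\nu\beta\bar{\varepsilon})\,d\bar{\varepsilon} = M_\nu\exp(j\theta_{M\nu}) \tag{382}$$

$$= \int_0^{\infty} \sigma\varepsilon\sec\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\exp(j\Phi_{\sigma\varepsilon\nu})\,d\varepsilon \tag{383}$$

$$= \int_0^{\pi/3} \sigma\varepsilon^2\csc\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\exp(j\Phi_{\sigma\varepsilon\nu})\,d\theta_\varepsilon \tag{384}$$

where

$$\Phi_{\sigma\varepsilon\nu} = 2\theta_\varepsilon + \theta_\sigma - \nu\beta\varepsilon\sin\theta_\varepsilon + \beta_{\varepsilon\varepsilon} \tag{385}$$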

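The real and imaginary components of equations (383) and (384) are

$$M_{\nu R} = \int_0^{\infty} \sigma\varepsilon\sec\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\cos(\Phi_{\sigma\varepsilon\nu})\,d\varepsilon \tag{386}$$

$$= \int_0^{\pi/3} \sigma\varepsilon^2\csc\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\cos(\Phi_{\sigma\varepsilon\nu})\,d\theta_\varepsilon \tag{387}$$

$$M_{\nu I} = \int_0^{\infty} \sigma\varepsilon\sec\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\sin(\Phi_{\sigma\varepsilon\nu})\,d\varepsilon \tag{388}$$

$$= \int_0^{\pi/3} \sigma\varepsilon^2\csc\beta_{\varepsilon\varepsilon}\exp(-\nu\beta\varepsilon\cos\theta_\varepsilon)\sin(\Phi_{\sigma\varepsilon\nu})\,d\theta_\varepsilon \tag{389}$$

from which it follows that

$$M_\nu = (M_{\nu R}^2 + M_{\nu I}^2)^{1/2} \qquad \tan\theta_{M\nu} = M_{\nu I}/M_{\nu R} \tag{390}$$

The thermonuclear reaction rate is then given by

$$\bar{R} = \bar{n}_1\bar{n}_2(2/m)^{1/2}\bar{K}_N/\bar{K}_D \tag{391}$$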


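where $\bar{K}_N$ and $\bar{K}_D$ are given by equations (380) and (381). Then the real and
imaginary parts of the nuclear reaction rate in a Fermi gas are given by

$$R_R = n_1n_2(2/m)^{1/2}(K_N/K_D)\cos(\theta_{n1} + \theta_{n2} + \theta_{KN} - \theta_{KD}) \tag{392}$$

$$R_I = n_1n_2(2/m)^{1/2}(K_N/K_D)\sin(\theta_{n1} + \theta_{n2} + \theta_{KN} - \theta_{KD}) \tag{393}$$

where

$$K_N = (K_{NR}^2 + K_{NI}^2)^{1/2} \qquad \tan\theta_{KN} = K_{NI}/K_{NR} \tag{394}$$

$$K_D = (K_{DR}^2 + K_{DI}^2)^{1/2} \qquad \tan\theta_{KD} = K_{DI}/K_{DR} \tag{395}$$

$$K_{NR} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}M_\nu\cos(\nu\theta_z + \theta_{M\nu}) \tag{396}$$

$$K_{NI} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}M_\nu\sin(\nu\theta_z + \theta_{M\nu}) \tag{397}$$

$$K_{DR} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}H_\nu\cos(\nu\theta_z + \theta_{H\nu}) \tag{398}$$

$$K_{DI} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z^{\nu}H_\nu\sin(\nu\theta_z + \theta_{H\nu}) \tag{399}$$

where $M_\nu$ and $\theta_{M\nu}$ are given by equation (390), and $H_\nu$ and $\theta_{H\nu}$ are given by
equation (178). The measured thermonuclear reaction rate is given by $R_m = R_R$.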

Consider now the case of thermonuclear reactions in a Fermi gas that is
located in incoherent spacetime (zero gravity field). For this case all
internal phase angles are zero and

$$\langle\sigma\varepsilon^{1/2}\rangle = K_N^{inc}/K_D^{inc} \tag{400}$$

where

$$K_N^{inc} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{inc}^{\nu}M_\nu^{inc} \tag{401}$$

$$K_D^{inc} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{inc}^{\nu}H_\nu^{inc} \tag{402}$$

where $H_\nu^{inc}$ is given by equation (183) and $M_\nu^{inc}$ is obtained from equation (386)
to be

$$M_\nu^{inc} = \int_0^{\infty}\sigma\varepsilon\exp(-\nu\beta\varepsilon)\,d\varepsilon \tag{403}$$

which is similar to equation (326) for a Boltzmann gas that is located in
incoherent spacetime. For the case of incoherent spacetime the thermonuclear
reaction rate in a Fermi gas is given by

$$R^{inc} = n_1n_2(2/m)^{1/2}K_N^{inc}/K_D^{inc} \tag{404}$$

where $n_1$ and $n_2$ are given by equation (324).

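The thermonuclear reaction rate for a Fermi gas that is located in a
coherent spacetime state associated with an intense gravity field requires the
calculation of the following average value

$$\langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle = \bar{K}_N^{coh}/\bar{K}_D^{coh} \tag{405}$$

where

$$\bar{K}_N^{coh} = K_N^{coh}\exp(j\theta_{KN}^{coh}) = j\int_0^{\pi/3}\bar{\sigma}\bar{\varepsilon}^2[\exp(\bar{\xi}_{coh}) + 1]^{-1}\,d\theta_\varepsilon \tag{406}$$

$$\bar{K}_D^{coh} = K_D^{coh}\exp(j\theta_{KD}^{coh}) = j\int_0^{\pi/3}\bar{\varepsilon}^{3/2}[\exp(\bar{\xi}_{coh}) + 1]^{-1}\,d\theta_\varepsilon \tag{407}$$

where

$$\bar{\xi}_{coh} = \beta(\bar{\varepsilon} - \bar{\mu}_{coh}) \qquad \bar{z}_{coh} = \exp(\beta\bar{\mu}_{coh}) \tag{408}$$

A phase space factor $fV\theta_V$ has been cancelled from the numerator and denominator
of equation (405). The integrals in equations (406) and (407) can be rewritten as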


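$$\bar{K}_N^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}_{coh}^{\nu}\bar{M}_\nu^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}M_\nu^{coh}\exp[j(\nu\theta_z^{coh} + \theta_{M\nu}^{coh})] \tag{409}$$

$$\bar{K}_D^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}_{coh}^{\nu}\bar{H}_\nu^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}H_\nu^{coh}\exp[j(\nu\theta_z^{coh} + \theta_{H\nu}^{coh})] \tag{410}$$

where $\bar{H}_\nu^{coh}$, $H_\nu^{coh}$ and $\theta_{H\nu}^{coh}$ are given by equations (188) through (194), and
$\bar{M}_\nu^{coh}$ is given by

$$\bar{M}_\nu^{coh} = M_\nu^{coh}\exp(j\theta_{M\nu}^{coh}) = j\varepsilon_c^2\int_0^{\pi/3}\bar{\sigma}\exp(-\nu\beta\bar{\varepsilon})\exp(j2\theta_\varepsilon)\,d\theta_\varepsilon \tag{411}$$

$$= j\varepsilon_c^2\int_0^{\pi/3}\sigma\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\exp[j(2\theta_\varepsilon + \theta_\sigma - \nu\beta\varepsilon_c\sin\theta_\varepsilon)]\,d\theta_\varepsilon$$

The real and imaginary components of equation (411) are given by

$$M_{\nu R}^{coh} = \varepsilon_c^2\int_0^{\pi/3}\sigma\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\sin(\nu\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon - \theta_\sigma)\,d\theta_\varepsilon \tag{412}$$

$$M_{\nu I}^{coh} = \varepsilon_c^2\int_0^{\pi/3}\sigma\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\cos(\nu\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon - \theta_\sigma)\,d\theta_\varepsilon \tag{413}$$

and therefore

$$M_\nu^{coh} = [(M_{\nu R}^{coh})^2 + (M_{\nu I}^{coh})^2]^{1/2} \tag{414}$$

$$\tan\theta_{M\nu}^{coh} = M_{\nu I}^{coh}/M_{\nu R}^{coh} \tag{415}$$

The component forms of equation (409) are given by

$$K_{NR}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}M_\nu^{coh}\cos(\nu\theta_z^{coh} + \theta_{M\nu}^{coh}) \tag{416}$$

$$K_{NI}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}M_\nu^{coh}\sin(\nu\theta_z^{coh} + \theta_{M\nu}^{coh}) \tag{417}$$

and the component forms of equation (410) are

$$K_{DR}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}H_\nu^{coh}\cos(\nu\theta_z^{coh} + \theta_{H\nu}^{coh}) \tag{418}$$

$$K_{DI}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}H_\nu^{coh}\sin(\nu\theta_z^{coh} + \theta_{H\nu}^{coh}) \tag{419}$$

which gives

$$K_N^{coh} = [(K_{NR}^{coh})^2 + (K_{NI}^{coh})^2]^{1/2} \tag{420}$$

$$K_D^{coh} = [(K_{DR}^{coh})^2 + (K_{DI}^{coh})^2]^{1/2} \tag{421}$$

$$\tan\theta_{KN}^{coh} = K_{NI}^{coh}/K_{NR}^{coh} \tag{422}$$

$$\tan\theta_{KD}^{coh} = K_{DI}^{coh}/K_{DR}^{coh} \tag{423}$$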


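Then the general expression for the thermonuclear reaction rate for a Fermi gas
in coherent spacetime is obtained from equations (327) and (405) through (423)
to be

$$\bar{R}^{coh} = \bar{n}_1^{coh}\bar{n}_2^{coh}(2/m)^{1/2}\bar{K}_N^{coh}/\bar{K}_D^{coh} \tag{424}$$

whose real and imaginary parts are written as

$$R_R^{coh} = n_1^{coh}n_2^{coh}(2/m)^{1/2}(K_N^{coh}/K_D^{coh})\cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{KN}^{coh} - \theta_{KD}^{coh}) \tag{425}$$

$$R_I^{coh} = n_1^{coh}n_2^{coh}(2/m)^{1/2}(K_N^{coh}/K_D^{coh})\sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{KN}^{coh} - \theta_{KD}^{coh}) \tag{426}$$

The measured reaction rate for this case is given by $R_m^{coh} = R_R^{coh}$.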


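Some simplification can be made for the case of fusion reactions occurring
in a Fermi gas with coherent spacetime if the thermonuclear cross section is a
constant which is independent of the internal phase angles, and for this case
equations (411) through (415) become

$$\bar{M}_\nu^{coh} = \bar{\sigma}_c\varepsilon_c^2\bar{D}_\nu \tag{427}$$

$$M_{\nu R}^{coh} = \sigma_c\varepsilon_c^2D_\nu\cos(\theta_{\sigma c} + \theta_{D\nu}) \tag{428}$$

$$M_{\nu I}^{coh} = \sigma_c\varepsilon_c^2D_\nu\sin(\theta_{\sigma c} + \theta_{D\nu}) \tag{429}$$

$$M_\nu^{coh} = \sigma_c\varepsilon_c^2D_\nu \tag{430}$$

$$\theta_{M\nu}^{coh} = \theta_{\sigma c} + \theta_{D\nu} \tag{431}$$

where

$$\bar{D}_\nu = j\int_0^{\pi/3}\exp(-\nu\beta\bar{\varepsilon})\exp(j2\theta_\varepsilon)\,d\theta_\varepsilon \tag{432}$$

$$= j\int_0^{\pi/3}\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\exp[j(2\theta_\varepsilon - \nu\beta\varepsilon_c\sin\theta_\varepsilon)]\,d\theta_\varepsilon$$

The real and imaginary parts of equation (432) are given by

$$D_{\nu R} = \int_0^{\pi/3}\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\sin(\nu\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon)\,d\theta_\varepsilon \tag{433}$$

$$D_{\nu I} = \int_0^{\pi/3}\exp(-\nu\beta\varepsilon_c\cos\theta_\varepsilon)\cos(\nu\beta\varepsilon_c\sin\theta_\varepsilon - 2\theta_\varepsilon)\,d\theta_\varepsilon \tag{434}$$

from which it follows that

$$D_\nu = (D_{\nu R}^2 + D_{\nu I}^2)^{1/2} \qquad \tan\theta_{D\nu} = D_{\nu I}/D_{\nu R} \tag{435}$$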


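Therefore for this case equations (409) and (410) can be written for the Fermi
gas in coherent spacetime as

$$\bar{K}_N^{coh} = \bar{\sigma}_c\varepsilon_c^2\bar{G}_N^{coh} \tag{436}$$

$$\bar{K}_D^{coh} = \varepsilon_c^{3/2}\bar{G}_D^{coh} \tag{437}$$

where

$$\bar{G}_N^{coh} = G_N^{coh}\exp(j\theta_{GN}^{coh}) = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}_{coh}^{\nu}\bar{D}_\nu \tag{438}$$

$$\bar{G}_D^{coh} = G_D^{coh}\exp(j\theta_{GD}^{coh}) = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}\bar{z}_{coh}^{\nu}\bar{E}_\nu \tag{439}$$

where equations (437) and (439) are obtained from equation (410) by using
equation (188) where $\bar{E}_\nu$ is given by equation (190). The magnitudes and phase
angles that appear in equations (438) and (439) are calculated as follows

$$G_N^{coh} = [(G_{NR}^{coh})^2 + (G_{NI}^{coh})^2]^{1/2} \tag{440}$$

$$G_D^{coh} = [(G_{DR}^{coh})^2 + (G_{DI}^{coh})^2]^{1/2} \tag{441}$$

$$\tan\theta_{GN}^{coh} = G_{NI}^{coh}/G_{NR}^{coh} \tag{442}$$

$$\tan\theta_{GD}^{coh} = G_{DI}^{coh}/G_{DR}^{coh} \tag{443}$$

where from equations (438) and (439)

$$G_{NR}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}D_\nu\cos(\nu\theta_z^{coh} + \theta_{D\nu}) \tag{444}$$

$$G_{NI}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}D_\nu\sin(\nu\theta_z^{coh} + \theta_{D\nu}) \tag{445}$$

$$G_{DR}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}E_\nu\cos(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{446}$$

$$G_{DI}^{coh} = \sum_{\nu=1}^{\infty}(-1)^{\nu-1}z_{coh}^{\nu}E_\nu\sin(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{447}$$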


where $D_\nu$ and $\theta_{D\nu}$ are given by equation (435), and $E_\nu$ and $\theta_{E\nu}$ are given by
equation (194). The thermonuclear reaction rate for a Fermi gas in coherent
spacetime is then obtained from equations (424), (436) and (437) as

$$\bar{R}^{coh} = \bar{n}_1^{coh}\bar{n}_2^{coh}(2/m)^{1/2}\bar{K}_N^{coh}/\bar{K}_D^{coh}$$

$$= \bar{n}_1^{coh}\bar{n}_2^{coh}(2/m)^{1/2}\bar{\sigma}_c\varepsilon_c^{1/2}\bar{G}_N^{coh}/\bar{G}_D^{coh}$$

$$= \bar{n}_1^{coh}\bar{n}_2^{coh}v_c\bar{\sigma}_c\bar{G}_N^{coh}/\bar{G}_D^{coh} \tag{448}$$

$$= n_1^{coh}n_2^{coh}v_c\sigma_c(G_N^{coh}/G_D^{coh})\exp[j(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_{GN}^{coh} - \theta_{GD}^{coh})]$$

where the number densities and their phase angles $\theta_{n1}^{coh}$ and $\theta_{n2}^{coh}$ are given by
equations (333) and (334), and $v_c$ = coherent spacetime relative particle speed
given by equation (372). The real and imaginary parts of the complex number
thermonuclear reaction rate given by equation (448) are

$$R_R^{coh} = n_1^{coh}n_2^{coh}v_c\sigma_c(G_N^{coh}/G_D^{coh})\cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_{GN}^{coh} - \theta_{GD}^{coh}) \tag{449}$$

$$R_I^{coh} = n_1^{coh}n_2^{coh}v_c\sigma_c(G_N^{coh}/G_D^{coh})\sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_{GN}^{coh} - \theta_{GD}^{coh}) \tag{450}$$

The measured thermonuclear reaction rate is given by $R_m^{coh} = R_R^{coh}$.

C.  Thermonuclear  Reactions  in  a  Bose  Gas  Located  in  a  Gravitational  Field. 


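For the case of thermonuclear reactions in a Bose gas situated in spacetime
with gravity induced broken internal coordinate symmetries the reaction
rate formula in equation (327) requires an averaged quantity which is evaluated
using the complex number Bose-Einstein distribution as follows

$$\langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle = \bar{L}_N/\bar{L}_D \tag{451}$$

where

$$\bar{L}_N = L_N\exp(j\theta_{LN}) \tag{452}$$

$$= \sum_p \bar{\sigma}\bar{\varepsilon}_p^{1/2}[\bar{z}^{-1}\exp(\beta\bar{\varepsilon}_p) - 1]^{-1}(fV')^{-1} \tag{453}$$

$$= \int \bar{\sigma}\bar{\varepsilon}\,[\exp(\bar{\xi}) - 1]^{-1}\,d\bar{\varepsilon} \tag{454}$$

$$\bar{L}_D = L_D\exp(j\theta_{LD}) \tag{455}$$

$$= \sum_p [\bar{z}^{-1}\exp(\beta\bar{\varepsilon}_p) - 1]^{-1}(fV')^{-1} \tag{456}$$

$$= \int \bar{\varepsilon}^{1/2}[\exp(\bar{\xi}) - 1]^{-1}\,d\bar{\varepsilon} + \bar{z}(1 - \bar{z})^{-1}(fV')^{-1} \tag{457}$$

where $\bar{\xi}$ and $f$ are given by equations (168) and (77) respectively. The reciprocal
square bracket terms in equations (452) through (457) can be expanded in a power
series in a manner similar to that in equations (284) through (286) with the
result that

$$\bar{L}_N = \sum_{\nu=1}^{\infty}\bar{z}^{\nu}\bar{M}_\nu \tag{458}$$

$$\bar{L}_D = \sum_{\nu=1}^{\infty}\bar{z}^{\nu}\bar{H}_\nu + \bar{z}(1 - \bar{z})^{-1}(fV')^{-1} \tag{459}$$

where $\bar{M}_\nu$ is given by equation (382) and $\bar{H}_\nu$ is given by equation (171). From
equations (452) through (459) it follows that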

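$$L_N = (L_{NR}^2 + L_{NI}^2)^{1/2} \tag{460}$$

$$L_D = (L_{DR}^2 + L_{DI}^2)^{1/2} \tag{461}$$

$$\tan\theta_{LN} = L_{NI}/L_{NR} \tag{462}$$

$$\tan\theta_{LD} = L_{DI}/L_{DR} \tag{463}$$

where

$$L_{NR} = \sum_{\nu=1}^{\infty}z^{\nu}M_\nu\cos(\nu\theta_z + \theta_{M\nu}) \tag{464}$$

$$L_{NI} = \sum_{\nu=1}^{\infty}z^{\nu}M_\nu\sin(\nu\theta_z + \theta_{M\nu}) \tag{465}$$

$$L_{DR} = \sum_{\nu=1}^{\infty}z^{\nu}H_\nu\cos(\nu\theta_z + \theta_{H\nu}) + g \tag{466}$$

$$L_{DI} = \sum_{\nu=1}^{\infty}z^{\nu}H_\nu\sin(\nu\theta_z + \theta_{H\nu}) + h \tag{467}$$

with

$$g = (fV')^{-1}z(\cos\theta_z - z)/(1 + z^2 - 2z\cos\theta_z) \tag{468}$$

$$h = (fV')^{-1}(z\sin\theta_z)/(1 + z^2 - 2z\cos\theta_z) \tag{469}$$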
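The quantities g and h in equations (468) and (469) are the real and imaginary parts of the ground state term, that is, of the complex fugacity divided by one minus the complex fugacity, scaled by the inverse phase space factor. The short sketch below checks this identity numerically; the values of z and of its phase angle, the choice of a unit phase space factor, and the function name are assumptions for illustration.

```python
import cmath
import math


def ground_state_terms(z, theta_z, fv=1.0):
    """Return (g, h) of equations (468) and (469); fv stands for the factor f*V'."""
    denom = 1.0 + z * z - 2.0 * z * math.cos(theta_z)
    g = z * (math.cos(theta_z) - z) / (fv * denom)
    h = z * math.sin(theta_z) / (fv * denom)
    return g, h


z, theta_z = 0.4, 0.7                  # hypothetical magnitude and phase of the fugacity
g, h = ground_state_terms(z, theta_z)
z_bar = z * cmath.exp(1j * theta_z)    # complex fugacity
direct = z_bar / (1.0 - z_bar)         # ground state term evaluated directly
print(g, h, direct)
```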

In the limit of high temperatures equations (71) and (72) show that $z \ll 1$ and
$\theta_z \approx 0$ and therefore $g \approx 0$ and $h \approx 0$. Then the thermonuclear reaction rate for
a Bose gas in the presence of gravity is obtained from equations (327) and
(451) to be

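$$\bar{R} = \bar{n}_1\bar{n}_2(2/m)^{1/2}\bar{L}_N/\bar{L}_D \tag{470}$$

The real and imaginary parts of the thermonuclear reaction rate are given by

$$R_R = n_1n_2(2/m)^{1/2}(L_N/L_D)\cos(\theta_{n1} + \theta_{n2} + \theta_{LN} - \theta_{LD}) \tag{471}$$

$$R_I = n_1n_2(2/m)^{1/2}(L_N/L_D)\sin(\theta_{n1} + \theta_{n2} + \theta_{LN} - \theta_{LD}) \tag{472}$$

The measured reaction rate is given by $R_m = R_R$.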

For incoherent spacetime the thermonuclear reaction rate of a Bose gas
is obtained by first noting that equation (451) becomes

$$\langle\sigma\varepsilon^{1/2}\rangle = L_N^{inc}/L_D^{inc} \tag{473}$$

where

$$L_N^{inc} = \sum_{\nu=1}^{\infty}z_{inc}^{\nu}M_\nu^{inc} \tag{474}$$

$$L_D^{inc} = \sum_{\nu=1}^{\infty}z_{inc}^{\nu}H_\nu^{inc} + (fV')^{-1}z_{inc}(1 - z_{inc})^{-1} \tag{475}$$

where $M_\nu^{inc}$ and $H_\nu^{inc}$ are given by equations (403) and (183). Finally, the
nuclear reaction rate for this case is obtained from equations (327) and (473) as

$$R^{inc} = n_1n_2(2/m)^{1/2}L_N^{inc}/L_D^{inc} \tag{476}$$

which is the standard result.

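For thermonuclear reactions in a Bose gas in the presence of gravity that
is so strong as to make the spacetime coherent, the relevant averaged quantity
that appears in equation (327) is written as

$$\langle\bar{\sigma}\bar{\varepsilon}^{1/2}\rangle = \bar{L}_N^{coh}/\bar{L}_D^{coh} = (L_N^{coh}/L_D^{coh})\exp[j(\theta_{LN}^{coh} - \theta_{LD}^{coh})] \tag{477}$$

where

$$\bar{L}_N^{coh} = j\int_0^{\pi/3}\bar{\sigma}\bar{\varepsilon}^2[\exp(\bar{\xi}_{coh}) - 1]^{-1}\,d\theta_\varepsilon \tag{478}$$

$$\bar{L}_D^{coh} = j\int_0^{\pi/3}\bar{\varepsilon}^{3/2}[\exp(\bar{\xi}_{coh}) - 1]^{-1}\,d\theta_\varepsilon + (fV\theta_V)^{-1}\bar{z}_{coh}(1 - \bar{z}_{coh})^{-1} \tag{479}$$

where $\bar{\xi}_{coh}$ is given by equation (408). Expanding the integrands in a power
series yields

$$\bar{L}_N^{coh} = \sum_{\nu=1}^{\infty}\bar{z}_{coh}^{\nu}\bar{M}_\nu^{coh} \tag{480}$$

$$\bar{L}_D^{coh} = \sum_{\nu=1}^{\infty}\bar{z}_{coh}^{\nu}\bar{H}_\nu^{coh} + (fV\theta_V)^{-1}\bar{z}_{coh}(1 - \bar{z}_{coh})^{-1} \tag{481}$$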
The component forms of equations (480) and (481) are written as

$$L_{NR}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}M_\nu^{coh}\cos(\nu\theta_z^{coh} + \theta_{M\nu}^{coh}) \tag{482}$$

$$L_{NI}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}M_\nu^{coh}\sin(\nu\theta_z^{coh} + \theta_{M\nu}^{coh}) \tag{483}$$

$$L_{DR}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}H_\nu^{coh}\cos(\nu\theta_z^{coh} + \theta_{H\nu}^{coh}) + g_{coh} \tag{484}$$

$$L_{DI}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}H_\nu^{coh}\sin(\nu\theta_z^{coh} + \theta_{H\nu}^{coh}) + h_{coh} \tag{485}$$

where $\bar{M}_\nu^{coh}$, $M_\nu^{coh}$ and $\theta_{M\nu}^{coh}$ are given by equations (411), (414) and (415); $H_\nu^{coh}$
and $\theta_{H\nu}^{coh}$ are given by equations (188) and (189); $z_{coh}$ and $\theta_z^{coh}$ are given by
equations (197) and (198), and where in analogy to equations (468) and (469)

$$g_{coh} = (fV\theta_V)^{-1}z_{coh}(\cos\theta_z^{coh} - z_{coh})/(1 + z_{coh}^2 - 2z_{coh}\cos\theta_z^{coh}) \tag{486}$$

$$h_{coh} = (fV\theta_V)^{-1}(z_{coh}\sin\theta_z^{coh})/(1 + z_{coh}^2 - 2z_{coh}\cos\theta_z^{coh}) \tag{487}$$


The magnitudes and phase angles that appear in equation (477) are given by

$$L_N^{coh} = [(L_{NR}^{coh})^2 + (L_{NI}^{coh})^2]^{1/2} \tag{488}$$

$$L_D^{coh} = [(L_{DR}^{coh})^2 + (L_{DI}^{coh})^2]^{1/2} \tag{489}$$

$$\tan\theta_{LN}^{coh} = L_{NI}^{coh}/L_{NR}^{coh} \tag{490}$$

$$\tan\theta_{LD}^{coh} = L_{DI}^{coh}/L_{DR}^{coh} \tag{491}$$

Then the general expression for the thermonuclear reaction rate in a Bose gas
located in coherent spacetime is given by

$$\bar{R}^{coh} = \bar{n}_1^{coh}\bar{n}_2^{coh}(2/m)^{1/2}\bar{L}_N^{coh}/\bar{L}_D^{coh} \tag{492}$$

$$R_R^{coh} = n_1^{coh}n_2^{coh}(2/m)^{1/2}(L_N^{coh}/L_D^{coh})\cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{LN}^{coh} - \theta_{LD}^{coh}) \tag{493}$$

$$R_I^{coh} = n_1^{coh}n_2^{coh}(2/m)^{1/2}(L_N^{coh}/L_D^{coh})\sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{LN}^{coh} - \theta_{LD}^{coh}) \tag{494}$$

The measured reaction rate is given by $R_m^{coh} = R_R^{coh}$.

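For the case when the thermonuclear reaction cross section for a Bose gas
in coherent spacetime is a constant which is independent of $\theta_\varepsilon$, then equations
(427), (480) and (481) give

$$\bar{L}_N^{coh} = \bar{\sigma}_c\varepsilon_c^2\bar{C}_N^{coh} \tag{495}$$

$$\bar{L}_D^{coh} = \varepsilon_c^{3/2}\bar{C}_D^{coh} + (fV\theta_V)^{-1}\bar{z}_{coh}(1 - \bar{z}_{coh})^{-1} \tag{496}$$

where

$$\bar{C}_N^{coh} = C_N^{coh}\exp(j\theta_{CN}^{coh}) = \sum_{\nu=1}^{\infty}\bar{z}_{coh}^{\nu}\bar{D}_\nu = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}D_\nu\exp[j(\nu\theta_z^{coh} + \theta_{D\nu})] \tag{497}$$

$$\bar{C}_D^{coh} = C_D^{coh}\exp(j\theta_{CD}^{coh}) = \sum_{\nu=1}^{\infty}\bar{z}_{coh}^{\nu}\bar{E}_\nu = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}E_\nu\exp[j(\nu\theta_z^{coh} + \theta_{E\nu})] \tag{498}$$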


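where $\bar{D}_\nu$ and $\bar{E}_\nu$ are given by equations (432) and (190). From equations (497)
and (498) it follows that

$$C_{NR}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}D_\nu\cos(\nu\theta_z^{coh} + \theta_{D\nu}) \tag{499}$$

$$C_{NI}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}D_\nu\sin(\nu\theta_z^{coh} + \theta_{D\nu}) \tag{500}$$

$$C_{DR}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}E_\nu\cos(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{501}$$

$$C_{DI}^{coh} = \sum_{\nu=1}^{\infty}z_{coh}^{\nu}E_\nu\sin(\nu\theta_z^{coh} + \theta_{E\nu}) \tag{502}$$

and therefore

$$C_N^{coh} = [(C_{NR}^{coh})^2 + (C_{NI}^{coh})^2]^{1/2} \tag{503}$$

$$C_D^{coh} = [(C_{DR}^{coh})^2 + (C_{DI}^{coh})^2]^{1/2} \tag{504}$$

$$\tan\theta_{CN}^{coh} = C_{NI}^{coh}/C_{NR}^{coh} \qquad \tan\theta_{CD}^{coh} = C_{DI}^{coh}/C_{DR}^{coh} \tag{505}$$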


For the case of a constant fusion cross section $\sigma_c$ independent of $\theta_\varepsilon$ it follows
from equations (495) and (496) that for a Bose gas in coherent spacetime

$$L_{NR}^{coh} = \sigma_c\varepsilon_c^2C_N^{coh}\cos(\theta_{\sigma c} + \theta_{CN}^{coh}) \tag{506}$$

$$L_{NI}^{coh} = \sigma_c\varepsilon_c^2C_N^{coh}\sin(\theta_{\sigma c} + \theta_{CN}^{coh}) \tag{507}$$

$$L_{DR}^{coh} = \varepsilon_c^{3/2}C_D^{coh}\cos\theta_{CD}^{coh} + g_{coh} \tag{508}$$

$$L_{DI}^{coh} = \varepsilon_c^{3/2}C_D^{coh}\sin\theta_{CD}^{coh} + h_{coh} \tag{509}$$

where $g_{coh}$ and $h_{coh}$ are given by equations (486) and (487). From equations
(488), (506) and (507) and from equations (489), (508) and (509) it follows that

$$L_N^{coh} = \sigma_c\varepsilon_c^2C_N^{coh} \tag{510}$$

$$L_D^{coh} = [\varepsilon_c^3(C_D^{coh})^2 + 2\varepsilon_c^{3/2}C_D^{coh}(g_{coh}\cos\theta_{CD}^{coh} + h_{coh}\sin\theta_{CD}^{coh}) + g_{coh}^2 + h_{coh}^2]^{1/2} \tag{511}$$

Combining equations (490), (506) and (507) gives

$$\theta_{LN}^{coh} = \theta_{\sigma c} + \theta_{CN}^{coh} \tag{512}$$

while $\theta_{LD}^{coh}$ is obtained from equations (491), (508) and (509)

$$\tan\theta_{LD}^{coh} = (\varepsilon_c^{3/2}C_D^{coh}\sin\theta_{CD}^{coh} + h_{coh})/(\varepsilon_c^{3/2}C_D^{coh}\cos\theta_{CD}^{coh} + g_{coh}) \tag{513}$$


Then  equations  (492)  through  (494)  coupled  with  equations  (510)  through  (513) 
give  the  thermonuclear  reaction  rate  for  a  Bose  gas  in  coherent  spacetime  and 
for  a  constant  fusion  cross  section. 


If the ground state of the Bose gas in coherent spacetime is unoccupied
due to high temperatures then $z_{coh} \ll 1$ and $\theta_z^{coh} \approx 0$ from equations (71) and
(72), so that equations (486) and (487) give $g_{coh} \approx 0$ and $h_{coh} \approx 0$ and equation
(511) becomes


$$L_D^{coh} = \varepsilon_c^{3/2}C_D^{coh} \tag{514}$$

while equation (513) gives for this case

$$\theta_{LD}^{coh} = \theta_{CD}^{coh} \tag{515}$$

Combining equations (493), (494), (510), (512), (514) and (515) gives the real
and imaginary parts of the thermonuclear reaction rate as

$$R_R^{coh} = n_1^{coh}n_2^{coh}v_c\sigma_c(C_N^{coh}/C_D^{coh})\cos(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_{CN}^{coh} - \theta_{CD}^{coh}) \tag{516}$$

$$R_I^{coh} = n_1^{coh}n_2^{coh}v_c\sigma_c(C_N^{coh}/C_D^{coh})\sin(\theta_{n1}^{coh} + \theta_{n2}^{coh} + \theta_{\sigma c} + \theta_{CN}^{coh} - \theta_{CD}^{coh}) \tag{517}$$

where $C_N^{coh}$, $C_D^{coh}$, $\theta_{CN}^{coh}$ and $\theta_{CD}^{coh}$ are defined in equations (497) through (505),
and where $v_c$ is given by equation (372).

There are three reasons for the lowered rates of some thermonuclear
reactions compared to the standard reaction rate calculations. The first reason is
that gravity induces a broken spacetime symmetry, and the thermonuclear reaction
cross section for coherent spacetime is evaluated at the prevailing single value
of the kinetic energy per nucleus $\varepsilon_c$ so that, unless this kinetic energy is
associated with a resonance value for the cross section, the thermonuclear
reaction rate will be low compared to the incoherent spacetime calculation of the
standard model which assumes a distribution of kinetic energies. The second
reason for the lowered thermonuclear reaction rate is the appearance of a
cosine factor in the predicted measured values of the reaction rate. The
argument of the cosine term consists of the sum of the internal phase angles of the
two reacting species particle number densities, the internal phase angle of the
fusion cross section, and the internal phase angles associated with the coherent
spacetime average of the relative speed of the interacting particles. The third
reason for the depressed thermonuclear reaction rate compared to the standard
prediction is the effect of a real gas state equation which enters the
renormalization group time equation given in equations (1A) and (1B).


6. CONCLUSION. It is suggested that in very strong gravitational fields
spacetime becomes coherent and that the motion in such a state occurs internally
so that the space and time coordinates, single particle momenta, and single
particle energies change coherently in that they rotate in internal space with
their magnitudes held fixed. The state equations for the noninteracting
Boltzmann, Fermi and Bose gases are developed for partially and totally coherent
spacetime. These complex number state equations reduce to the standard ideal
gas state equations for the case of incoherent spacetime. Thermonuclear
reaction rates in bulk matter are lowered, compared to the standard predictions
which assume an ideal gas, by the real gas nature of the state equations of
stellar matter as described by a renormalization group time equation. These
reaction rates will also be lowered by the degree of coherence of spacetime
because the values of the integrals over momentum or kinetic energy that are
required to calculate the thermonuclear reaction rates are dependent on the state
of coherence of spacetime. The thermonuclear reaction rates have been
evaluated for the ideal Boltzmann gas, Fermi gas and Bose gas for partially coherent
and totally coherent spacetime. The lowered thermonuclear reaction rates are
due to real gas effects in the relativistic equation for time and to the effects
of the coherence of spacetime on the reaction rate integrals for the ideal
gases, and may represent an explanation for the depressed values of the measured
solar neutrino emission flux compared to predictions of the standard solar model.
1.  Introduction  Fine scale morphology or microstructure is implicated in the macroscopic
behavior of many materials, but the manifestations of this are often unclear¹. We are in need of
improved methods for studying this frequently encountered situation. In this report we describe in
an expository fashion the initial developments of one such technique which has been applied in
several instances especially related to certain alloys or other crystalline materials. Good examples
where defect structures consisting of fine scale morphology are relatively simple are certain phase
transformations of displacive or structural type and the mechanical behavior of shape memory
alloys. Martensitic materials, in particular, exhibit fine twinned microstructures, often appearing as
layers or layers within layers². Although we often refer to microstructure, we are confronted with
a primarily continuum phenomenon for which some authors use the term mesoscale. In these
considerations, one issue is paramount: the presence of spatially oscillatory behavior and the
means of understanding it constitute the bridge from the fine scale to the large scale.

Crystals  are  idealized  as  materials  with  a  high  degree  of  configurational  order.  As  a 
consequence,  the  continuum  energy  densities  ascribed  to  them  are  invariant  under  discrete  groups 
and  have  multiple  potential  wells.  Such  densities  are  not  lower  semicontinuous.  The  infimum  of 
energy  may  be  obtained  only  in  some  generalized  sense,  while  a  minimizing  sequence  may  develop 
successively  finer  oscillations.  Said  in  another  fashion,  when  the  material  deforms  owing  to 
change  in  its  environment,  the  configurational  order  acts  as  a  constraint  resulting  in  the  creation  of 
a  defect  structure,  which  in  this  case  is  a  complicated  spatially  oscillatory  fine  structure.  The  limit 
deformation  alone  need  not  be  sufficient  to  characterize  many  of  the  properties  of  the  limit 
configuration. 

1  Supported  in  part  by  the  Army  Research  Office. 

2  For illustrations of oscillatory behavior in alloys and other materials see [1,2,3,4,5,27,53].




A  feature  of  the  constitutive  theory  under  discussion  is  that  surface  energies,  magnetic 
domain  wall  energies,  and  similar  effects  are  neglected,  although  the  highly  nonlinear  potential  well 
structure  for  the  material  has  a  prominent  role.  Thus  fine  phase  laminar  twin  systems  and  fine 
phase  magnetic  domain  structures  may  tend  to  limits  of  infinite  fineness.  The  theory  in  this 
formulation  delivers  useful  information  about  variant  arrangement  and  location  as  well  as 
macroscopic  state  functions  like  energy  and  stress.  It  is  particularly  useful  in  deciding  where  in  the 
body  fine  structure  will  arise. 

At the analytical level, we apply a recently developed averaging method, briefly explained
in §2 below, which accounts for rapidly spatially varying systems and accommodates the fine scale
microstructure. A configuration which minimizes a given variational principle is described in terms
of generalized moments of the minimizing sequence, or equivalently, oscillatory statistics. The
most important property of the method is to unify energetic and kinematic considerations by
compelling the statistics to be consistent with the variational principle.

Examples of this sort of analysis served to generalize the crystallographic theory of
martensite, Ball and James [1], and to compute the relaxation of energy densities in the presence of
symmetry, Chipot and Kinderlehrer [9] and Fonseca [26]. It has subsequently played a role in
many discussions related to microstructure, e.g., [2,4,5,10,12,13,14,15,25,27,28,30,31,32,
33,35,39,40,41,45,47]. A treatment of the variational foundations of this method is given in
[29,36,37,38]. Kohn [42,43] has shown how these ideas and those of relaxation in general are
consistent with the treatment of Khachaturyan and Roitburd, e.g., [34,49]. Here we shall briefly
explore two examples: a theory for highly magnetostrictive iron/rare earth alloys and a
mathematical example of evolution of fine structure. A major impetus for these investigations is to
provide a basis for the numerical computation of configurations with complicated microstructure.
A few selected results of these efforts will be reported.


2.  Local spatial averages  Young measures.  We describe the portrayal of microstructure or
fine structure by local spatial averages or Young measures. We also explain the mechanism by
which these averages serve to unify energetic and kinematic considerations. Since this may not be
familiar to most readers, we give some examples as well. A bounded sequence of functions or
more general fields, scalar, vector, or matrix valued,

$$f^k: \Omega \to \mathbb{R}^N, \qquad k = 1,2,3,\ldots \tag{2.1}$$

may describe a spatially oscillatory structure or system in the region $\Omega$. For example, if $\Omega$ is a
cube, $f_0$ a fixed periodic function, and

$$f^k(x) = f_0(kx),$$

the system represents spatial oscillations modulated in some fashion by $f_0$. A specific one
dimensional example is

$$f^k(x) = \sin \pi kx, \qquad 0 \le x \le 1, \quad k = 1,2,3,\ldots \tag{2.2}$$


Another one is

$$f^k(x) = \begin{cases} 1, & \dfrac{j-1}{k} < x < \dfrac{2j-1}{2k} \\[1ex] -1, & \dfrac{2j-1}{2k} \le x \le \dfrac{j}{k} \end{cases} \qquad 1 \le j \le k, \quad 0 \le x \le 1. \tag{2.3}$$

The general sequence $(f^k)$ may fail to converge pointwise or even in the mean, as the
examples (2.2) and (2.3) above illustrate. This, it turns out, is characteristic of the minimizing
sequences for functionals which lack lower semicontinuity and in particular of variational problems
associated to crystalline solids in the context of finite elasticity.


The behavior of the sequence may be grasped by computing limits of averages

$$\bar{f}(a) = \lim_{\rho \to 0}\,\lim_{k \to \infty}\,\frac{1}{|B_\rho|}\int_{B_\rho(a)} f^k\,dx, \tag{2.4}$$

where $|B_\rho|$ stands for the volume of the ball of radius $\rho$. This tells us only the average limit of
the sequence, however, and does not inform us of its particular oscillatory behavior. The technical
name for this convergence is weak* convergence. To overcome this, we calculate generalized
moments. Let $\psi$ be any continuous function and consider the sequence $(\psi(f^k))$. Although this
sequence need not converge, we may ascertain, as above, a weak limit function

$$\bar{\psi}(a) = \lim_{\rho \to 0}\,\lim_{k \to \infty}\,\frac{1}{|B_\rho|}\int_{B_\rho(a)} \psi(f^k)\,dx. \tag{2.5}$$


The association

$$\psi \to \bar{\psi}(a)$$

gives rise to an integral representation (a probability measure) on $\psi$,

$$\bar{\psi}(a) = \int_{\mathbb{R}^N} \psi(\lambda)\,d\nu_a(\lambda) \tag{2.6}$$

which has the property

$$\int_E \psi(f^k)\,dx \to \int_E \bar{\psi}\,dx \quad \text{for any subset } E \subset \Omega. \tag{2.7}$$

This collection of measures $\nu = (\nu_x)_{x \in \Omega}$ summarizes the statistics of the spatial
oscillations of the sequence. It was introduced by Young [54] to study control problems. Its first
use in differential equations is due to Tartar [50,51] who studied hyperbolic conservation laws.
They are measures defined on the range of the sequence $(f^k)$ which depend on the point $x \in \Omega$.

In  particular,  it  is  generally  incorrect  to  suppose  that  the  limit  of  a  minimizing  sequence 
realizes  the  infimum  of  energy  in  a  variational  principle  whose  minimizing  sequences  are  highly 
oscillatory.  The  minimum  energy  must  be  evaluated  using  (2.6). 

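Examples

For example, both the sequences of (2.2) and (2.3) have $\bar f(x) = 0$. On the other hand, for (2.2),

$$\bar\psi(a) = \frac{1}{\pi} \int_{-1}^{1} \psi(\lambda) \frac{d\lambda}{\sqrt{1 - \lambda^2}}, \quad 0 < a < 1, \qquad (2.8)$$

while for (2.3),

$$\bar\psi(a) = \tfrac12 (\psi(-1) + \psi(1)), \quad 0 < a < 1. \qquad (2.9)$$

The oscillatory statistics of the two sequences are thus quite different.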
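The contrast between (2.8) and (2.9) can be checked numerically. The following is a minimal sketch, not part of the original account: the helper names (`local_average`, `sine_wave`, `square_wave`), the choice of test functions, and the sign convention of the square wave (+1 on the first half of each period) are assumptions made for illustration. It averages $\psi(f^k)$ over a small interval around $a$ for a large $k$ and prints the values to compare with those predicted by (2.8) and (2.9).

```python
import math

def local_average(psi, f, a, rho, n=20000):
    """Midpoint-rule average of psi(f(x)) over the interval (a - rho, a + rho)."""
    h = 2.0 * rho / n
    total = sum(psi(f(a - rho + (i + 0.5) * h)) for i in range(n))
    return total / n

def sine_wave(k):
    """The sequence f^k of (2.2)."""
    return lambda x: math.sin(math.pi * k * x)

def square_wave(k):
    """A +/-1 square wave of period 1/k, in the spirit of (2.3) (sign convention assumed)."""
    return lambda x: 1.0 if (k * x) % 1.0 < 0.5 else -1.0

def identity(lam):
    return lam

def fourth_power(lam):
    return lam ** 4

k, a, rho = 400, 0.5, 0.1      # many oscillations inside (a - rho, a + rho)

mean_sine = local_average(identity, sine_wave(k), a, rho)
mean_square = local_average(identity, square_wave(k), a, rho)
m4_sine = local_average(fourth_power, sine_wave(k), a, rho)
m4_square = local_average(fourth_power, square_wave(k), a, rho)

# Predictions: both means vanish. For psi(lam) = lam^4, (2.8) gives
# (1/pi) * integral of lam^4 / sqrt(1 - lam^2) over (-1, 1) = 3/8,
# while (2.9) gives (1 + 1) / 2 = 1.
print(mean_sine, mean_square)
print(m4_sine, m4_square)
```

Both sequences have the same weak limit (zero), but their fourth moments differ, which is exactly the information the Young measure retains and the weak limit discards.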

Let us now give a simple, well known example of how oscillations may arise in a mathematical context: the familiar Young-Zermelo tacking problem [54]. Let $\varphi(\lambda)$ be a double well potential with equal wells at $-1$ and $1$, as depicted in Figure 1, and, with $\Omega = (0,1)$ an interval, set

$$I(v) = \int_\Omega ( \varphi(v') + v^2 ) \, dx. \qquad (2.10)$$


for $v \in H^{1,\infty}(\Omega)$.

A minimizing sequence $(u_k)$ for this functional wishes to enjoy both $u_k' = \pm 1$ for all $k$ and $u_k \to 0$ in $\Omega$. The result is the generation of oscillations, with a typical minimizing sequence given by $u_k$ with

$$u_k' = f^k \qquad (2.11)$$

with the $f^k$ defined in (2.3). The Young measure solution of the minimization problem is given by, cf. (2.9),

$$\nu = \tfrac12 ( \delta_{-1} + \delta_{+1} ). \qquad (2.12)$$

Figure 1. A typical double well potential in one variable.

In  this  example,  oscillations  are  created  by 
competition  between  the  two  terms  of  the 
functional.  In  multivariable  problems,  side 
conditions,  like  boundary  conditions,  are 
sufficient  to  give  rise  to  an  oscillatory  structure. 
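The competition between the two terms of (2.10) can be made concrete with a short computation. This is a sketch under stated assumptions: the specific potential $\varphi(\lambda) = (\lambda^2 - 1)^2$ and the function names are chosen here for illustration and are not taken from the text. The sawtooth functions with slopes $\pm 1$ drive the energy to zero like $1/(12 k^2)$, whereas their limit $u = 0$ has energy $\varphi(0) > 0$.

```python
def phi(lam):
    """Illustrative double well potential with wells at -1 and +1 (an assumed form)."""
    return (lam * lam - 1.0) ** 2

def sawtooth(k):
    """u_k with u_k(0) = 0 and slope +1 / -1 on alternate half-periods of length 1/(2k)."""
    def u(x):
        s = (k * x) % 1.0              # position within the current period, in [0, 1)
        return (s if s < 0.5 else 1.0 - s) / k
    def du(x):
        return 1.0 if (k * x) % 1.0 < 0.5 else -1.0
    return u, du

def energy(k, n_per_period=200):
    """Midpoint-rule value of I(u_k), the integral over (0, 1) of phi(u_k') + u_k^2."""
    u, du = sawtooth(k)
    n = k * n_per_period
    h = 1.0 / n
    return sum(phi(du((i + 0.5) * h)) + u((i + 0.5) * h) ** 2 for i in range(n)) * h

energies = {k: energy(k) for k in (1, 2, 4, 8)}
energy_of_limit = phi(0.0)             # I(0): the weak limit u = 0 pays the full well height

for k, value in energies.items():
    print(k, value, 1.0 / (12.0 * k * k))
print(energy_of_limit)
```

The infimum of the energy is approached but not attained by the limit function, which is why the minimum energy has to be evaluated through the Young measure rather than through the limit.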

Interestingly, it is difficult to decide this from a computational standpoint because the additional competition between the grid orientation and the particular kinematic organization of the configuration requires a sufficiently large computational domain as well as certain other features. We are examining these issues with Nicolaides.


The  propagation  of  oscillations,  and  even  the  convection  of  oscillations  is  an  important 
issue.  Tartar  has  investigated  this  in  some  detail  [52],  recently  introducing  the  H  measure  to 
account  for  aspects  of  the  frequency  distribution  of  a  sequence  as  well. 


3. Magnetostriction

A remarkable feature of ferromagnetic materials is that the single domain state is generally unstable. This contrasts with martensite, where the single variant configuration is stable for arbitrarily large samples. In the blue phase of cholesteric liquid crystals, the failure of stability of the uniform state relative to an array of defects is termed frustration. Our theory here could be regarded as one possible interpretation of this phenomenon at a macroscopic scale. The frustration in our system arises from the competition of an anisotropy energy, which demands constant magnetization strength and direction, with an induced field energy, which prefers to tend to zero. A consequence of this is to promote development of a fine scale structure which seeks to compromise the constraint of constant magnetization strength.

Certain iron/rare earth alloys display both frustration and a huge magnetostriction. There are cubic Laves phase RFe2 (R = rare earth) compounds, for example, where magnetically induced strains "overwhelm the conventional thermal expansion of the material", Clark [11]. TbDyFe2 (terfenol) solidifies from the melt with a complex, highly mobile domain structure consisting of structural domains and magnetic domains. Typical growth habits result in configurations with parallel twinned layers, cf. Figure 2, that persist in the magnetostrictive process. We have been studying this with a theory of magnetoelastic interactions based on the micromagnetics of W. F. Brown, Jr. [6,7,8] and the symmetry considerations introduced by Ericksen [16-24]. For a complete discussion, we refer to James and Kinderlehrer [32]. It has some similarities with Toupin's theory of the elastic dielectric [54]. We then apply it to the equilibrium microstructure of TbDyFe2. The primary mechanism of magnetostriction appears to be an exchange of stability of mechanical variants under the influence of a change in the magnetic field, but we do not discuss this in detail here.

For relatively rigid materials one may assume the free energy to depend on magnetization alone, [30,31]. The theory in this case gives good qualitative agreement with experiment, explaining why cubic magnets have a few large domains and why uniaxial ones have a fine structure. Domain refinement at the boundary is also predicted when the normal to the boundary has a suitable orientation relative to the crystal axes, in agreement with observations.



Figure  2.  Schematic  depiction  of  the  microstructure  in  a  sample  of  TbDyFe2 
illustrating  the  herringbone  structure  of  two  sets  of  laminar  fine  structures. 
Crystallographic  directions  are  with  reference  to  the  high  temperature 
nonmagnetic  phase. 


The variational principle is formulated in terms of a stored energy density which depends on the deformation gradient $F \in M$, the $3 \times 3$ matrices, the magnetization (per unit mass) $m \in \mathbb{R}^3$, and the temperature $\theta \in \mathbb{R}$. We suppose it given by a nonnegative function

$$W(F, m, \theta), \quad F \in M, \ m \in \mathbb{R}^3, \ \theta \in \mathbb{R}, \qquad (3.1)$$

subject to the condition of frame indifference

$$W(QF, mQ^T, \theta) = W(F, m, \theta), \quad Q \in SO(3), \qquad (3.2)$$

and material symmetry

$$W(FP, m, \theta) = W(F, m, \theta), \quad P \in \mathcal{P}, \qquad (3.3)$$

where $\mathcal{P}$ is a crystallographic point group.

Requiring $W$ to depend on the deformation gradient $F = \nabla y$ and magnetization $m$ but not on $\nabla^2 y$ and $\nabla m$ indicates that any energy associated with mechanical twin walls and Bloch walls is neglected. In this formulation, there may be infinitely fine twins or infinitely fine magnetic domains, as we have suggested earlier. Since on a macroscopic level the materials of interest display highly mobile domain configurations, any wall energies need be very small. The analytical benefit is that in the limit of infinite fineness we are able to determine rather accurately the arrangement and location of variants within the material, although not their dimensions.

Let $y$ denote the spatial variable and $H$ and $M$ denote the magnetic field and the magnetization (dipole moment per unit volume), respectively. In the spatial configuration, Maxwell's equations hold. In addition, the material is magnetically saturated. For an appropriate choice of units, and introducing $U(y)$ as a potential for $H$,

$$\operatorname{div}_y ( -\nabla_y U + M ) = 0 \quad \text{in } \mathbb{R}^3, \qquad (3.4)$$

and the field energy density is given by

$$\tfrac12 |H|^2 = \tfrac12 |\nabla_y U|^2.$$

The saturation constraint leads to

$$\left| \frac{M}{\rho} \right| = f(\theta) \quad \text{in the body}, \qquad (3.5)$$

where $\rho$ is the density.

The domain $\Omega$ is interpreted as an undistorted single crystal above the Curie temperature. By an abuse of notation, let $y(x)$ denote the deformation of $\Omega$ to $y(\Omega)$, assumed for the purposes of discussion to be 1:1. Since $\rho(x) = 1/\det \nabla y(x)$, the magnetization per unit mass previously introduced is $m = \det \nabla y \, M$, so the constraint (3.5) assumes the form

$$|m| = f(\theta).$$

We assume $f(\theta) = 1$, without loss of generality.

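In this fashion we may write the virtual energy of the configuration $y = y(x)$, $m = m(x)$ in the mixed reference/spatial form

$$E(y,m) = \int_\Omega W(\nabla y, m, \theta) \, dx + \frac12 \int_{\mathbb{R}^3} |\nabla_y U|^2 \, dy \qquad (3.6)$$

subject to the constraints

$$\operatorname{div}_y \Big( -\nabla_y U + \frac{1}{\det \nabla y} \, m \Big) = 0 \quad \text{in } \mathbb{R}^3, \qquad |m| = 1 \quad \text{in } y(\Omega). \qquad (3.7)$$

From (3.7), we may also write the energy in the form

$$E(y,m) = \int_\Omega W(\nabla y, m, \theta) \, dx + \frac12 \int_{y(\Omega)} \frac{1}{\det \nabla y} \, m \cdot \nabla_y U \, dy. \qquad (3.8)$$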

Both for computational and analytical reasons, it is useful to express this in terms of reference variables alone. For this, introduce $u(x) = U(y(x))$, so $\nabla u(x) = \nabla_y U(y(x)) F(x)$, $F(x) = \nabla y(x)$. With $C = F^T F$, the constraint equation (3.7) becomes

$$\operatorname{div} ( -\nabla u \, C^{-1} \det F + m F^{-T} ) = 0 \quad \text{in } \mathbb{R}^3, \qquad (3.9)$$

and the saturation condition is simply

$$|m| = 1 \quad \text{in } \Omega. \qquad (3.10)$$

The virtual energy of $y = y(x)$, $m = m(x)$ in reference form is

$$E(y,m) = \int_\Omega W(\nabla y, m, \theta) \, dx + \frac12 \int_{\mathbb{R}^3} \nabla u \, C^{-1} \cdot \nabla u \, \det F \, dx, \qquad (3.11)$$

subject to (3.9) and (3.10). Analogous to (3.8), we may also write (3.11) as

$$E(y,m) = \int_\Omega W(\nabla y, m, \theta) \, dx + \frac12 \int_\Omega \nabla u \cdot m F^{-T} \, dx. \qquad (3.12)$$
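For the reader's convenience, the passage from the spatial to the reference form of the field energy can be written out in two lines. This is a sketch of the change of variables only, assuming the row vector convention suggested by (3.2), $\nabla_y U = \nabla u \, F^{-1}$, and $dy = \det F \, dx$; it is not taken from the original account.

```latex
\begin{aligned}
|\nabla_y U|^2 \, dy
  &= (\nabla u \, F^{-1}) \cdot (\nabla u \, F^{-1}) \, \det F \, dx
   = \nabla u \, F^{-1} F^{-T} \cdot \nabla u \, \det F \, dx
   = \nabla u \, C^{-1} \cdot \nabla u \, \det F \, dx, \\
\frac{1}{\det \nabla y} \, m \cdot \nabla_y U \, dy
  &= m \cdot (\nabla u \, F^{-1}) \, dx
   = \nabla u \cdot m F^{-T} \, dx .
\end{aligned}
```

The first line turns the field term of (3.6) into that of (3.11), using $C^{-1} = F^{-1} F^{-T}$; the second turns the field term of (3.8) into that of (3.12).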

The symmetry condition (3.3) induces a potential well structure on $W$. The arrangement of these potential wells determines the possible fine structure. Our schema for understanding this well structure begins by choosing for $\mathcal{P}$ the symmetry group of a putative high temperature nonmagnetic parent phase of the material. For example, in the case we shall consider here, $\mathcal{P}$ is the cubic group of order 24: relative to a cubic basis, these are the proper orthogonal matrices of the form $P = (p_{ij})$, $p_{ij} = \pm 1$ or $0$. This is the appropriate assumption for TbDyFe2. For $\theta < \theta_0$, we assume there exists a pair $(U_1, m_1)$ with $|m_1| = 1$ and $U_1 = U_1^T$ positive definite satisfying

$$W(U_1, m_1, \theta) \le W(F, m, \theta) \quad \text{for } F \in M, \ |m| = 1. \qquad (3.13)$$

Generally, $U_1$ and $m_1$ depend on temperature. The conditions (3.2) and (3.3) imply the existence of other minima by (2.9). Assume that the full set of minima is determined by the orbits of $(U_1, m_1)$ under these actions. Thus

$$\inf W = W(R U_1 H, m_1 R^T, \theta) < W(F, m, \theta) \quad \text{for } R \in SO(3), \ H \in \mathcal{P}, \text{ and } F \in M, \ |m| = 1, \text{ with } (F, m) \ne (R U_1 H, m_1 R^T). \qquad (3.14)$$

The potential wells are described as

$$(R U_1, m_1 R^T), \quad R \in SO(3),$$

$$(R U_2, m_2 R^T), \quad R \in SO(3),$$

$$\vdots$$

$$(R U_n, m_n R^T), \quad R \in SO(3),$$

where

$$\{ (U_1, m_1), (U_2, m_2), \dots, (U_n, m_n) \} = \{ (Q U_1 Q^T, m_1 Q^T) : Q \in \mathcal{P} \}.$$

An orbit of the form $(R U_i, m_i R^T)$, $R \in SO(3)$, will be called a variant by analogy to martensitic transformations.

Our idea of a variational principle is to find a pair $(y, m)$ such that

$$E(y,m) = \inf \{ E(\eta, \mu) : (\eta, \mu) \text{ subject to (3.9)} \}.$$

However, in our situation, with the material, in essence, uniaxial, this will not be possible. Instead we must content ourselves with this result, whose verification relies on an explicit construction:

$$\inf E = \min W \, |\Omega|. \qquad (3.15)$$


4.  The  variational  context 

4.1  The  variational  context:  energetics 

Consider the minimization question associated to (3.8) subject to (3.9). By choosing a special sequence of magnetizations, one may show that

$$\inf E(y,m) = \min W \, |\Omega|, \qquad (4.1)$$

as discussed at the end of §3. However, because of the competition between the field energy and the stored energy, there cannot be any pair $(y^*, m^*)$ with $y^*$ affine and

$$E(y^*, m^*) = \min W \, |\Omega|. \qquad (4.2)$$

We are led in this manner to consider a sequence of deformation fields and magnetization fields $(y^k, m^k)$ subject to (3.9) for which (dependence on $\theta$ suppressed)

$$E(y^k, m^k) \to \min W \, |\Omega| \qquad (4.3)$$

and

$$\nabla y^k \rightharpoonup \nabla y \quad \text{and} \quad m^k \rightharpoonup m,$$

where the convergence is in the sense of (2.4), or equivalently, (2.7).

The only way for (4.3) to occur is if

$$W(\nabla y^k, m^k) \to \min W \quad \text{and} \quad \frac12 \int_{\mathbb{R}^3} |\nabla_y U^k|^2 \, dy \to 0. \qquad (4.4)$$

Since

$$W(\nabla y^k, m^k) \rightharpoonup \bar W(x), \quad x \in \Omega, \qquad \bar W(x) = \int_{M \times S^2} W(A, \mu) \, d\nu_x(A, \mu),$$

we must have that the set of $(A, \mu)$ charged by $\nu$, that is, the support of the measure $\nu$, is contained in the minimum energy wells described in §3 following (3.14). In analytical terms we write

$$\operatorname{supp} \nu \subset \{ (A, \mu) : W(A, \mu) = \min W \} = \Sigma. \qquad (4.5)$$

In addition, (4.4) provides via the constraint equation in (3.9) that

$$\operatorname{div}_y \frac{1}{\det \nabla y^k} \, m^k \to 0 \quad \text{in } H^{-1}(\mathbb{R}^3). \qquad (4.6)$$

(4.5) and (4.6) place severe constraints on the possible forms of $\nabla y$ and $m$.

4.2 The variational context: kinematics

An easy integration by parts shows that if $(y^k)$ is a sequence of deformation fields with bounded derivatives, then for any minors $M(\nabla y^k)$ of the matrices $\nabla y^k$,

$$M(\nabla y^k) \rightharpoonup M(\nabla y)$$

in the sense of (2.4), that is, in the weak* sense. Thus minors are special functions $\psi(A)$ which are continuous with respect to this convergence. They are, of course, the null-Lagrangians. The Young measure relation also holds. So, in the present situation, combining (4.5) with the Young measure representation gives


$$\nabla y(x) = \int_{M \times S^2} A \, d\nu_x(A, \mu), \qquad (4.7)$$

$$\operatorname{adj} \nabla y(x) = \int_{M \times S^2} \operatorname{adj} A \, d\nu_x(A, \mu), \quad \text{and} \qquad (4.8)$$

$$\det \nabla y(x) = \int_{M \times S^2} \det A \, d\nu_x(A, \mu), \qquad (4.9)$$

where adj A stands for the classical adjoint of A and det A stands for the determinant of A. Formula (4.7) is simply a restatement of (2.4) in this case and is included to provide a complete list of null-Lagrangians. We refer to (4.7) - (4.9) as the minors relations.

These relations place extremely strong restrictions on the nature of possible equilibrium configurations because they assert that the limit statistics of equilibrium configurations must be compatible with the potential well structure of the macroscopic bulk energy.

It is worthwhile pointing out that for the special case of an infinitely fine laminate supported on two deformation gradients $M_1$ and $M_2$, that is,

$$\int \psi(A) \, d\nu_x(A, \mu) = (1 - \theta) \psi(M_1) + \theta \psi(M_2), \qquad (4.10)$$

for some $\theta$, $0 < \theta < 1$, the minors relations imply that

$$M_2 - M_1 = a \otimes n = \text{rank one}, \qquad (4.11)$$

and the $\{ M_i \}$ may represent the deformation gradients of twin related variants with normal $n$. A sequence of deformations which determines (4.10) with $\theta = \tfrac12$ is given by

$$\nabla y^k(x) = M_1 + \tfrac12 ( 1 + f^k(x \cdot n) ) \, a \otimes n, \quad x \in \Omega, \qquad (4.12)$$

where $f^k(t)$ is defined in (2.3).

Analogous formulas hold for any problem in thermoelasticity, but in magnetostriction we also have a relation about magnetization owing to (3.12). This relation is most useful in reference coordinates. Recall that

$$m(x) = \int_{M \times S^2} \mu \, d\nu_x(A, \mu). \qquad (4.13)$$
Figure 3. The phase transition in terfenol (TbDyFe2).

The new relation is that

$$m(x) \nabla y(x)^{-T} = \int_{M \times S^2} \mu A^{-T} \, d\nu_x(A, \mu), \qquad (4.14)$$

with

$$\operatorname{div} ( m (\nabla y)^{-T} ) = 0 \quad \text{in } \mathbb{R}^3. \qquad (4.15)$$
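The role of the rank one condition (4.11) in the laminate (4.10) can be illustrated numerically. The sketch below is an illustration only: the matrices, the vectors `a` and `n`, and the helper names are hypothetical values chosen here. It compares the determinant of the average gradient with the average of the determinants, which is what (4.9) requires of a two-gradient laminate. When $M_2 - M_1$ is rank one the two agree; for the rank two perturbation used as a control they do not.

```python
def det3(m):
    """Determinant of a 3 x 3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def combine(A, B, s, t):
    """Entrywise s*A + t*B."""
    return [[s * A[i][j] + t * B[i][j] for j in range(3)] for i in range(3)]

def outer(a, n):
    """Tensor product a (x) n."""
    return [[a[i] * n[j] for j in range(3)] for i in range(3)]

# Hypothetical data, chosen only for illustration.
M1 = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.2], [0.0, 0.0, 1.0]]
a, n = [0.05, -0.02, 0.03], [1.0, 0.0, 0.0]
M2 = combine(M1, outer(a, n), 1.0, 1.0)                    # M2 - M1 = a (x) n, rank one
rank_two = [[0.05, 0.0, 0.0], [0.0, 0.05, 0.0], [0.0, 0.0, 0.0]]
M2_bad = combine(M1, rank_two, 1.0, 1.0)                   # M2_bad - M1 has rank two

def det_gap(A, B, theta):
    """det of the laminate average minus the average of the dets, cf. (4.9) and (4.10)."""
    average = combine(A, B, 1.0 - theta, theta)
    return det3(average) - ((1.0 - theta) * det3(A) + theta * det3(B))

theta = 0.5
gap_rank_one = det_gap(M1, M2, theta)
gap_rank_two = det_gap(M1, M2_bad, theta)
print(gap_rank_one, gap_rank_two)
```

The check covers only the determinant; the adjugate relation (4.8) can be tested the same way.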

4.3  Application 

In Terfenol-D, onset of ferromagnetism is associated with a stretch of the unit cell along a main diagonal parallel to the magnetization. In simplified kinematics, we find a pair $(U_1, m_1)$ satisfying (3.13) provided by

$$U_1 = 1 + \varepsilon \, m_1 \otimes m_1 \quad \text{and} \quad m_1 = \tfrac{1}{\sqrt3} (1,1,1), \quad |\varepsilon| \text{ small}, \qquad (4.1)$$

for a suitable choice of coordinates. The other potential wells are determined by $U_j = 1 + \varepsilon \, m_j \otimes m_j$, $j = 2, 3, 4$, with

$$m_2 = \tfrac{1}{\sqrt3} (-1,1,1), \quad m_3 = \tfrac{1}{\sqrt3} (1,-1,1), \quad m_4 = \tfrac{1}{\sqrt3} (1,1,-1). \qquad (4.2)$$

Since $-m_j$ is also an admissible magnetization, there are a total of eight potential wells. We regard the coordinates chosen so that this represents the lower laminate in Figure 2. The upper laminate is obtained from it by a rotation about the $m_1$ axis. Note that this is not a symmetry operation of the original energy and, although holding invariant the well of $(U_1, m_1)$, gives a different set of wells. To save space, here we discuss only the lower laminate. To properly treat the entire system, we must introduce an inhomogeneous energy $W(F, m, \theta, x)$, $x \in \Omega$, cf. [32].

To establish our result we wish to check that we may produce a minimizing sequence $(y^k, m^k)$ for the energy $E(y,m)$ with the potential well structure determined by (4.1) and (4.2) of this subsection whose statistics, as determined by the "minors relations" and their generalizations, (4.7) - (4.9), (4.14), and (4.15), deliver the observed crystallographic data, for example, of the lower laminate of Figure 2. We are able to do this using the wells determined by $(U_1, m_1)$ and $(U_2, m_2)$.

Given any pair of transformation strains

$$U_1 = 1 + \varepsilon \, \xi_1 \otimes \xi_1 \quad \text{and} \quad U_2 = 1 + \varepsilon \, \xi_2 \otimes \xi_2, \qquad |\xi_i| = 1, \ \xi_1 \text{ and } \xi_2 \text{ independent},$$

then the type I and type II twins (or twins and reciprocal twins) have normals

$$n^+ = \xi_1 + \xi_2 \quad \text{and} \quad n^- = \xi_1 - \xi_2. \qquad (4.3)$$

There are rotations $R^\pm(\varepsilon)$ and vectors $a^\pm(\varepsilon)$ with

$$U_1 = R^\pm U_2 ( 1 + a^\pm \otimes n^\pm ).$$

In this case with $\xi_i = m_i$, $n^+ = (100)$ and $n^- = (011)$, in agreement with the observations of D. Lord [44,53].
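A quick numerical check of the twin normals is possible using only a necessary condition for the relation $U_1 = R U_2 (1 + a \otimes n)$: every vector $v$ orthogonal to $n$ must satisfy $|U_1 v| = |U_2 v|$, since $(1 + a \otimes n) v = v$ and $R$ is a rotation. The sketch below is illustrative; the value of `eps`, the helper names, and the control direction are assumptions made here. With $\xi_i = m_i$ taken from (4.2), the sum and the difference of $m_1$ and $m_2$ are parallel to (011) and (100) respectively, so the two directions named in the text are recovered as a pair.

```python
import math
import random

def unit(v):
    length = math.sqrt(sum(c * c for c in v))
    return [c / length for c in v]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def norm(v):
    return math.sqrt(dot(v, v))

def stretch(m, eps):
    """U = 1 + eps * m (x) m as nested lists."""
    return [[(1.0 if i == j else 0.0) + eps * m[i] * m[j] for j in range(3)]
            for i in range(3)]

def apply(U, v):
    return [sum(U[i][j] * v[j] for j in range(3)) for i in range(3)]

eps = 0.01                                  # hypothetical small transformation strain
m1 = unit([1.0, 1.0, 1.0])
m2 = unit([-1.0, 1.0, 1.0])
U1, U2 = stretch(m1, eps), stretch(m2, eps)

n_sum = [x + y for x, y in zip(m1, m2)]     # parallel to (0, 1, 1)
n_diff = [x - y for x, y in zip(m1, m2)]    # parallel to (1, 0, 0)

def max_length_mismatch(n, trials=100, seed=0):
    """Largest | |U1 v| - |U2 v| | over random vectors v orthogonal to n."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        v = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        c = dot(v, n) / dot(n, n)
        v = [vi - c * ni for vi, ni in zip(v, n)]   # project v onto the plane normal to n
        worst = max(worst, abs(norm(apply(U1, v)) - norm(apply(U2, v))))
    return worst

mismatch_sum = max_length_mismatch(n_sum)
mismatch_diff = max_length_mismatch(n_diff)
mismatch_control = max_length_mismatch([0.0, 1.0, 0.0])   # a plane that is not a twin plane
print(mismatch_sum, mismatch_diff, mismatch_control)
```

The condition tested is necessary, not sufficient; it does not construct the rotations $R^\pm$ or the vectors $a^\pm$.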

A coherent laminate may be constructed from the deformation gradients $U_1$ and $R^+ U_2$ or from the deformation gradients $U_1$ and $R^- U_2$, cf. also (4.10) - (4.13). We may construct a compatible sequence of magnetizations $m^k$ with $m^k = \pm m_1$ in the $U_1$ regions and $m^k = \pm m_2 (R^+)^T$ in the $R^+ U_2$ regions with the property that the limit average $m = 0$ so that

$$\lim E(y^k, m^k) = \min W \, |\Omega|,$$

cf. Figure 4 below.


Figure 4. The equilibrium microstructure of a laminate with parameters predicted by the theory. The gray arrows represent directions of the magnetization within the mechanical layers. In the $U_1$ layers they are $\pm m_1$, where $m_1$ is a $\langle 111 \rangle$ direction, and in the shaded layers they are $\pm m_2 (R^+)^T$, where $m_2$ is a $\langle -111 \rangle$ direction.

It is possible to deduce, moreover, that the only magnetization distributions consistent with the mechanical laminate have $m = 0$. One may explicitly write a Young measure solution

$$\nu_x = \frac{1 - \lambda}{2} \big( \delta_{(U_1, m_1)} + \delta_{(U_1, -m_1)} \big) + \frac{\lambda}{2} \big( \delta_{(R^+ U_2, m_2 (R^+)^T)} + \delta_{(R^+ U_2, -m_2 (R^+)^T)} \big), \quad x \in \Omega,$$

where $0 < \lambda < 1$.

Our analysis suggests, however, that Figure 4 above is not the only solution and need not be the only one the laboratory photographs show either. A laminate may also be realized with deformation gradients $U_3$ and $R U_4$ which has the same appearance on an $(01\bar{1})$ plane. Note that $m_3 + m_4 \parallel (100)$. This configuration has the property that it is exactly compatible across the (111) plane, whereas compatibility of the $U_1$ and $R U_2$ laminate is only in the fine structure limit and requires $\lambda$ constant. Interestingly, the fine structure laminate might display greater magnetostriction.

The computation of configurations is underway by Ma, who has successfully reproduced hysteretic behavior in linearly magnetostrictive models. Previously, Luskin and Ma, [45], studied the rigid ferromagnet.

5. Evolution

Evolution problems for potentials which are not convex may be considered within this framework. The basics of an existence theory were given in [40] and have been significantly advanced by Demoulini and Walkington, in work which has not been published. Walkington, in particular, has adapted a method for computing solutions of the Young-Zermelo problem to evolution, for reasons which will become clear momentarily.

Consider $\varphi$ as in Figure 1, a scalar valued potential for example, and ask for a solution of the problem, with $q(\lambda) = \varphi_\lambda(\lambda)$,

$$-\operatorname{div} q(\nabla u) + \frac{\partial u}{\partial t} = 0 \ \text{ in } \Omega \times (0, \infty), \qquad u|_{\Omega \times \{0\}} = u_0, \qquad u|_{\partial\Omega \times (0, \infty)} = 0. \qquad (5.1)$$

A classical solution need not exist because the equation may be backward parabolic in some regions, but we may seek a Young measure solution along the lines we have been discussing. We find this solution by adapting an implicit scheme.


The functional

$$I(v) = \int_\Omega \Big( \varphi(\nabla v) + \frac{1}{2h} (v - w)^2 \Big) \, dx, \quad h > 0, \qquad (5.2)$$

is only slightly different from (2.10). Given $h > 0$, set $t_k = kh$ and $u^{h,0} = u_0$. Solve iteratively for Young measures $\nu^{h,k}$ and underlying deformations $u^{h,k}$ by the minimization procedure

$$\int_\Omega \Big( \varphi(\nabla v) + \frac{1}{2h} (v - u^{h,k-1})^2 \Big) \, dx \to \min, \qquad (5.3)$$

where the competing $v \in H^1_0(\Omega)$, for example. The $\nu^{h,k}$ and $u^{h,k}$ satisfy

$$\int_\Omega \Big( \bar\varphi + \frac{1}{2h} (u^{h,k} - u^{h,k-1})^2 \Big) \, dx = \min, \qquad (5.4)$$

$$\bar\varphi(x) = \int_{\mathbb{R}^N} \varphi(\lambda) \, d\nu_x^{h,k}(\lambda), \quad \text{with}$$

$$\operatorname{supp} \nu_x^{h,k} \subset \{ \lambda : \varphi(\lambda) = \varphi^{**}(\lambda) \}, \qquad (5.5)$$

where $\varphi^{**}$ denotes the convexified $\varphi$, which is its relaxation in this situation. Moreover, it is possible to show that

$$-\operatorname{div} \bar q^{h,k} + \frac{1}{h} (u^{h,k} - u^{h,k-1}) = 0, \quad \text{where} \qquad (5.6)$$

$$\bar q^{h,k}(x) = \int_{\mathbb{R}^N} q(\lambda) \, d\nu_x^{h,k}(\lambda).$$

We next assemble the $\nu^{h,k}$ and $u^{h,k}$, defining $u^h$ to be the linear interpolant of the $(u^{h,k})$ and $\nu^h$ the piecewise constant in time measure equal to $\nu^{h,k}$ in $(k-1)h < t \le kh$. We obtain in this fashion a family of approximants which are "maximally dissipative" because of (5.5) and converge to a Young measure solution of (5.1).
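The support condition (5.5) is easy to visualize for a concrete potential. The sketch below assumes, purely for illustration, the double well $\varphi(\lambda) = (\lambda^2 - 1)^2$ in one variable; the text does not specify $\varphi$, and the function names are hypothetical. For this choice the convexification vanishes between the wells and coincides with $\varphi$ outside them, so the measures in (5.5) can only charge gradients with $|\lambda| \ge 1$.

```python
def phi(lam):
    """Illustrative double well potential with wells at -1 and +1 (an assumed form)."""
    return (lam * lam - 1.0) ** 2

def phi_relaxed(lam):
    """Convexification phi** of the phi above: zero between the wells, phi outside."""
    return 0.0 if abs(lam) <= 1.0 else phi(lam)

def q_relaxed(lam):
    """Derivative of phi**, a monotone flux that vanishes between the wells."""
    return 0.0 if abs(lam) <= 1.0 else 4.0 * lam * (lam * lam - 1.0)

grid = [-2.0 + 0.05 * i for i in range(81)]

# phi** never exceeds phi, and it is midpoint convex on the sample grid.
below_phi = all(phi_relaxed(x) <= phi(x) for x in grid)
midpoint_convex = all(
    phi_relaxed(0.5 * (x + y)) <= 0.5 * (phi_relaxed(x) + phi_relaxed(y)) + 1e-12
    for x in grid for y in grid)

# Grid points where phi and phi** agree: the only possible support in (5.5).
contact_set = [x for x in grid if abs(phi(x) - phi_relaxed(x)) < 1e-12]
print(below_phi, midpoint_convex, min(abs(x) for x in contact_set))
```

Gradients strictly between the wells are therefore never observed in the microstructure; they appear only as averages of the values $\pm 1$.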
Figure 5. The relaxation $\varphi^{**}$ of $\varphi$ plays an important role in the solution of the problem. Where $\varphi$ is different from $\varphi^{**}$ is shown in dotted lines.

Figure 6  Computation of the


It turns out that the solution $u^{h,k}$ of the basic variational principle is the unique solution of the relaxed problem for this principle, which is strictly convex in $v$, but not in $\nabla v$. Some convex analysis then tells us that $u^h$ is unique and so is its limit $u$ as $h \to 0$. The Young measure need not be unique.

This is an opportunity to study the pure generation of oscillations in a new context. The underlying solution is known. What statistics are possible for the sequences which generate this solution, and can we compute them? How do they evolve in time? Using an algorithm specifically designed to compute Young measures by Nicolaides and Walkington, Walkington has computed several examples of this behavior, shown in Figure 6. Although in these pictures the solution decreases monotonically to its limit

$$u(x, \infty) = \begin{cases} x, & 0 < x \le 0.5, \\ 1 - x, & 0.5 < x < 1, \end{cases}$$

this is not the case for general initial values $u(x,0)$, although the energy integral is a decreasing function of $t$. We refer also to their article in these proceedings.


The  author  wishes  to  thank  his  colleagues  and  coworkers,  especially  R.  James,  R. 
Nicolaides,  P.  Pedregal,  and  Noel  Walkington,  for  their  assistance  and  advice  in  preparing  this 
account. 
ABSTRACT. Various finite difference schemes were presented at the Ninth Army Conference on Applied Mathematics and Computing, held at the University of Minnesota Super Computing Center. It is of interest to know where the classical finite element method stands against its counterpart. Towards this goal, the finite difference equivalent is derived by the use of Gurtin's variational principle for linear initial value problems, introduced during the mid 1960s, and finite element synthesis. The body is divided into finite elements. A linear temperature distribution within each element and a linear temperature variation within each time step are allowed, as in the classical finite element method. The resulting equation is found to be the same as one of the finite difference equations derived earlier, i.e., equal weights at all nodes of a 27-node element except at the center. This is not the best one as far as accuracy, stability and nonoscillation characteristics are concerned. A question is now posed to the readers about the consequences of liberalizing the two basic assumptions of the classical finite element method.

INTRODUCTION.  The  motivation  for  this  task  comes  from  the  needs  of 
another  project,  aerodynamic  heating  of  hypervelocity  projectiles. 

The  transient  three  dimensional  heat  conduction  model  will  provide  a 
means  to  determine  the  temperature  distribution  as  a  function  of 
location  (three  dimensions)  and  time  for  any  given  initial  and 
boundary  conditions.  The  boundary  conditions  are  usually  obtained 
from  computational  fluid  dynamics  models.  There  are  occasions  where 
there  is  a  strong  coupling  between  convective  flow  and  conduction. 

No matter what method is utilized, all require large computer storage and a great amount of computer time. The solution process is not only subject to these restrictions but is also bound to blow up midway if the numerical techniques are not properly selected and accuracy, stability and nonoscillation characteristics are not taken into account. If a combined convection and conduction problem is attempted in one step, a failure in one area can lead to a losing proposition in both areas. Therefore, a search is initiated to find an accurate, robust and efficient numerical scheme for the solution of transient three dimensional heat conduction problems.


♦College  of  Engineering,  Rutgers  University,  New  Brunswick,  NJ 




VARIATIONAL PRINCIPLE. Wilson and Nickel [1], following Gurtin's [2] discussion of variational principles for linear initial value problems, confirmed that the function $T(x,y,z,t)$ which leads to an extremum of the functional

$$\frac12 \int_V \{ \rho C_p \, T * T + \nabla T * K * \nabla T - 2 \rho C_p \, T_0 * T \} \, dV - \int_S \hat Q_i n_i * T \, dS$$

is the solution of the following transient heat-conduction equation:

$$(K * T_{,i})_{,i} - \rho C_p T + \rho C_p T_0 = 0$$

with the boundary condition $K * T_{,i} - \hat Q_i = 0$,

where

$T(x,y,z,t)$ = temperature at the spatial point $(x,y,z)$ and at time $t$

$T_0$ = initial temperatures

$\nabla T$ = gradient of $T$ with respect to spatial coordinates

$K$ = thermal conductivity

$\rho$ = material density

$C_p$ = heat capacity of the material per unit mass

$\hat Q_i(t) = \int_0^t Q_i(x,y,z,\tau) \, d\tau$

$V$ = volume

$*$ = convolution symbol, defined as

$$T * T = \int_0^t T(x,y,z,t-\tau) \, T(x,y,z,\tau) \, d\tau,$$

$$\nabla T * \nabla T = \frac{\partial T}{\partial x} * \frac{\partial T}{\partial x} + \frac{\partial T}{\partial y} * \frac{\partial T}{\partial y} + \frac{\partial T}{\partial z} * \frac{\partial T}{\partial z}.$$
FINITE ELEMENT SYNTHESIS. Divide the three dimensional solid body into I axial elements (nodes 0 to I in the x-direction), J transverse elements (nodes 0 to J in the y-direction), and K normal elements (nodes 0 to K in the z-direction) such that the step sizes are the same in all three directions. This restriction is introduced to simplify the algebraic manipulations involved in the analysis. Also, a unit step size is assumed to simplify the derivations; the result is generalized later.

Consider the nodal point (i,j,k), in the range 0 < i < I, 0 < j < J, 0 < k < K, as shown in Figure 1. The temperature of the nodal point will vary as a function of time, t. The temperature distribution in a subregion is a function of the spatial coordinates (x,y,z) and the surrounding nodal point temperatures. For simplicity, linearity and the same functional distribution are assumed for all elements. The functions $f_1, f_2, f_3, f_4, f_5, f_6, f_7$ and $f_8$ are functions of nodal point temperatures. These are determined by substitution of the coordinates of the nodal points into the equation and by solving the resulting simultaneous equations. The results for region II are given below.
Region II:

$$T(x,y,z) = f_1 + f_2 x + f_3 y + f_4 z + f_5 xy + f_6 xz + f_7 yz + f_8 xyz$$
Where the subscripts + and - denote nodes one step ahead and one step behind, respectively. Once the temperature distribution is obtained, one can derive the first order partial derivatives from the above equation. A similar set of equations can be derived for the remaining seven regions of the three dimensional finite element discussed above. The next step is to substitute all the equations derived into all three terms of the governing functional, integrate over the volume occupied by region II, and take the first variation with respect to the nodal temperature at the new time in order to obtain the extremum of the functional. Similar results are obtained for the other regions and combined.
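The step of determining $f_1, \dots, f_8$ from the nodal temperatures can be sketched for a single unit cube. The code below is an illustration under stated assumptions: the cube is placed with one corner at the origin, corners are indexed by $(x,y,z)$ with each coordinate 0 or 1, and the nodal temperatures are hypothetical numbers; the paper's own placement of region II and its +/- node labels are not reproduced here. The coefficients are obtained by substituting the eight corner coordinates into the trilinear form and solving the resulting equations in closed form.

```python
from itertools import product

def trilinear_coefficients(T):
    """Coefficients f1..f8 of T = f1 + f2 x + f3 y + f4 z + f5 xy + f6 xz + f7 yz + f8 xyz.

    T maps each corner (x, y, z) of the unit cube, coordinates in {0, 1},
    to its nodal temperature.
    """
    f1 = T[0, 0, 0]
    f2 = T[1, 0, 0] - T[0, 0, 0]
    f3 = T[0, 1, 0] - T[0, 0, 0]
    f4 = T[0, 0, 1] - T[0, 0, 0]
    f5 = T[1, 1, 0] - T[1, 0, 0] - T[0, 1, 0] + T[0, 0, 0]
    f6 = T[1, 0, 1] - T[1, 0, 0] - T[0, 0, 1] + T[0, 0, 0]
    f7 = T[0, 1, 1] - T[0, 1, 0] - T[0, 0, 1] + T[0, 0, 0]
    f8 = (T[1, 1, 1] - T[1, 1, 0] - T[1, 0, 1] - T[0, 1, 1]
          + T[1, 0, 0] + T[0, 1, 0] + T[0, 0, 1] - T[0, 0, 0])
    return [f1, f2, f3, f4, f5, f6, f7, f8]

def evaluate(f, x, y, z):
    """Value of the trilinear temperature distribution at (x, y, z)."""
    f1, f2, f3, f4, f5, f6, f7, f8 = f
    return (f1 + f2 * x + f3 * y + f4 * z
            + f5 * x * y + f6 * x * z + f7 * y * z + f8 * x * y * z)

def d_dx(f, y, z):
    """First order partial derivative dT/dx, which depends on y and z only."""
    return f[1] + f[4] * y + f[5] * z + f[7] * y * z

# Hypothetical nodal temperatures at the eight corners.
nodal = {c: 300.0 + 10.0 * c[0] + 7.0 * c[1] - 4.0 * c[2] + 2.5 * c[0] * c[1] * c[2]
         for c in product((0, 1), repeat=3)}
coeffs = trilinear_coefficients(nodal)
print(coeffs)
print(evaluate(coeffs, 0.5, 0.5, 0.5), d_dx(coeffs, 0.5, 0.5))
```

The interpolant reproduces every nodal value, and its partial derivatives are the quantities substituted into the gradient term of the functional.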

The procedures for the first and the third terms of the governing functional are the same. However, a double convolution is involved in the second term. Here the evaluation of the second term involves integration not only over the volume (and the use of the first variation with respect to the nodal temperature at the new time) but also over the time step, owing to the additional convolution. Towards this goal, a linear nodal point temperature variation is assumed within each time step.

Summing up the results of all three terms produces the following finite element equivalent:
The superscripts + and o denote the new time and the old time, respectively, for any time step; e is the dimensionless parameter, the Fourier number (thermal diffusivity times the time step, divided by the square of the step size). The left side of this equation represents the weighted average of time derivatives at all nodes of a 27-node finite element system. Of course, the weights are different depending upon the location of the node. The right side of this equation is equivalent to an arithmetic average of the 3-dimensional Laplacian (in a particular finite difference form) at two different times, old and new. This particular finite difference form for representing a Laplacian is not the best one as far as accuracy and nonoscillation characteristics are concerned [3]. The details of the derivation that lead to the finite element equivalent are omitted here due to lack of space and time. However, it is planned to publish them in a journal, similar to the two-dimensional case [4].

One can also rewrite the above finite element equivalent in a form
more familiar to the finite difference community, as shown below. All
unknown nodal point temperatures are arranged on the left side and all
known quantities are moved to the right side of the equation, in the
traditional way. The coefficients, A through H, are different. They
are not given here due to lack of space. However, one can write them
by comparing this equation with the finite element equivalent above.

OTHER FINITE ELEMENT SCHEMES. The classical finite element scheme is
discussed above. A linear temperature distribution in space is
allowed, in addition to a linear temperature variation within each
time-step. Similarly, one can also treat quadratic, cubic, etc.,
variation in space. However, analytical evaluation similar to the
above is almost impossible because of the tedious and extensive
algebraic work and the associated errors involved. The classical
scheme is sometimes referred to in the literature as the h-version
finite element method.

There are three well-established general approaches in the finite
element method: the h-version, the p-version and the h/p-version. In
the h-version, the element approximation is kept fixed and the mesh is
refined, thereby adding more degrees of freedom to improve the
accuracy of the solution. In the p-version, the order of approximation
for each element of a fixed grid is increased, thereby adding more
degrees of freedom to an existing model to improve the accuracy of the
solution. In the h/p-version, mesh refinement and an increase in the
order of approximation of the elements are carried out simultaneously.
The p-version yields a much higher convergence rate and requires very
simple models, which need to be constructed only once. For singular
problems, such as those near freezing or melting interfaces, the rate
of convergence of the p-version is even higher than for smooth
problems. The computational benefits of its hierarchical properties
result in additional savings and efficiency. The inability of the
h-version to correctly simulate singular problem behavior is well
known. In such applications the p-version (or h/p-version) is the only
feasible approach. As a result, this newer technique has found
considerable application in modeling composite structures and large
strains and deformations.

CONCLUSIONS. The hypervelocity projectile is on the DoD's critical
technology list. The task reported here directly supports this
project. Classical finite element analysis is performed on transient
three-dimensional heat conduction problems by the use of a variational
principle and finite element synthesis. The finite element
equivalent derived here appears for the first time in the literature.
The terms on the left and right sides are identified. It is found that
the classical finite element analysis may not be a good choice, based
on the physical interpretation of the terms and on comparison with
other finite difference schemes, especially if accuracy, stability and
nonoscillation characteristics are taken into account. Other finite
element variations, which are superior at least for special problems
such as melting or freezing, are also discussed for readers interested
in pursuing only the finite element methodology.

Abstract

Motion  control  of  mechanical  systems  with  inherent  structural  and  joint  flexibility 
represents  a  class  of  nonlinear  systems  found  in  many  important  applications.  Applications  include 
the  motion  control  of  machines  used  widely  in  weapons,  medicine,  construction,  mining,  and 
manufacturing.  Control  of  the  tip  motion  (e.g.,  the  end  effector  in  the  case  of  a  robot  manipulator 
or  the  active  end  of  a  pointing  system)  in  an  inertial  coordinate  frame  is  the  objective  in  most  such 
applications.  This  is  the  case  because  tasks  are  usually  specified  by  the  desired  motion  trajectory 
of  the  tip  described  inertially.  As  such,  one  needs  to  compute  and  control  the  applied  actuator 
torques  so  that  the  motion  of  the  tip  will  match  the  prescribed  motion.  However,  a  non-minimum 
phase  system  arises  when  one  attempts  to  control  the  motion  of  a  mechanically  compliant  system 
by  using  a  measurement  of  the  tip  to  control  a  non-collocated  actuator. 

The  viability  of  a  delayed  adaptive  inverse  method  for  motion  control  of  the  tip  of  a  flexible 
beam  is  demonstrated  in  this  paper.  This  method  is  based  on  an  adaptive  linear  FIR  filter  which 
provides  a  stable  and  close  approximation  to  the  inverse  plant  dynamics.  Such  FIR  filters  can  be 
used  to  control  non-minimum  phase  systems,  certain  nonlinear  systems,  or  plants  of  unknown 
dynamics  and  can  be  implemented  using  real-time  interrupt  driven  high  performance  computational 
devices  such  as  digital  signal  processing  (DSP)  based  hardware.  A  least  mean  square  (LMS)  error 
minimization  is  used  to  update  the  weight  vector  which  forms  a  delayed,  adaptive  inverse  dynamic 
model of the system. By using an FIR filter to represent the inverse of the plant dynamics, the
instability associated with pole-zero cancellation in the right half of the Laplace plane is avoided
when controlling non-minimum phase systems. Properly selected values of the initial weights and of
the gain constant, $\mu$, guarantee a bounded solution for the computed input torque. The effects of
varying parameters of the delayed adaptive inverse controller (such as the values of the initial
weights, the gain constants, and the length of delay) on the system performance are presented.
Since the method is computationally efficient, it can be used in high bandwidth applications for the
control of complex plants given only the desired and the actual measured outputs for the plant. A
simple feedback controller is used in conjunction with the delayed adaptive inverse controller to
eliminate the residual errors that would occur if using only feedforward control.

The  delayed  adaptive  inverse  controller  was  designed  based  on  a  model  of  the  flexible  beam 
test  bed  located  at  the  Armament  Research,  Development  and  Engineering  Center  (ARDEC).  The 
performance of this adaptive controller is compared with that of an $H_\infty$-based loopshaping
controller. The results indicate that higher performance is achieved when using the adaptive
inverse controller if sensor noise is minimal. If not, the robust $H_\infty$ controller would be
recommended.


1 Work was partially supported by the Army High Performance Computing Research Center and the
Productivity Center at the University of Minnesota.




1.  Introduction 

There  are  many  tasks  in  which  the  precise  motion  control  of  a  point  on  a  moving  structure  is  of 
interest  rather  than  the  actuator's  motion  itself.  Robot  manipulators  are  but  one  example  of  such  a 
system.  Each  robot’s  actuated  link  can  be  considered  to  be  a  motor  driven  beam  in  which  the 
beam’s  tip  motion  must  be  controlled.  For  manipulators,  tasks  are  specified  by  the  desired  motion 
trajectory  of  the  tip.  This  means  that  one  needs  to  compute  and  control  the  applied  torque  at  each 
joint  so  that  the  measured  motion  of  the  tip  will  match  the  prescribed  motion.  Two  problems  arise 
in this case. One, sensors for measuring tip motion have, to date, been severely limited in
bandwidth; and two, given the non-minimum phase nature of such non-collocated systems, the use
of traditional inverse dynamics based control will lead to instabilities (see footnote 2). The
problem in our case is further exacerbated by the nonlinear nature of the plant to be controlled.
Nonlinearities here include the inherent geometric/continuous nonlinearities and discontinuity
effects such as backlash, stiction and friction. An exact Fourier series-based inverse method has
been developed (Yang, 1991), in which good agreement was achieved between the desired and computed
motion. However, the use of such exact inverse dynamics based formulations for the control of
non-minimum phase systems requires prior knowledge regarding the desired motion/path boundary
conditions and can lead to instabilities due to the generation of unstable poles in the controller.

In  order  to  address  the  first  problem  above,  we  have  developed  a  sensor  that  can  be  used  for 
tracking  the  six  degrees  of  freedom  motion  of  an  end  effector.  In  Sorensen  et  al.  (1989),  we 
described  our  first  prototype  which  is  capable  of  high  bandwidth  (480  Hz)  measurement  of  the 
XYZ  coordinates  of  moving  points  on  rigid  or  deformable  bodies.  The  system,  based  on  a  laser 
scanning  approach,  is  designed  in  a  pipeline  configuration  such  that  each  sensed  point's 
coordinates  are  immediately  available  in  registers  and  memory  mapped  into  the  data  acquisition 
CPU.  Three  planes  of  light  rotate  through  the  measurement  field  at  constant  angular  velocity.  By 
measuring  the  elapsed  time  for  rotation  from  fixed  locations  at  the  boundaries  of  the  field  to  each  of 
the moving point targets, one can derive the swept angle to each of the targets and consequently
their  XYZ  coordinates.  In  addition  to  being  able  to  track  each  point  target’s  motion,  each  target  has 
its  own  path  into  the  CPU  thus  facilitating  the  accurate  computation  of  (a)  the  six  degrees  of 
freedom  of  any  number  of  bodies  each  carrying  at  least  three  targets  and  (b)  the  relative  motion  of 
the  instantaneous  axis  of  rotation  between  bodies.  Significant  improvements  have  been 
implemented  as  a  follow-up  to  the  experiments  described  in  Sorensen  et  al.  (1989).  As  a  result, 
such  sensors  can  now  be  used  to  characterize  the  relative  motion  of  joints  (e.g.  transmissions)  and 
structural  elements  even  when  compliance  is  a  major  factor.  It  is  now  therefore  possible  to 
continuously  and  simultaneously  monitor  the  motion  of  one  or  more  mobile  base  platforms,  the 
exact  behavior  of  each  of  the  robot  joints,  their  motors  and  transmission  effects,  the  structural 
deformations  of  the  links,  and  the  full  six  degrees  of  freedom  of  the  end  effector  (or  load)  using 
one  sensing  system.  This  advance  in  technology  makes  it  possible  to  consider  multi-degree  of 
freedom  end  point  control. 

In  this  paper,  we  will  focus  on  a  method  to  handle  the  second  problem  mentioned  earlier,  that 
of  controlling  a  nonlinear  non-minimum  phase  system.  We  will  present  a  method  for  controlling 
the  tip  motion  of  a  flexible  beam.  In  contrast  to  most  inverse  dynamics  methods  proposed  by  other 
researchers,  the  delayed  adaptive  inverse  method  that  we  propose  attempts  to  address  the  problems 
associated  with  the  non-minimum  phase  behavior  of  the  flexible  system  and  provides  a  stable  and 
close  approximation  to  the  inverse  dynamics.  Furthermore,  the  approach  is  conducive  to  the  use  of 
high  speed  Digital  Signal  Processing  (DSP)  hardware.  Most  of  the  early  research  work  on  flexible 
robot manipulators used collocated feedback methods. As such, no direct measurement was made
of the effect of compliance or other unexpected dynamics located between the actuator and the point
of interest (e.g. manipulator tip) under control. Several papers falling into this category are
discussed below.

2 A non-minimum phase system typically arises when one attempts to control the motion of a flexible
structure using a measurement of the tip to control a non-collocated actuator.

Book  et  al.  (1975)  implemented  a  flexible  feedback  control  scheme  (FFC)  based  on  a  pole 
shifting  algorithm.  Siciliano  and  Book  (1988)  applied  the  singular  perturbation  and  integral 
manifold  approaches  to  control  a  single  link  flexible  arm.  To  overcome  the  sensitivity  of  these 
methods  to  plant  parameter  variations,  adaptive  methods  were  investigated.  Nelson  and  Mitra 
(1986)  used  on-line  load  mass  estimation  and  load-adaptive  optimal  control  for  varying  load 
conditions.  Other  adaptive  control  methods  were  also  proposed  by  Meldrum  and  Balas  (1985). 
However, all these adaptive control designs involve a significant computational component, thereby
limiting their application to real-time control. Again, these papers are limited to cases of
collocated feedback.

To obtain better performance for very flexible manipulators, Cannon and his research group
(Cannon and Schmitz, 1984; Schmitz, 1989) experimentally investigated end-point control
techniques for flexible link systems in planar motion. This was the first attempt at non-collocated
control for a flexible beam model of a manipulator using traditional control design methodologies.
Lee and Castelazo (1987) developed a non-collocated sensor-actuator scheme with non-linear
feedback to control a flexible manipulator. However, there were a number of limitations associated
with this latter group of non-collocated techniques, such as the inability to handle joint compliance
and the uncertainty associated with a model of the system.

Research  on  the  control  of  systems  with  joint  flexibility  was  first  initiated  in  the  1980s.  Under 
the  assumption  of  weak  elasticity  at  the  joint,  Ficola  et  al.  (1983)  used  the  singular  perturbation 
approach  to  design  a  feedback  controller.  Khorasani  and  Spong  (1985)  extended  this  work  by 
using  the  invariant  and  integral  manifold  approach.  A  pseudo-linearization  technique  was  proposed 
by  Nicosia  et  al.  (1986)  while  adaptive  control  approaches  were  investigated  by  Tomei  et  al. 
(1986).  All  these  studies  ignored  structural  compliance  and  cannot  readily  be  modified  to 
incorporate  such  compliance. 

Furthermore, the control methods in most of the literature mentioned above were based on an
analysis of the forward dynamics, in which the flexible link (or joint) displacements, velocities and
accelerations are determined by the specified joint torque. An inverse dynamics type of control law
based on the concept of "Computed Torque" was first applied to a cylindrical coordinate arm with
drive train compliance by Forrest-Barlach and Babcock (1986). Other such feedforward methods
followed. Asada and Ma (1989) presented a recursive inverse dynamics analysis based on a virtual
rigid link coordinate system. Tsujio (1988) suggested another approach, in which the driving force
was calculated based on the rigid manipulator, and then applied to a flexible manipulator iteratively
until it converged. None of this latter group of inverse based methods was adaptive, and as such
they are sensitive to parameter uncertainty. Furthermore, none took advantage of tip feedback, and
all suffered from the limitations inherent to systems in which direct measurement of the parameter
under control is not available.

An approach that explicitly deals with the non-minimum phase component of a plant was
described by Tomizuka (1987), who proposed a zero phase error tracking controller (ZPETC) to
solve a tracking control problem. This approach used a feedforward controller to cancel all
closed-loop poles and cancelable closed-loop zeros (zeros on the left-half of the s-plane). For
uncancelable zeros (zeros on the right-half of the s-plane), the feedforward controller cancels the
phase shift induced by them. Since ZPETC is based on pole/zero cancellation and phase cancellation,
the tracking performance using ZPETC is sensitive to modeling errors and plant parameter variations.
To solve the problem with unknown plant parameters and parameter variations, Tsao and
Tomizuka (1987) developed an adaptive ZPETC. The system plant is separated into a known part
(with cancelable poles and zeros) and an unknown part. A normalized least square parameter
adaptation algorithm (PAA) was used to adjust the unknown parameters used in the feedforward
controller. However, in both cases, the results were dependent on having a good model of the
system.

Another approach for designing controllers of non-minimum phase, linear systems is $H_\infty$-based
loopshaping (Doyle et al., 1992). This approach presents a graphical technique for designing a
controller  to  achieve  robust  performance,  i.e.,  performance  in  the  presence  of  uncertainty.  The 
idea  of  loopshaping  is  to  shape  the  complementary  sensitivity  transfer  function,  the  transfer 
function  from  the  reference  signal  to  the  output  of  the  plant,  in  order  to  obtain  the  desired  system 
performance.  The  ideal  loopshape  of  the  complementary  sensitivity  transfer  function  is  represented 
by  a  second  order  system  with  desired  response  characteristics  such  as  the  amount  of  damping  and 
natural  frequency.  This  transfer  function  is  then  incorporated,  as  a  weighting  function,  into  the 
system  model  for  a  controller  design  that  minimizes  the  induced  two  norm  between  the  reference 
signal  and  the  output  of  the  weighting  function. 

In this paper we examine a method based on a delayed adaptive inverse filter which eliminates
the non-minimum phase problem associated with inverse methods by using an FIR filter for
feedforward control. The method is based on the on-line characterization of the inverse dynamics
using an adaptation algorithm in which a delay must be injected to ensure causality. We will
demonstrate the viability of the delayed adaptive inverse method for motion control of flexible
beams. An adaptive linear function whose weighting is adapted by the LMS algorithm was used to
replace the unstable exact inverse model. By using an FIR filter to represent the inverse dynamics,
we avoid the instabilities associated with pole-zero cancellation (only zeros are possible in FIR
filters) when controlling non-minimum phase systems, and we can now take advantage of high speed
computational architecture based on the DSP chip. Furthermore, we may be able to control
complex plants given only the desired and actual output values for the plant, provided that the
control inputs are accessible. Since the method is computationally efficient, it can be incorporated
into closed-loop controllers for position or trajectory control so that the residual errors can be
eliminated (see discussion later). It can also be applied to multi-degree-of-freedom systems.

This  paper  significantly  expands  on  an  earlier  preliminary  presentation  describing  the  method 
(Yang  and  Donath,  1990).  A  general  description  of  the  delayed  adaptive  inverse  model  is  provided 
in  section  2.  The  weight  vector  used  in  this  model  is  updated  using  the  LMS  algorithm,  which  is 
discussed  in  section  3.  The  contents  of  sections  2  and  3  were  presented  in  Yang  and  Donath 
(1990),  but  they  are  included  here  for  completeness.  A  control  strategy  based  on  the  adaptive 
inverse  model  is  described  in  section  4.  The  approach,  as  applied  to  a  model  of  the  flexible  beam 
located  at  the  Armament  Research,  Development  and  Engineering  Center,  is  described  in  section  5. 
A comparison of this adaptive inverse controller with an $H_\infty$-based loopshaping controller is then
discussed  in  section  5. 


2.  A  Delayed  Adaptive  Inverse  Model 

Adaptive filters are widely used in digital signal processing, in fields such as communications,
radar, sonar, and seismology, where a priori information about the data is unknown (Haykin, 1986).
In a stationary process, the algorithm starts with a set of initial conditions and converges to its
optimum solution after successive iterations. In the nonstationary case, the algorithm tracks time
variations in the statistics of the input data for a slowly changing process. The application of
adaptive filters to
control  design  can  be  categorized  into  two  groups:  (i)  the  adaptive  model  control  (AMC)  method; 
and  (ii)  the  adaptive  inverse  control  (AIC)  method.  It  is  difficult  to  control  a  non-minimum  phase 
system  by  using  the  AMC  method,  since  the  control  signal  will  have  a  transform  with  poles  outside 
the  unit  circle  and  will  thus  be  unstable.  However,  a  delayed  AIC  method  can  form  stable 
approximate  inverse  models  without  knowing  a  priori  whether  or  not  the  plant  is  minimum  phase. 

In this paper, we will apply the delayed AIC method to the tip control of flexible beams, a
non-minimum phase system. An adaptive linear combiner, or nonrecursive adaptive filter, is used to
compute the applied joint torque, which, in turn, generates a desired motion. One advantage,
among many, of the adaptive linear combiner is that it is relatively straightforward to analyze and
implement. A schematic diagram of a delayed adaptive inverse model used in the AIC method is
shown in Figure 1, where the terms $u_k$, $d_k$, $n_k$, $y_k$, $v_k$, and $\varepsilon_k$ are
respectively the command input, the command after a delay of $\Delta$ sample periods, the noise
input, the plant output, the output of the inverse plant model, and the error between $d_k$ and
$v_k$. $P(z)$ and $H(z)$ are the z-domain transfer functions of the plant and of the adaptive
inverse model.

This inverse modeling approach, known as adaptive equalization in the communications field,
was first used in the 1960s to counter the effects of interference on communication channels
(Lucky, 1965). It can also be used to produce an inverse model of an unknown plant, which is in
some sense a best fit to the reciprocal of the unknown plant transfer function. The delay of
$\Delta$ samples in Figure 1 is to allow for the delay, or propagation time, through the plant and
the adaptation associated with the inverse modeling. Including such a delay generally results in a
much lower value for the minimum mean-square error and causes the output of the converged adaptive
system, $y_k$, to approximate the input, $u_k$, with a delay of $\Delta$.
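
In transfer-function terms, the role of the delay can be summarized as follows. This is only a
sketch in the notation of Figure 1, assuming zero noise ($n_k = 0$) and a converged filter; the
approximation sign is informal and is not taken from the original derivation.

```latex
% Converged delayed inverse model: the cascade of plant and inverse
% model approximates a pure delay of \Delta samples.
H(z)\,P(z) \approx z^{-\Delta}
\quad\Longleftrightarrow\quad
v_k \approx d_k = u_{k-\Delta}
```

Setting $\Delta = 0$ would ask the filter to approximate $1/P(z)$ itself, which has no stable
causal form when $P(z)$ is non-minimum phase.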

The mathematical description for this model can be expressed in discrete system form as
follows. The input vector of the inverse model, $Y_k$, which represents a window of L values
associated with the kth sample, is the input to the delayed inverse model (assuming $n_k$ is zero).
It can be expressed as:

    (single input)     $Y_k = [\, y_k \;\; y_{k-1} \;\; \dots \;\; y_{k-L+1} \,]^T$;

    (multiple inputs)  $Y_k = [\, y_{1k} \;\; y_{2k} \;\; \dots \;\; y_{Lk} \,]^T$        (1)

for the single input case or for multiple inputs, respectively, and the weight vector, $W_k$,
associated with the kth sample can be expressed as
$W_k = [\, w_{1k} \;\; w_{2k} \;\; \dots \;\; w_{Lk} \,]^T$. The expression for $w_{ik}$
(i = 1, ... L) will depend on the approach used. The elements of the input vector for the inverse
model, $Y_k$, are considered to be L sequential sample inputs from a single input source, or L
simultaneous inputs from L different input sources in the multiple inputs case. The elements,
$y_j$ (where j = k, k-1, ... k-L+1), of the single input vector $Y_k$, are set to zero if subscript
j is equal to or less than zero (i.e., when k < L).

The output of the inverse model, $v_k$, for a linear combination of the input vector, $Y_k$, and
weight vector, $W_k$, is written as

    $v_k = Y_k^T W_k = W_k^T Y_k$        (2)

where, again, the subscript k is used as a time or sample number index. This function is the linear
combiner or weighting function.

Thus the error associated with sample k from Figure 1 is

    $\varepsilon_k = d_k - v_k = d_k - Y_k^T W_k = d_k - W_k^T Y_k$        (3)

and the mean-square error is

    $\xi = E[\varepsilon_k^2] = E[d_k^2] + W_k^T E[Y_k Y_k^T] W_k - 2 E[d_k Y_k^T] W_k$
    $\phantom{\xi} = E[d_k^2] + W_k^T R W_k - 2 P^T W_k$        (4)

where the input correlation matrix is $R = E[Y_k Y_k^T]$, and the cross correlation column vector
is $P = E[d_k Y_k]$.

Our goal is to find the minimum mean-square error for the inverse model. Several adaptive
methods, such as Newton's method and the steepest descent method, can be applied to adjust the
weight vector in order to obtain the minimum mean-square error. The disadvantage of these two
methods is that they usually require off-line gradient estimation or repetition of data in order to
compute $\xi$ in equation (4) and its gradient, which reduces the computation speed for real time
applications. By contrast, the LMS algorithm does not require an estimate of all the above terms,
and computes the weighting function based only on instantaneous error. The details of the LMS
algorithm are described in the next section.
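
For reference, the minimum that these methods seek can be written in closed form. The following is
the standard result for a quadratic error surface, stated under the assumptions (not made explicit
above) that $R$ is nonsingular and that the statistics are stationary; $W^*$ and $\xi_{\min}$ are
names introduced here for illustration only.

```latex
% Gradient of the mean-square error in equation (4) with respect to W
\nabla \xi = 2 R W - 2 P
% Setting the gradient to zero gives the optimal weight vector
W^{*} = R^{-1} P
% and the corresponding minimum mean-square error
\xi_{\min} = E[d_k^2] - P^{T} W^{*}
```

Evaluating $W^*$ directly requires $R$ and $P$, which is exactly the averaging that the LMS
algorithm avoids.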


3.  The  LMS  Algorithm 

In Newton's method or in the steepest descent method, the gradient of $\xi = E[\varepsilon_k^2]$ is
estimated by taking differences between short-term averages of $\varepsilon_k^2$. However, for the
LMS algorithm, $\varepsilon_k^2$ itself is used as an estimate of $\xi_k$ without averaging.

The error in equation (3) is

    $\varepsilon_k = d_k - Y_k^T W_k$        (5)


and the estimated gradient of the mean-square error can be obtained by differentiating equation (5)

    $\hat{\nabla}_k = \begin{bmatrix} \partial \varepsilon_k^2 / \partial w_1 \\ \vdots \\ \partial \varepsilon_k^2 / \partial w_L \end{bmatrix} = 2 \varepsilon_k \begin{bmatrix} \partial \varepsilon_k / \partial w_1 \\ \vdots \\ \partial \varepsilon_k / \partial w_L \end{bmatrix} = -2 \varepsilon_k Y_k$        (6)

For many practical adaptive system applications, the system transfer function is not fully
determined and needs to be measured or estimated based on the sensed data. The slow adaptation
(compared with Newton's method) used in the steepest descent method provides a filter process
which reduces the effects of gradient measurement noise. Using the same slow adaptation as in the
steepest descent method, the LMS algorithm's weighting update function is expressed as


    $w_{i,k+1} = w_{ik} - \mu \hat{\nabla}_{ik} = w_{ik} + 2 \mu \varepsilon_k y_{ik}$    (i = 1, ... L)        (7)

where $y_{ik}$ is the ith element of $Y_k$, and $\mu$ is the gain constant that regulates the speed
and stability of the adaptation and has dimensions of reciprocal input power. Equations (5) and (7)
are a form of the adaptive LMS algorithm proposed by Widrow et al. (1975). It has been pointed out
by Widrow and Stearns (1985) that for the LMS algorithm, the mean of $\varepsilon$ converges to zero
as k approaches infinity provided that the following condition is satisfied:
$0 < \mu < 1/\lambda_{max}$, where $\lambda_{max}$ is the largest element in the diagonal
eigenvalue matrix of R. Increasing the gain constant, $\mu$, tends to reduce the number of
iterations required for the LMS algorithm to reach its steady-state value, but also causes
a corresponding increase in the average square error.
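
The update in equations (5) and (7) is short enough to sketch directly. The example below is a
minimal illustration and not the authors' implementation: the plant coefficients, the white-noise
command, and the values chosen for L, the delay, and the gain constant are all assumptions made for
demonstration, and the function names are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def input_windows(y, n_weights):
    """Rows are Y_k = [y_k, y_{k-1}, ..., y_{k-L+1}], with zeros before k = 0."""
    padded = np.concatenate([np.zeros(n_weights - 1), y])
    return sliding_window_view(padded, n_weights)[:, ::-1]


def lms_delayed_inverse(u, y, n_weights, delay, mu):
    """Adapt an FIR inverse model with the LMS rule of equations (5) and (7)."""
    Y = input_windows(y, n_weights)
    w = np.zeros(n_weights)                      # initial weights (assumed zero)
    err = np.zeros(len(u))
    for k in range(len(u)):
        d_k = u[k - delay] if k >= delay else 0.0    # delayed command d_k
        err[k] = d_k - Y[k] @ w                      # equation (5)
        w = w + 2.0 * mu * err[k] * Y[k]             # equation (7)
    return w, err


# Assumed test case: a non-minimum phase FIR plant (zeros at -2 and -0.5)
# driven by a white-noise command, with no measurement noise.
rng = np.random.default_rng(0)
u = rng.standard_normal(30000)
plant = np.array([1.0, 2.5, 1.0])
y = np.convolve(u, plant)[:len(u)]

L, delay, mu = 8, 4, 0.001

# Sample estimate of R and the resulting bound on the gain constant.
Y = input_windows(y, L)
R = Y.T @ Y / len(y)
mu_max = 1.0 / np.linalg.eigvalsh(R).max()

w, err = lms_delayed_inverse(u, y, L, delay, mu)
final_mse = np.mean(err[-5000:] ** 2)
cascade = np.convolve(plant, w)                  # should resemble a delayed impulse

print("mu upper bound (1/lambda_max):", mu_max)
print("steady-state mean-square error:", final_mse)
print("plant * inverse model:", np.round(cascade, 3))
```

In this sketch the gain constant is kept well below the estimated bound; raising it toward the
bound speeds up convergence at the cost of a noisier steady state, in line with the trade-off
described above.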

Only simple numerical operations (no squaring, averaging or differentiation) are performed in
the LMS algorithm, making this approach computationally more efficient than Newton's method or
the steepest descent method. Since the change in the weight vector at each iteration is based on
imperfect estimates, $\varepsilon_k^2$, without averaging, the results obtained from the LMS
algorithm may include higher frequency components (i.e., noise) and will not be a perfect match
with the optimal (i.e., the case when $\varepsilon = 0$). Usually, the more one wants to attenuate
the noise, the longer the computation time, but given the nature of the LMS algorithm, this is not
a significant factor. The expected residual error can be reduced by using closed loop control.


4.  Control  Based  On  An  Adaptive  Inverse  Model 

Based  on  the  inverse  model  described  in  section  2  with  weights  adapted  by  the  LMS  algorithm, 
we  developed  a  delayed  adaptive  inverse  model  controller.  The  controller  is  designed  to  generate 
the  approximate  inverse  of  the  plant  given  the  desired  and  actual  outputs.  For  a  non-minimum 
phase  system,  some  of  the  plant  transfer  function  zeros  are  located  in  the  right  half  of  the  s-plane  or 
outside  the  unit  circle  in  the  z-plane  for  the  discrete  (or  digital)  case.  In  such  cases,  the  exact 
inverse  model  (or  the  reciprocal  transfer  function)  of  the  plant  will  have  poles  in  the  right  half  of 
the  s-plane  or  poles  outside  the  unit  circle  in  the  z-plane.  Thus,  the  control  input  of  a  plant 
computed  by  using  an  exact  inverse  model  will  always  continue  to  increase  in  magnitude  with  time 
and  be  unbounded.  This  will  lead  to  an  unstable  system  which  will  most  likely  drive  the  system 
into saturation. In order for such an inverse model to be stable, the impulse response with
adaptation weights would need to be left-sided in time, having a non-zero input and output before
the start time (Haykin, 1986). Including a delay lets the adaptive inverse model have a two-sided
impulse response, thus solving the instability problem. A delayed adaptive inverse model (Widrow
and Stearns, 1985), which can be used for control of a non-minimum phase plant or a plant whose
phase characteristics are unknown, as shown in Figure 2, will give an approximate but stable system. This means that the stability of the
controller  can  be  assured  regardless  of  whether  or  not  the  plant  is  minimum  phase. 
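
The instability argument above can be illustrated with a minimal sketch. It assumes a hypothetical first-order discrete plant y[k] = u[k] + 2u[k-1], whose single zero at z = -2 lies outside the unit circle; this plant is not the flexible beam model and is chosen only because its exact causal inverse is easy to write down.

```python
def exact_inverse(reference, b0=1.0, b1=2.0):
    """Causal exact inverse of the hypothetical plant y[k] = b0*u[k] + b1*u[k-1].

    Solving the plant equation for u[k] gives u[k] = (r[k] - b1*u[k-1]) / b0,
    a recursion with a pole at z = -b1/b0 (outside the unit circle here).
    """
    u_prev = 0.0
    control = []
    for r in reference:
        u_k = (r - b1 * u_prev) / b0
        control.append(u_k)
        u_prev = u_k
    return control


# A unit impulse as the desired output: the required control doubles every step.
impulse = [1.0] + [0.0] * 9
control = exact_inverse(impulse)
print(control)
```

The control sequence alternates in sign and doubles in magnitude at every sample, which is the unbounded growth that the delayed (two-sided) inverse is meant to avoid.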

The delayed adaptive inverse model in the right half of Figure 2 is essentially the same as that
shown earlier in Figure 1. The controller version on the left is a copy of the delayed adaptive
inverse model. The weights are computed and updated on-line for real-time implementation of the
delayed adaptive inverse control algorithm. Although the plant transfer function in Figure 2 may
be  entirely  unknown,  certain  information  regarding  the  plant  characteristics  would  be  useful  for 
selecting  the  number  of  weights  and  the  length  of  delay.  The  number  of  weights  should  be 
proportional  to  the  complexity  of  the  plant;  the  higher  the  order  of  the  plant,  the  larger  the  number 
of  weights  should  be.  The  delay  should  approximate  the  propagation  time  from  the  input  of  the 
plant to the output of the delayed adaptive inverse model. A general rule of thumb is to set the
delay, Δ, to be half of the number of weights. Once a convergent set of weights is obtained, the
delay is increased or decreased for an optimal solution.
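
A minimal sketch of delayed inverse modeling with the LMS update is given here, under stated assumptions: the plant is a hypothetical two-tap non-minimum-phase FIR system (not the beam model), the excitation is white Gaussian noise, the update is the standard w <- w + 2*mu*e*x form, and the delay follows the half-the-weights rule of thumb. The names `simulate_plant`, `lms_delayed_inverse` and `cascade_response` are illustrative only.

```python
import random


def simulate_plant(u, b0=1.0, b1=2.0):
    """Hypothetical non-minimum-phase plant y[k] = b0*u[k] + b1*u[k-1]."""
    y, u_prev = [], 0.0
    for u_k in u:
        y.append(b0 * u_k + b1 * u_prev)
        u_prev = u_k
    return y


def lms_delayed_inverse(u, y, n_weights, delay, mu):
    """Adapt FIR weights so that filtering y approximates u delayed by `delay` samples."""
    w = [0.0] * n_weights
    taps = [0.0] * n_weights  # most recent plant outputs, newest first
    for k in range(len(y)):
        taps = [y[k]] + taps[:-1]
        desired = u[k - delay] if k >= delay else 0.0
        estimate = sum(wi * ti for wi, ti in zip(w, taps))
        error = desired - estimate
        w = [wi + 2.0 * mu * error * ti for wi, ti in zip(w, taps)]
    return w


def cascade_response(w, b0=1.0, b1=2.0):
    """Impulse response of the plant followed by the inverse model."""
    padded = [0.0] + list(w) + [0.0]
    return [b0 * padded[m + 1] + b1 * padded[m] for m in range(len(w) + 1)]


random.seed(0)
n_weights = 16
delay = n_weights // 2  # rule of thumb: half the number of weights
u = [random.gauss(0.0, 1.0) for _ in range(8000)]
w = lms_delayed_inverse(u, simulate_plant(u), n_weights, delay, mu=0.002)
response = cascade_response(w)
```

If the adaptation has converged, `response` should be close to a pure delay: a value near one at index `delay` and values near zero elsewhere, even though the exact causal inverse of this plant is unstable.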

The  approaches  for  updating  the  weight  vector  in  the  delayed  adaptive  inverse  model  can  be 
explained  using  the  schematic  shown  in  Figure  3,  where  L  is  the  number  of  weights  used  in  the 
delayed adaptive inverse model, ts is the sample period, ti is the time needed per iteration, and tc is
the time needed for convergence of the weights for a given sample set of inputs and outputs of the
plant, which is equal to the number of iterations to convergence multiplied by ti.

One approach is to update all the weights only once at each sampling time, in which case ti is
equal to ts. The number of sample periods used in this approach must of course be much greater
than  the  number  of  weights  in  order  to  achieve  convergence  to  the  desired  value.  After  a  change  in 
the  plant's  dynamic  characteristics,  a  considerable  number  of  sample  periods  (larger  than  the 
number  of  weights)  may  be  required  in  order  to  reach  convergence.  The  second  approach  is  to 
iterate  the  weights  using  the  LMS  adaptation  several  times  during  each  sampling  time,  in  which 
case  tc  is  equal  to  or  less  than  ts.  The  first  approach  is  applicable  to  very  high  sampling  rate 
systems.  The  second  approach  is  applicable  when  faster  processors  are  available  or  for  systems  in 
which  the  rate  of  convergence  is  faster  than  the  sampling  rate.  This  convergence  rate  will  be  a 
function  of  the  complexity  of  the  plant.  If  the  rate  of  convergence  is  slower  than  the  sampling  rate, 
one  can  use  the  maximum  number  of  iterations  that  are  possible  within  the  sampling  period.  Of 
course,  the  permissible  number  of  iterations  within  each  sampling  period  is  processor  dependent. 
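
The effect of iterating the LMS update several times within one sampling period can be sketched as follows. This is one possible reading of the second approach, in which the same tap vector and desired value are reused for every iteration; the function name and the numerical values are hypothetical.

```python
def lms_iterate(w, taps, desired, mu, n_iter):
    """Apply the LMS update n_iter times to the same sample.

    Returns the updated weights and the error remaining after the last update.
    Each pass shrinks the error by the factor (1 - 2*mu*sum(t*t for t in taps)).
    """
    w = list(w)
    for _ in range(n_iter):
        error = desired - sum(wi * ti for wi, ti in zip(w, taps))
        w = [wi + 2.0 * mu * error * ti for wi, ti in zip(w, taps)]
    remaining = desired - sum(wi * ti for wi, ti in zip(w, taps))
    return w, remaining


taps = [1.0, 2.0]
_, err_once = lms_iterate([0.0, 0.0], taps, desired=1.0, mu=0.05, n_iter=1)
_, err_many = lms_iterate([0.0, 0.0], taps, desired=1.0, mu=0.05, n_iter=20)
print(err_once, err_many)
```

With these values the error is halved on each pass, so twenty iterations per sample leave a far smaller residual than a single update, at the cost of twenty times the computation within the sampling period.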


5.  Control  of  a  Flexible  Beam  Unit:  Simulation  Results 

The  application  of  the  delayed  adaptive  inverse  method  to  a  flexible  beam  unit  with  joint  and 
structural  flexibility  was  reported  in  Yang  (1991).  It  was  demonstrated  that  the  delayed  adaptive 
inverse  method  works  for  controlling  systems  with  differentiable  nonlinearities.  In  this  paper, 
more  recent  results  for  the  delayed  adaptive  inverse  control  of  a  flexible  beam  system  will  be 
presented.  The  system  that  we  will  consider  here  is  a  testbed  located  at  the  Armament  Research, 
Development  and  Engineering  Center  (ARDEC)  to  simulate  the  motion  control  problems  found  on 
typical  Army  systems.  A  simplified  schematic  of  the  testbed  is  shown  in  Figure  4;  the  support  and 
bearing  structures  are  not  shown  for  clarity.  A  linear  model  of  the  testbed  developed  by  Bhat 
(1991)  is  provided  in  Figure  5.  Elements  of  the  block  diagram  model  shown  in  this  figure  are 
described  from  left  to  right.  The  first  block  is  the  torque  constant  of  the  actuating  motor  with 

Kt  =  0.89  N-m/volt .  (8) 

The  second  block  represents  the  motor  dynamics  described  by 


[motor transfer function not legible in the source]  radians/N-m ,  (9)

where  s  is  the  Laplace  variable.  The  third  block  represents  the  compliance  and  damping  of  the 
shaft;  it  has  the  form  Bs+K,  where  B  is  the  damping  term  and  K  is  the  spring  constant  of  the  shaft. 
They are

B = 0.1663 N-m/(rad/sec), and K = 38 N-m/rad .  (10)

The  output  of  this  block  is  the  effective  torque  applied  to  the  inertia  wheel.  The  next  block 
represents  a  state  space  model  of  the  inertia  wheel  and  the  associated  flexible  beam  dynamics.  Two 
outputs  of  this  block  are  the  inertia  wheel  angular  rotation  and  the  beam's  tip  acceleration.  The  last 
block  is  a  double  integrator  which  will  yield  the  tip  displacement  in  radians.  Bhat’s  state  space 
model  for  the  inertia  wheel  and  the  flexible  beam  was  developed  using  finite  element  methods  by 
modeling  the  beam  as  four  identical  beam  elements.  Figure  6  shows  the  inertia  wheel  and  the  beam 
element  partitions  modeling  for  finite  element  analysis.  The  beam  element’s  mass  and  stiffness 
matrices are given by
where E, ρ, A, I, l are the Young's modulus, the density, the cross sectional area, the moment of
inertia, and the element length of the beam, respectively. Each segment of the finite element of the
beam satisfies the relationship (the terms in the equation are specified in Figure 6b.)
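
For reference, the textbook consistent mass and stiffness matrices of a two-node planar Euler-Bernoulli beam element are written out below in terms of the same symbols E, ρ, A, I and l. The nodal ordering (w_1, θ_1, w_2, θ_2) is an assumption, and Bhat's formulation may differ in ordering or in the mass model, so this should be read as a sketch rather than as the paper's own equation.

```latex
M_e = \frac{\rho A l}{420}
\begin{bmatrix}
156 & 22l & 54 & -13l \\
22l & 4l^2 & 13l & -3l^2 \\
54 & 13l & 156 & -22l \\
-13l & -3l^2 & -22l & 4l^2
\end{bmatrix},
\qquad
K_e = \frac{E I}{l^3}
\begin{bmatrix}
12 & 6l & -12 & 6l \\
6l & 4l^2 & -6l & 2l^2 \\
-12 & -6l & 12 & -6l \\
6l & 2l^2 & -6l & 4l^2
\end{bmatrix}
```
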
The root locus of this inertia wheel and beam model, shown in Figure 7, indicates that there are
three right half plane zeros in the Laplace domain; this confirms that the system is
indeed non-minimum phase, owing to the non-collocation of the actuation and sensing locations. A
comparison of the frequency responses of the linear model of the flexible beam derived by Bhat
(1991) from motor torque input to beam tip position with the experimental test results is shown in
Figure 8. For the analysis proposed in this paper, reasonable agreement was observed over a
range  of  frequencies.  The  performance  objective  to  be  achieved  is  that  the  tip  of  the  beam  should 
accurately  track  a  desired  path  with  specified  boundary  conditions  on  the  motion.  In  this  study,  we 
will  focus  on  the  use  of  the  accelerometer  as  the  tip  motion  sensor  in  order  to  explore  the  limit  of 
this  readily  available  sensor.  The  desired  acceleration  profile  will  be  used  as  the  input  command, 
r(t),  (i.e.,  the  desired  motion)  of  the  controller  in  Figure  2.  Since  the  output  of  the  controller  is  a 
control torque, the adaptive inverse model is generating a characterization of the plant that can be
described as an inertial effect, i.e., $\tau = I\ddot{\theta}$.
The desired displacement, velocity, and acceleration trajectories are generated by a fifth order
Hermite polynomial (Forrest-Barlach and Babcock, 1986) to ensure that the desired motion of the
tip satisfies the initial and final conditions as given in equation (14).


$$ y_d(0) = \dot{y}_d(0) = \ddot{y}_d(0) = 0; \quad y_d(t_f) = y_{df}; \quad \dot{y}_d(t_f) = \ddot{y}_d(t_f) = 0 \qquad (14) $$


The  desired  tip  displacement,  velocity  and  acceleration  can  then  be  determined  from  the 
following  set  of  equations  which  meet  the  constraints  of  equation  (14) 
For the set of tests we considered, we arbitrarily picked a motion involving a 45 degree swing
in one second followed by a one second hold. Therefore, in the above equation (15), tf was set to
1.0 second, and yd(tf) was set at 0.785 radians. The sampling period, ts, is 2.5 msec,
corresponding to a sampling frequency of 400 Hz. The desired displacement, velocity, and
acceleration profiles are plotted in Figures 9, 10, and 11, respectively. The desired beam tip
motion, r(t), entering the controller of Figure 2 was set equal to the desired acceleration, ÿd(t), i.e., only the acceleration
profile was used. The number of weights used in the adaptive inverse model was 64, and the
delay, Δ, was 29 sampling periods. A controller design and analysis software package,
MatrixX [3], was used for the simulation of the adaptive inverse controller.
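
A trajectory of this kind can be sketched with the standard quintic polynomial that has zero velocity and acceleration at both ends. This form is assumed here because it satisfies the conditions of equation (14); it is not copied from the paper's equation (15). The function name and the sampling loop are illustrative.

```python
def quintic_profile(t, t_final=1.0, y_final=0.785):
    """Desired tip displacement, velocity and acceleration at time t.

    Uses the normalized time s = t / t_final, clamped to [0, 1] so that the
    profile holds the final position after t_final.
    """
    s = min(max(t / t_final, 0.0), 1.0)
    pos = y_final * (10 * s**3 - 15 * s**4 + 6 * s**5)
    vel = y_final / t_final * (30 * s**2 - 60 * s**3 + 30 * s**4)
    acc = y_final / t_final**2 * (60 * s - 180 * s**2 + 120 * s**3)
    return pos, vel, acc


ts = 0.0025  # sampling period of 2.5 msec (400 Hz)
profile = [quintic_profile(k * ts) for k in range(801)]  # 1 s swing plus 1 s hold
```

At the midpoint of the swing the displacement is half of the final value, and after one second all three quantities settle to (0.785, 0, 0), which matches the swing-then-hold motion described above.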

The  position  response  of  the  flexible  beam  for  the  specified  acceleration  profile  is  given  in 
Figure  12  along  with  the  desired  position  response.  A  delay  is  clearly  visible.  Both  the 
commanded  and  the  controlled  tip  motion  are  completed  in  less  than  one  second  as  desired.  The 
difference  between  the  desired  and  actual  displacement  is  shown  in  Figure  13.  The  large  error 
which  peaks  near  0.6  seconds  is  due  to  the  delay  used  in  the  delayed  adaptive  inverse  controller. 
The  delay  (which  is  needed  for  causality)  is  a  function  of  the  propagation  time  through  the  plant 
plus the computational effect. The motion reversal period for this system, which is characteristic of
non-minimum phase systems, is shown in Figure 14. This motion reversal period of 0.08 seconds
results in an effective delay in propagation through the plant dynamics. Without this delay in the
delayed adaptive inverse controller, the adaptive approach will not converge. The computed input
torque to the plant is shown in Figure 15. Note three characteristics of the generated controller
torque: first, there is no residual steady state torque; second, the duration of the input torque is
greater than the duration of the desired motion due to the delay; and third, there is a smooth transition
at the beginning and end of the torque profile. The parameters of the delayed adaptive inverse
controller were obtained via an iterative manual optimization process: the values found to
work best were initial weights set at 0.015, μ set at 0.075, and 20 iterations within each
sampling period.

[3] Available from Integrated Systems Inc., Santa Clara, CA

Although the parameters of the delayed adaptive inverse model are obtained via a manual
iteration approach, the parameters have predictable effects on the system response. The
gain constant, μ, determines the rate and stability of the adaptation. A large value of μ will shorten
the time for convergence, but it may also cause instability of the controlled system. Effects of μ on
the system response are shown for three values of μ in Figure 16. It is seen from this figure that
the smaller the μ, the larger the steady state error will be, since the convergence of the weights is
slower for small values of μ. As expected, a large value of μ made the system response unstable.
In this case, this occurred when μ was increased to 0.076.

In  Figure  17,  three  responses  are  plotted  for  various  values  of  the  initial  weights.  The  steady 
state  error  increased  for  larger  values  of  the  initial  weights.  In  our  simulations,  the  system 
response became unstable for initial weights greater than 0.0156. Therefore, the
initial weights should be chosen iteratively to optimize the system response so that the control
error (see Figure 1) is minimized within the iterations performed during each sampling
period.

As  discussed  earlier,  the  number  of  the  delay  periods  used  in  the  delayed  adaptive  inverse 
method  reflects  the  propagation  time  from  input  to  output  of  the  plant;  and  the  general  rule  of 
thumb  is  to  set  the  number  of  delay  periods  to  be  half  of  the  number  of  weights.  After  iterative 
simulations, a delay of 29 sampling periods (in our case, the delay equals 29 × 0.0025 seconds, or
0.0725 seconds) was found to be optimal for our 64 weight FIR based controller. As shown in
Figure  18,  the  larger  the  delay,  the  larger  the  steady  state  error  of  the  system  response.  The  system 
became  unstable  when  the  delay  was  reduced  to  28  sampling  periods.  This  result  indicates  that  the 
propagation  time  of  the  flexible  beam  system  must  be  greater  than  28  sampling  periods. 

A  residual  error  remains  after  the  one  second  settling  time  (see  Figure  13).  This  is  because  the 
adaptive  inverse  method  is  supposed  to  only  yield  an  approximation  of  the  inverse  plant  dynamics. 
A  feedback  control  loop  can  be  incorporated  into  the  delayed  adaptive  inverse  approach  as  shown 
in  Figure  19  in  order  to  remove  the  residual  error  in  the  system  response.  A  linear  feedback 
controller  could  be  used  for  this  purpose  since  the  amount  of  residual  error  is  small  in  magnitude. 
To  illustrate  this  point,  a  simple  proportional  gain  was  used  as  a  feedback  controller  to  eliminate  the 
error  ripples  of  Figure  13.  The  position  response  is  plotted  in  Figure  20.  The  error  between  this 
response  and  the  desired  value  is  shown  in  Figure  21.  Although  it  is  not  clear  from  the  figure,  the 
steady  state  error  has  been  reduced  by  64%. 

We designed an H∞ based loopshaping controller using the methodology described in Doyle et
al. (1992) for controlling the tip motion of our flexible beam (i.e., the plant described by Figure 5
and equations (8) through (13)). We used the Robust Controller Toolbox in MatrixX for the
controller design. For comparison purposes, no uncertainty was considered in designing the
controller, i.e., only the nominal performance was considered. In an attempt to obtain the position
response shown in Figure 9, an initial design specification for determining the ideal complementary
sensitivity transfer function for loopshaping was formulated as a settling time of one second with
critical damping for a step input. However, an optimal controller was not obtained since this
specification was too stringent and not achievable. Therefore, the design specification was relaxed
to a settling time of one second with an overshoot of less than 20% of the steady state value for a
step input. The resulting H∞ based loopshaping controller is fourteenth order, as a result of the
order of the system (which is twelfth order) and the order of the weighting function (which is
second order). The step response of the system with the H∞ based loopshaping controller is
shown in Figure 22. The design specification was met, but an undesirable oscillation appeared
which lasted for up to six seconds. The motion response of the flexible beam tip using this H∞
based loopshaping controller with the desired input (fifth order polynomial) as shown in Figure 9
(which represents the desired displacement profile) is given in Figure 23. Comparison of this
response with the response of the system with an adaptive inverse feedforward combined with the
linear feedback based controllers as shown in Figure 20 (and repeated in Figure 23 for comparison
purposes) shows that the settling time has grown to 3 seconds. It would seem that the adaptive
inverse controller with a proportional controller results in superior performance. However, it is
important to note that the superiority of the delayed adaptive inverse controllers over H∞ based
loopshaping controllers is only demonstrated for nominal performance and not for robust
performance. In fact, in the presence of measurement noise (we considered both 5% and 10% of
the maximum torque input applied additively at the plant input), the H∞ based loopshaping
controller yields a stable response (similar to the response shown in Figure 23) while the delayed
adaptive inverse controller results in an unstable system response. It is therefore imperative that
high quality measurements be available when using a delayed adaptive controller.


6.  Conclusion 

The  work  described  here  presents  the  development  of  a  delayed  adaptive  inverse  method  which 
may  facilitate  the  closed-loop  control  of  a  flexible  beam.  An  adaptation  of  a  linear  combiner  was 
achieved  by  using  the  LMS  algorithm  to  find  a  best  fit  to  the  reciprocal  of  a  given  plant  transfer 
function.  A  copy  of  this  delayed  adaptive  inverse  model  was  used  to  compute  the  necessary 
torques  required  to  control  the  non-minimum  phase  system  as  shown  in  Figure  2. 

The simulation results for the system show that the applied torque is bounded and that one can
achieve reasonable nominal performance as long as the sensing is of high quality. Comparison of
the two system responses, one with adaptive inverse feedforward and linear feedback controllers,
and one with an H∞ based loopshaping controller, shows that the first system has better response
characteristics. Although the delayed adaptive inverse method works for nonlinear systems, in this
preliminary study a linear system was used to demonstrate the performance of the delayed adaptive
inverse controller so that the result could be compared with that of an H∞ based loopshaping
controller, which is applicable to linear systems only. Both the delayed adaptive inverse
controllers and the H∞ based loopshaping controllers ought to be able to handle model uncertainty.
This will be a subject for future investigation.

Since there are no squaring, averaging or differentiating operations in the LMS algorithm
(the main computational element of this approach), the delayed adaptive inverse method proposed
here is computationally efficient. The on-line approach for updating the weights described in
section 4 updates all the weights iteratively 20 times per sampling period, i.e., 20 times every
0.0025 seconds in our case, which is not an unreasonable expectation for a high performance
system.
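
The processing load implied by these numbers can be estimated with a short calculation. The operation count is an assumption, not a figure from the paper: each LMS update is taken to cost L multiplies for the filter output, L for the weight update, and one for scaling the error by 2μ.

```python
def lms_multiplies_per_second(n_weights, iterations_per_sample, sample_rate_hz):
    """Rough multiply count for iterated LMS under the assumed 2L + 1 cost per update."""
    per_update = 2 * n_weights + 1
    return per_update * iterations_per_sample * sample_rate_hz


# 64 weights, 20 iterations per sampling period, 400 Hz sampling rate.
load = lms_multiplies_per_second(64, 20, 400)
print(load)
```

Under this counting the controller needs on the order of one million multiplies per second, which gives a sense of the processor speed the second updating approach depends on.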

The  delayed  adaptive  inverse  method  uses  only  knowledge  of  the  desired  values  for  the  output 
(specified  as  an  input  r(t)  in  Figure  2)  and  the  actual  plant  output  measurements  in  order  to 
determine  the  control  input,  u(t),  to  the  plant.  This  method  can  be  applied  equally  well  to  linear 
systems,  systems  with  differentiable  nonlinearities,  or  even  unknown  plants  as  long  as  a  control 
input  can  be  applied  and  the  plant  output  can  be  measured.  The  question  of  handling  systems  with 
non-differentiable  nonlinearities  such  as  backlash,  hysteresis,  friction,  etc.  is  still  an  open  one  and 
needs  to  be  addressed  further. 
Figure 2. An adaptive inverse model based controller

Figure 5. Linear model of the Picatinny flexible beam testbed

Figure 6. Finite element model of inertia wheel and flexible beam
(from Bhat, 1991)

Figure 8. Analytical vs. measured frequency responses: torque to tip acceleration
analytical result (solid), measured result (dash)
(from Bhat, 1991)

Figure 9. Desired displacement profile for the flexible beam

Figure 12. Simulated response of the flexible beam (solid) and desired response (dash)

Figure 13. Tip position error for the delayed adaptive inverse method

Figure 14. Close-up of Figure 12 showing motion reversal phenomenon

Figure 17. System responses for various values of the initial weights
0.015 (solid), 0.0152 (dash), 0.0154 (dot-dash)

Figure 18. System responses for various numbers of delays
29 (solid), 30 (dash), 32 (dot-dash)

Figure 19. A closed loop control scheme based on the adaptive inverse method

Figure 20. System response based on adaptive inverse and feedback with proportional gain
actual response (solid), desired response (dash)

Figure 21. Tip position error for the adaptive inverse and feedback with proportional gain

Figure 22. Step response of the tip position with an H∞ based loopshaping controller

Figure 23. Response of tip position with an H∞ based loopshaping
controller for an input shown in Figure 9
actual response (solid), response of delayed inverse adaptive
FIR method as in Figure 20 (dash), desired response (dot-dash)
Abstract

A preliminary μ-synthesis controller for the Army’s ATB-1000 test fixture is designed
and analyzed. For comparison, two SISO controller designs are also described.
The test fixture is patterned after the Apache helicopter’s 30 mm gun and has tunable
nonlinearities which may be representative not only of the nonlinearities of the gun,
but of other mechanical systems as well. The models of the test fixture which were
available at the time of the work are also described. The goal in pointing the gun is
to reduce dispersions of fired gun rounds on targets. The resulting μ-synthesis design,
when connected with a nonlinear simulation, exhibited limit-cycle behavior of
unacceptable amplitude. The unacceptable performance is due to the nonlinearities
and, in future work, would be improved upon by frequency domain trade-offs during
the synthesis step.


1  Introduction 

For  proper  overall  functioning  of  most  of  the  Army’s  weapons  systems,  specific  subsystems 
demand  high  precision  control.  For  example,  a  guided  munition  system  may  be  fitted 
with  laser  systems  for  ranging  and/or  targeting.  Both  of  the  laser  subsystems  call  for 
accurate  pointing  control  systems.  These  are  in  addition  to  the  high  performance  guidance 
and  control  of  the  munition  itself.  Tank  and  gun  systems  require  stabilised  platforms 
from  which  rapid  firing  and  re-targeting  occur.  Stabilised  platforms  are  also  necessary 
for antenna systems and video camera systems which are envisioned in future battlefield
scenarios.  Oftentimes,  the  accuracy  of  these  control  laws  is  limited  by  the  mechanical 
system  itself,  for  example,  dead  zones  in  gear  drives,  or  friction  in  bearings. 

The  Army  Research  Office  has  built  a  laboratory  fixture  to  study  control  laws  for 
problems  that  are  dominated  by  “hard”  nonlinearities.  Example  nonlinearities  in  this 
group  are  saturations,  static  friction  effects,  and  gear  backlash.  The  fixture,  the  ATB- 
1000,  is  patterned  after  the  Apache  helicopter  gun,  and  has  built-in  tunable  nonlinearities. 
It  is  ideal  for  studying  problems  in  application  of  linear  and  nonlinear  control  law  designs. 

This  paper  offers  three  potential  linear  control  designs  for  the  ATB-1000.  Section  2 
briefly  discusses  the  objectives  for  the  design,  and  Section  3  describes  the  models  available 
for  design  and  analysis.  Section  4  discusses  the  three  different  designs,  and  Section  5 
contains  some  nonlinear  simulation  and  linear  analyses  for  one  of  the  designs.  Finally 
Section  6  summarizes  the  work. 


2  Requirements 

The  ATB-1000  is  a  test  fixture  patterned  after  the  Apache’s  30  mm  gun.  The  basic 
goal  for  this  weapon  is  to  reduce  dispersion  of  its  rounds  on  targets.  So  the  objective 
for  the  ATB-1000  is  to  minimize  the  barrel  pointing  angle  deviation  from  a  commanded 
value  in  the  presence  of  platform  motion  (simulated  with  disk  motion),  gun  firing-induced 
transients (simulated with a solenoid), and mechanism nonlinearities (simulated with adjustable
backlash and friction). The laboratory fixture (see Figure 1) is outfitted with a
laser arm to accurately measure the barrel tip position and hence experimentally determine
performance. There are also disturbance levels and ranges of parametric nonlinearity
adjustments  that  are  part  of  the  requirements. 

3  Models 

In  practice,  the  development  of  a  successful  control  system  design  is  highly  dependent  on 
obtaining  representative  models  of  the  system  to  be  controlled.  The  models  are  a  direct 
input  to  the  control  law  synthesis  and  analysis  steps  in  the  development  process.  Models 
and  modeling  data  come  in  many  different  forms,  and  different  types  of  models  are  used  for 
different  purposes.  Two  distinct  models  of  ATB-1000  test  fixture  were  examined  as  part  of 
this  preliminary  control  design  effort.  These  two  models  will  be  described  and  compared 
in  this  section.  Some  discrepancies  between  these  two  models  have  been  identified  and  it 
will  be  necessary  to  resolve  them  for  future  studies. 

During the summer of 1991, modeling data were received from the Army
Research Office and analyzed. The modeling material consisted of MatrixX block diagrams, tables
containing definitions, scale factors, sign conventions, units, signal size information, and linear
models for the truncated finite element model of the barrel (with 8 states) and a 27th order
identification model. This section will refer to an analytical model and an identification
model. The analytical model is based on the block diagrams, tabular data, and first principles
of dynamics, and includes four relay-type nonlinearities and two deadband-type nonlinearities.
The identification model is a linear model whose inputs and outputs are a subset of
those in the analytical model.

Figure 1. Schematic of ATB-1000 test fixture.

The  MatrixX  block  diagrams  and  the  tabular  data  were  used  to  generate  a  linear  model 
and  a  nonlinear  simulation  using  Honeywell  computer  tools.  The  linear  model  treats  the 
deadband  as  a  unity  operator  which  neglects  backlash,  and  the  relay  as  a  zero  operator 
which  neglects  friction.  The  linear  model  was  then  examined  in  terms  of  its  poles,  transfer 
function  zeros  for  certain  inputs  and  outputs,  and  time  and  frequency  responses.  There  are 
degrees-of-freedom  for  the  disk  translation  and  rotation  in  a  plane,  motor  rotation,  inertia 
wheel  rotation,  laser  arm  motor  rotation,  laser  arm  rotation,  and  three  elastic  degrees  of 
freedom  for  the  gun  barrel  (simulated  with  a  rod  attached  to  the  inertia  wheel). 
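
The linearization described above can be sketched in a few lines. The exact forms of the deadband and relay elements in the ATB-1000 model are not given in the text, so the idealized definitions below (a symmetric deadband of hypothetical half-width and a sign-type relay of hypothetical level) are assumptions for illustration.

```python
def deadband(x, half_width):
    """Idealized backlash-type deadband: no output until |x| exceeds half_width."""
    if abs(x) <= half_width:
        return 0.0
    return x - half_width if x > 0 else x + half_width


def relay(x, level):
    """Idealized Coulomb-friction relay: constant magnitude with the sign of x."""
    if x > 0:
        return level
    if x < 0:
        return -level
    return 0.0


def deadband_linear(x):
    """Linear surrogate: unity operator, i.e. backlash neglected."""
    return x


def relay_linear(x):
    """Linear surrogate: zero operator, i.e. friction neglected."""
    return 0.0


print(deadband(3.0, 1.0), deadband_linear(3.0))
print(relay(-0.2, 5.0), relay_linear(-0.2))
```

The surrogates agree with the nonlinear elements only when the signal is large compared with the deadband width and the friction level is small compared with the other torques, which is why the nonlinear simulation is still needed to check a linear design.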

The  linear  open  loop  model  consists  of  two  physical  systems:  the  disk,  inertia  wheel, 
and  rod  system,  and  the  laser  arm  system.  There  is  a  motor  associated  with  each  system. 
There  are  five  pairs  of  open  loop  poles  at  the  origin  (because  friction  is  neglected)  associated 
with  the  rigid  body  degrees-of-freedom.  There  are  two  pairs  of  complex  poles  associated 
with the compliances in both systems, and there are three pairs of complex poles associated
with the gun barrel resonances with small damping ratios.

The plant transfer function between the control motor torque and the barrel pointing
angle can be regarded as a double integrator (at frequencies below 10 rad/sec) with
disturbances  (from  disk  motion)  and  some  high  frequency  elastic  modes.  The  nonlinear 
simulation  was  executed  with  different  test  inputs  to  assess  its  behavior. 

An  identification  (ID)  model  was  obtained  in  a  state  space  format  with  seven  outputs, 
one  input,  and  27  states.  The  outputs  are  torque  motor  resolver,  backlash  resolver,  disk 
velocity,  quadcell  output,  strain  gauge  #1,  strain  gauge  #2,  and  torque  motor  tachometer, 
the  input  is  the  control  motor  torque,  and  the  27  states  are  not  physically  defined  but  the 
linear  ID  model  fits  the  data  from  the  identification  experiments.  This  model  was  compared 
to  the  analytical  model  in  terms  of  poles,  frequency  response,  and  time  histories. 

The ID model shows more open loop damping, e.g., ζ_ID = 0.07 versus a smaller ζ_analytical
for the first elastic mode (near 31 rad/sec) and ζ_ID = 0.15 versus ζ_analytical = 0.084 for the
shaft  compliance  mode  (near  55  rad/sec)  between  torque  motor  and  inertia  wheel.  The 
low  frequency  behavior  of  the  ID  model  shows  a  slope  of  -1  on  a  Bode  gain  plot  versus 
the  slope  of  -2  in  the  analytical  model,  because  friction  is  present  in  the  identification 
experiment,  but  neglected  in  analytical  model.  In  addition,  the  low  frequency  accuracy  of 
the  ID  model  is  limited  by  the  length  of  time  used  for  the  identification  experiment.  Thus 
the  ID  model  is  not  close  to  the  analytical  model  for  frequencies  below  10  rad/sec.  Except 
for  the  poles  and  low  frequency  asymptote,  the  ID  and  analytical  models  agree  for  torque 
motor  resolver,  backlash  resolver,  and  torque  motor  tachometer  outputs.  On  the  other 
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hand,  the  ID  and  analytical  models  for  strain  gauge  #2  show  180  deg  phase  discrepancies, 
and  the  the  quadcell  output  does  not  show  close  agreement  at  any  frequency. 
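
The difference between the -1 and -2 low frequency slopes can be illustrated with a
minimal sketch. It assumes a hypothetical single-inertia plant with inertia J and viscous
friction b (values chosen for illustration only, not taken from the test fixture): without
friction the torque-to-angle transfer function is 1/(J s^2), and with friction it becomes
1/(s (J s + b)), which behaves like 1/s well below the corner frequency b/J.

```python
import math

J = 1.0    # hypothetical inertia
B_FRIC = 50.0   # hypothetical viscous friction (corner frequency B_FRIC/J = 50 rad/sec)

def mag_no_friction(w):
    # |1 / (J (jw)^2)|
    return 1.0 / (J * w * w)

def mag_with_friction(w):
    # |1 / (jw (J jw + b))|
    return 1.0 / (w * math.hypot(J * w, B_FRIC))

def gain_slope(mag, w1, w2):
    """Slope of log10|G| versus log10(w) between two frequencies."""
    return (math.log10(mag(w2)) - math.log10(mag(w1))) / (math.log10(w2) - math.log10(w1))

slope_analytical = gain_slope(mag_no_friction, 0.1, 1.0)      # friction neglected
slope_with_friction = gain_slope(mag_with_friction, 0.1, 1.0)  # friction present
print(round(slope_analytical, 3), round(slope_with_friction, 3))
```

In this toy model the friction term is the only difference between the two transfer
functions, which is consistent with the explanation given for the ID model.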

For future design work, it will be necessary to resolve these discrepancies before closed
loop testing can be performed. The ID model was utilized (despite these discrepancies)
for demonstration of the μ-synthesis design methodology. Actually a balanced realization
of the ID model was truncated to twelve states for the μ-synthesis design. The analytical
model was utilized to develop alternate control laws with classical approaches. One of
these classical alternates uses motor tachometer feedback, and the other has lead and
notch compensation of the inertia wheel position. The next section discusses each of these
three designs.


4  Control  Law  Design 

In this section a preliminary design effort for the ATB-1000 test fixture will be described
in detail. The design is incomplete, but adequately serves as a starting point for future
work. To limit the scope of the preliminary effort, the "hard" nonlinearities were neglected
for the control synthesis. However, closed loop simulations were carried out where the
nonlinearities were included. These preliminary simulations showed that the nonlinearities
are significant and it will be necessary to include them in future designs. In this preliminary
look at control law design, three design approaches were considered. Two approaches were
SISO and one was multivariable μ-synthesis. The SISO designs are of interest because
they correspond to minimal sensor requirements. The μ-synthesis approach is of interest
because the nonlinearities are accounted for by treating them as bounded operators.

The control problem is to point the gun barrel in the face of disturbances. For the
demonstration design presented here, the pointing was quantified in terms of the quadcell
output and only the solenoid disturbance was included in the design objective. Model
uncertainty was incorporated with a multiplicative perturbation at the torque motor
location. Sensor noise was also included in the formal μ-synthesis problem statement. More
detailed designs would incorporate frequency domain weighting transfer functions, which
act as linear bounds for the effects of the six system nonlinearities. Requirements would
also be defined and incorporated for actuator activity and physical limitations.

It was necessary to append a solenoid disturbance input (which simulates gun firing)
to the ID model. This was done by selecting a constant gain matrix from the frequency
response of the analytical model near the first elastic mode frequency. This is an
approximation used for expediency during this preliminary design. In a more detailed design
effort, the effect of the disturbance input on the equations of motion would be included
more carefully into the state-space matrices for the interconnection structure used for
μ-synthesis.

It is worth noting that the gun stabilization fixture is similar to a particular elastic
structure control problem which has received a large amount of attention in the control
and modeling literature. In addition, experimental studies have been performed at various
laboratories [3, 2]. The problem is that of rotating disks (at least two) that are connected
with rods that are elastic in torsion. These studies motivated the first design.


Colocated SISO Design  One of the SISO designs was for colocated feedback between
the torque motor resolver and the motor torque. This choice was motivated by the knowledge
that under certain assumptions regarding a lower bound for inherent structural damping,
and sufficiently high bandwidth sensors, computers, and actuators, such a mechanical
system can be robustly stabilized with colocated sensors and actuators even in the presence
of some significant nonlinearities. When the sensor and actuator are not colocated,
robust stabilization is, in general, more difficult to achieve due to limitations imposed by
non-minimum phase aspects [4, 5, 1].

The reduced order ID model was utilized to determine the feedback compensation.
Recall that the transfer function has a $1/s$ shape below 10 rad/sec in the case of the ID
model. Thus a pure gain can be selected to set the unit loop gain crossover at 10 rad/sec
as a preliminary design choice. Higher frequency resonances are stabilized because of the
colocation and the assumptions about inherent damping, sensors, and actuators. A higher
crossover could be considered, but this would require more accurate modeling of even higher
frequency elastic behavior and tighter requirements on sensors and actuators. A pure gain
feedback between motor position and motor torque would not be stabilizing if connected
to the analytical model because it has a $1/s^2$ shape below 10 rad/sec as discussed above.


Noncolocated SISO Design  The other SISO design was developed with the analytical
model for noncolocated feedback between the inertia wheel encoder and motor torque. In
this case a lead compensation element was employed to create a unit loop gain crossover
at 10 rad/sec. Some of the higher frequency resonances are destabilized by
the noncolocated feedback. To prevent this destabilization, notch filters were included for
the first elastic mode and the compliant mode between the motor and inertia wheel. This
design approach is of interest (as compared to the colocated design) because the colocated
motor position is not as closely related to the pointing angle as is the inertia wheel. This
design also has value as a further comparison against the μ-synthesis design.


Mu-Synthesis Design  The μ-synthesis design approach is multivariable and is cast
in terms of the interconnection structure shown in Figure 2. There is a multiplicative
perturbation at the torque motor location represented by $\Delta$ and the input $v_1$ and output
$z_1$. There is a performance output, which is the quadcell output passed through
a weighting function. The external inputs are sensor noise and the solenoid disturbance.
There is also the torque motor input and the seven sensors to close the feedback loop with
the compensator K.

Figure 2. Block diagram of interconnection structure.

The interconnection structure includes weighting transfer functions for uncertainty
bounds, performance requirements, and disturbances. The uncertainty was modeled as
a multiplicative perturbation and was bounded with a third order Butterworth filter having
break frequencies at 20 and 300 rad/sec and a high frequency gain of 675. This can
be interpreted as 20% model error below 20 rad/sec and 67,500% model error above 300
rad/sec. The pointing requirement is included by weighting the quadcell output with a
low pass transfer function $360(s + 10)/(s^2 + 84s + 60^2)$. This has unit steady-state gain,
so outputs of less than 1 volt would be acceptable. The seven sensor noises are weighted
with the constant value of 0.01, so this corresponds to either volts or counts depending on
the sensor. Finally, the solenoid disturbance is weighted with a low pass transfer function
$0.3/(s + 10)$, so inputs of 0.03 volts are expected. The weightings were not carefully related
to the hardware in this preliminary design demonstration. This relationship should be
more carefully addressed to better account for known hardware characteristics. In particular,
a weighting for the disturbance would take into account the duty cycle of the solenoid.
Additional inputs and outputs as well as weightings could be utilized to represent the
nonlinearities which have not been accounted for in the preliminary design.
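
The steady-state gains quoted for the weightings can be checked with a short sketch.
The function names below are illustrative, and the last two lines rest on one reading of
the uncertainty bound (a third order rise between the two break frequencies); that reading
is an assumption, not a statement of how the filter was actually constructed.

```python
def perf_weight(s):
    # Quadcell performance weighting 360 (s + 10) / (s^2 + 84 s + 60^2)
    return 360.0 * (s + 10.0) / (s * s + 84.0 * s + 60.0 ** 2)

def disturbance_weight(s):
    # Solenoid disturbance weighting 0.3 / (s + 10)
    return 0.3 / (s + 10.0)

perf_dc = abs(perf_weight(0.0))         # steady-state gain of the performance weight
dist_dc = abs(disturbance_weight(0.0))  # steady-state gain of the disturbance weight

# Uncertainty bound: 20% error at low frequency, gain of 675 at high frequency.
gain_ratio = 675.0 / 0.20
# Rise of a third order slope between the break frequencies 20 and 300 rad/sec.
third_order_ratio = (300.0 / 20.0) ** 3

print(perf_dc, dist_dc, gain_ratio, third_order_ratio)
```

Under that reading, the low and high frequency levels of the uncertainty bound are
consistent with the stated filter order and break frequencies.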

The state space solution to the $H_\infty$ control synthesis problem was used to find a feedback
compensator K. This compensator has as many states (18) as the interconnection
structure and it was possible to reduce the compensator order by residualization to 16
states. The closed loop transfer function is denoted by M and connects the inputs ($v_1$,
solenoid, and sensor noise) to the outputs ($z_1$ and quadcell).

The next step in the μ-synthesis design was to introduce D-scales to properly account for
the model uncertainty and performance variable response to external inputs. A constant
D-scale of 3 was employed because a dynamic D-scale was not deemed necessary in this
preliminary design. The D-scale was incorporated by multiplying $z_1$ by 3 and dividing
$v_1$ by 3 (i.e., $DMD^{-1}$) and a new interconnection structure P was established. The $H_\infty$
problem was then re-solved for the compensator K and the iterations were terminated.
Detailed analyses of this compensator appear in the following section.
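
The effect of a constant D-scale can be seen on a small numerical example. The sketch
below uses a hypothetical real 2 x 2 matrix standing in for M at a single frequency (the
entries are invented for illustration and are not taken from the design). Scaling the
first output by d and the first input by 1/d forms $DMD^{-1}$ with D = diag(d, 1), which
leaves the determinant unchanged but can lower the maximum singular value.

```python
import math

def max_singular_value(m):
    """Largest singular value of a real 2x2 matrix [[a, b], [c, e]]."""
    (a, b), (c, e) = m
    s = a * a + b * b + c * c + e * e        # sum of squared singular values
    det = a * e - b * c                      # product of singular values (up to sign)
    return math.sqrt((s + math.sqrt(max(s * s - 4.0 * det * det, 0.0))) / 2.0)

def d_scale(m, d):
    """Return D M D^{-1} for D = diag(d, 1): first row times d, first column divided by d."""
    (a, b), (c, e) = m
    return [[a, d * b], [c / d, e]]

M_EXAMPLE = [[0.5, 0.1], [4.0, 0.5]]         # hypothetical values
sigma_unscaled = max_singular_value(M_EXAMPLE)
sigma_scaled = max_singular_value(d_scale(M_EXAMPLE, 3.0))
print(round(sigma_unscaled, 3), round(sigma_scaled, 3))
```

In a D-K iteration the scale would be chosen to reduce this upper bound at each frequency;
the single constant value used here only mirrors the constant D-scale of the preliminary design.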


5  Analyses 

The μ-synthesis results are graphed in Figure 3. There are five plots of Bode magnitude
versus frequency. The top curve is relatively flat because it is the maximum singular value
of the closed loop interconnection structure ($\bar{\sigma}[M]$), and $H_\infty$ optimization makes its peak
value as small as possible. The next curve down is the structured singular value, $\mu[M]$,
and is necessarily less than or equal to the upper curve, since $\bar{\sigma}[M]$ is a theoretical upper
bound for $\mu[M]$. There is a low frequency difference between the structured and maximum
singular values, which indicates that performance improvements are possible by further
D-K iteration and frequency dependent D-scales.

The next two curves in Figure 3 correspond to robust stability and nominal performance.
Theoretically these curves are less than or equal to the structured singular value
and this is consistent with the numerical results. The robust stability curve is relative
to the defined multiplicative perturbation, and dominates $\mu[M]$ at higher frequencies. The
robust stability curve can be further interpreted as the weighted complementary sensitivity,
where the weighting is the bound for the multiplicative perturbation. The nominal
performance curve is the maximum singular value of the transfer function matrix between
the weighted quadcell and the external inputs including the weighted solenoid disturbance
and sensor noise. This curve dominates $\mu[M]$ at low frequencies and can be further
interpreted as the weighted sensitivity. The lowest curve in the figure corresponds to the
weighted quadcell response due to sensor noise. This is more than an order of magnitude
less than $\mu[M]$, so the quadcell/sensor noise path does not have much influence on the
optimal design.

Further analyses of the μ-synthesis design were carried out to assess closed loop poles,
input and output loop properties, and time response to solenoid disturbances. The closed
loop poles indicated closed loop stability and damping improvements for the first elastic
mode ($\zeta_{cl} = 0.12$ versus $\zeta_{ol} = 0.08$) and compliant mode ($\zeta_{cl} = 0.21$ versus $\zeta_{ol} = 0.14$).
Gain and phase margins for the SISO loop transfer function at the torque motor actuator
location were evaluated. The lowest frequency unit gain crossover occurs at 5.8 rad/sec
with a phase margin of 81 degrees. The phase margins surrounding the first elastic mode
frequency are larger than 43 degrees. All gain margins are larger than 7 dB. These are
considered good margins with respect to model uncertainty at the actuator location.
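
The damping ratios quoted above relate to pole locations through $\zeta = -\mathrm{Re}(p)/|p|$.
The sketch below illustrates that relation for the first elastic mode. It assumes, purely
for illustration, that the natural frequency stays at 31 rad/sec in both the open and
closed loop cases; the actual closed loop pole locations are not given in the text.

```python
import math

def pole_from(zeta, wn):
    """Upper-half-plane pole with natural frequency wn (rad/sec) and damping ratio zeta."""
    return complex(-zeta * wn, wn * math.sqrt(1.0 - zeta * zeta))

def damping_ratio(pole):
    """Damping ratio of a complex pole: zeta = -Re(p) / |p|."""
    return -pole.real / abs(pole)

WN_FIRST_ELASTIC = 31.0                          # rad/sec, "near 31 rad/sec"
open_loop_pole = pole_from(0.08, WN_FIRST_ELASTIC)
closed_loop_pole = pole_from(0.12, WN_FIRST_ELASTIC)

# Time constant of the oscillation envelope, 1 / (zeta * wn), in seconds.
tau_open = -1.0 / open_loop_pole.real
tau_closed = -1.0 / closed_loop_pole.real
print(round(tau_open, 3), round(tau_closed, 3))
```

Under this assumption the closed loop envelope decays about one and a half times faster
than the open loop envelope, which gives a feel for the size of the damping improvement.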

The linear closed loop system was simulated with the disturbance model used for the
μ-synthesis design. This disturbance model is a constant gain matrix between the solenoid
and the measurements including the quadcell output. Thus this model is only accurate near
the first elastic mode frequency and is not accurate at low or high frequencies. The
disturbance input was a 10 Hz sequence of 10 msec, 1 volt pulses. (See Figure 4a.) The quadcell
output response is dominated by the compliant mode because the pulse frequency is
close to the compliant mode frequency. The quadcell output during the 10
msec solenoid firing is not accurate, so disregarding these portions of the response, the
quadcell output shows a residual oscillation near the compliant mode frequency with less
than 3 volts peak-to-peak. (See Figure 4b.) This is not considered satisfactory performance
and the interconnection structure should be further refined to improve the performance by
making better tradeoffs with the weighting functions.


Figure 4. Time histories using mu-synthesis design: a) solenoid disturbance, b) quadcell output; horizontal axis: time (seconds).


6 Summary

A preliminary μ-synthesis control law design and some associated analyses have been carried
out for the ATB-1000 test fixture. Two other SISO controllers were designed for
comparison, but only the descriptions of the designs appear here. The μ-synthesis design,
when simulated with a nonlinear model of the test fixture, exhibited unacceptable
limit-cycle behavior. In future work, the limits to achievable performance will be established by
quantifying the key tradeoffs in terms of $\mu$ and plots like Figure 3. It is expected that the
performance will be improved but still limited, for example, by a certain nonlinearity or
a particular pulse frequency. The μ-synthesis methodology is well-suited for sorting out
such issues.

ABSTRACT


In  this  paper  we  introduce  a  new,  model  based,  nonlinear  control  method,  referred  to  as 
the  Trajectory  Pattern  Method.  The  Trajectory  Pattern  Method  is  based  exclusively  on  the 
desired  behavior  of  the  system.  We  show  that  this  method  is  robust,  allows  for  long  term 
planning,  and  can  be  used  to  adaptively  control  systems  with  parameter  uncertainty. 

1.  INTRODUCTION 

The Trajectory Pattern Method is a new inverse dynamics based controller. The object
of this paper is to explain the Trajectory Pattern Method and to show some of its potential
applications in control of nonlinear systems. Such inverse dynamics based controllers have
been extensively applied to the tracking control of robot manipulators and other similar
nonlinear dynamic systems [1-3,13-15]. One major drawback of the presently available
model based controllers is their high sensitivity to model parameter inaccuracies and
variations. In order to overcome this problem, a number of learning and adaptive control
schemes [16,17] and parameter identification and calibration algorithms have been developed
to compensate for the effects of model parameter uncertainties [18-21]. Here, no attempt is made
to present a comprehensive review of the literature.

The authors have been developing the Trajectory Pattern Method [1-3] over the past three
years. To date, this method has been applied to the study of the inherent characteristics of
the nonlinear dynamics of manipulators [4-5]; the problem of manipulator type synthesis for
minimal high frequency vibrational excitation [6]; the study of the effects of the payload on
the vibrational excitation during motion [7]; trajectory synthesis for robot manipulators for
minimal vibrational excitation due to the payload [12]; the trajectory synthesis for minimal
residual vibration of the tip motion of a high speed positioning machine with structural
flexibility [8], a flexible beam [9], and a system with joint and structural flexibility [10];
a system with joint flexibility for minimum attainable time and minimal residual vibration
[11]; and the tracking control of robot manipulators [1].

In this paper, we develop the basic structure of the Trajectory Pattern Method. The
emphasis is on systems described by finite dimensional ordinary differential equations. The
robustness in the presence of full and partial control is discussed. The approach can
accommodate holonomic and nonholonomic constraints. In section 3, we apply this methodology to
develop nonlinear adaptive and efficient model parameter uncertainty compensation schemes.
A rigorous theoretical treatment of these topics will be presented in a forthcoming paper [22].

2.  TRAJECTORY  PATTERN  METHOD 

In  this  section  we  present  the  Trajectory  Pattern  Method  in  some  detail.  We  show  that 
it  is  a  comprehensive  method  for  motion  planning  and  control  with  open  and  closed  loop 
control. 

We restrict ourselves to those systems whose control is determined by ordinary differential
equations. For the sake of simplicity, we mostly consider second order systems. We first
consider the case of full control.

Full  Control 

Consider a second order, fully coupled, differential control system on a configuration
space C which is an open subset of $R^n$:

$$A(x,\dot{x})\,\ddot{x} + B(x,\dot{x}) = u$$

Here u is in the input-space $R^n$. The state space S for this system is $C \times R^n$. We assume that
A and B are smoothly varying and that for each $(x,\dot{x}) \in S$, $A(x,\dot{x})$ is an invertible $n \times n$
matrix.

Examples of such differential control systems are rigid robotic systems with actuators at
all the joints. Here u is the vector of torques delivered at all the joints and the differential
equation represents the balance of dynamic forces and applied forces. If the torque is
delivered through a DC motor and is controlled by a voltage to the motor, then the voltage
determines the rate of change of the torque, and by differentiating the balance equation we
obtain an equation of order 3 in terms of the voltage control. The Trajectory Pattern
Method extends to higher order equations in a self-evident manner.

The Trajectory Pattern Method is a generalization of the computed torque method,
which we briefly recall. Given a trajectory $\gamma(t)$ in the configuration space C, define the
control u(t) as:

$$u(t) = A(\gamma(t),\dot{\gamma}(t))\,\ddot{\gamma}(t) + B(\gamma(t),\dot{\gamma}(t))$$

Then if

$$(x(t_0),\dot{x}(t_0)) = (\gamma(t_0),\dot{\gamma}(t_0))$$

and x(t) is defined as the solution of the open loop control:

$$A(x,\dot{x})\,\ddot{x} + B(x,\dot{x}) = u(t)$$

then

$$x(t) = \gamma(t)$$
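
The computed torque construction recalled above can be tried numerically. The following
is a minimal sketch under stated assumptions: the plant is a hypothetical damped pendulum
(the constants and the functions A, B, gamma are chosen for illustration and do not come
from the paper), the desired trajectory is 0.5 sin(t), and the closed form control is
integrated with a hand-written fourth order Runge-Kutta step.

```python
import math

M, L, G, C = 1.0, 1.0, 9.81, 0.1   # hypothetical mass, length, gravity, damping

def A(x, v):
    return M * L * L                # inertia term (constant for a pendulum)

def B(x, v):
    return C * v + M * G * L * math.sin(x)   # damping plus gravity torque

def gamma(t):
    """Desired trajectory with its first and second derivatives."""
    return 0.5 * math.sin(t), 0.5 * math.cos(t), -0.5 * math.sin(t)

def u(t):
    g, gd, gdd = gamma(t)
    return A(g, gd) * gdd + B(g, gd)          # computed torque along gamma

def deriv(t, x, v):
    return v, (u(t) - B(x, v)) / A(x, v)      # plant driven by u(t)

def rk4_step(t, x, v, h):
    k1x, k1v = deriv(t, x, v)
    k2x, k2v = deriv(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
    k3x, k3v = deriv(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
    k4x, k4v = deriv(t + h, x + h * k3x, v + h * k3v)
    return (x + h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x),
            v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))

def max_tracking_error(t_end=5.0, h=1e-3):
    """Largest |x(t) - gamma(t)| when the initial state lies on the trajectory."""
    t = 0.0
    x, v = gamma(0.0)[0], gamma(0.0)[1]
    worst = 0.0
    for _ in range(int(round(t_end / h))):
        x, v = rk4_step(t, x, v, h)
        t += h
        worst = max(worst, abs(x - gamma(t)[0]))
    return worst

print(max_tracking_error())
```

With matching initial conditions the simulated state stays on the desired trajectory up to
integration error, which is the property the Trajectory Pattern Method builds on.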

The main idea of the Trajectory Pattern Method is that if one has a family of trajectories
which accounts for all the different initial conditions, then, by applying computed torque,
the system will follow, for any initial condition, one trajectory in the family. Usually, these
trajectories are chosen so that the end condition represents a desired state of the system.

Definition: A trajectory pattern $\phi(a,t)$ over a time interval $[t_0,t_1)$ with trajectory
parameters a in the space P is a smooth map:

$$\phi: P \times [t_0,t_1) \to C$$

so that for all $t \in [t_0,t_1)$, the map:

$$a \to \Phi(a,t) = (\phi(a,t), \partial_t\phi(a,t))$$

is a diffeomorphism between P and S.

Remark: For an nth order system one would consider the map

$$a \to \Phi(a,t) = (\phi(a,t), \partial_t\phi(a,t), \ldots, \partial_t^{n-1}\phi(a,t))$$


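Examples:

1. $C = R^n$, $S = P = R^n \times R^n$,

$$\phi((a_1,a_2),t) = a_1 t^2 + a_2 t^3$$

on the time interval $[-1,0)$. Then

$$\Phi((a_1,a_2),t) = \begin{pmatrix} t^2 & t^3 \\ 2t & 3t^2 \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}$$

All trajectories pass through 0 at time 0, with zero velocity. We sketch a few of these patterns
for dimension n = 1 in Fig. 1.

2. $C = R^n$, $S = P = R^n \times R^n$,

$$\phi((a_1,a_2),t) = a_1 \sin(\pi t) + a_2 \sin(2\pi t)$$

on the time interval $[-1/2,0)$. Then

$$\Phi((a_1,a_2),t) = \begin{pmatrix} \sin(\pi t) & \sin(2\pi t) \\ \pi\cos(\pi t) & 2\pi\cos(2\pi t) \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}$$

All trajectories are periodic and pass through 0 at integer times with varying velocity, see
Fig. 2.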
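
For scalar patterns that are linear in the parameters, the diffeomorphism requirement in
the definition reduces to a nonzero determinant of the 2 x 2 matrix relating $(a_1,a_2)$ to
position and velocity. The sketch below checks this on the two stated time intervals,
taking the first pattern to be $a_1 t^2 + a_2 t^3$ and the second to be
$a_1\sin(\pi t) + a_2\sin(2\pi t)$; the helper names are illustrative.

```python
import math

def det_polynomial(t):
    # Rows (t^2, t^3) and (2t, 3t^2): determinant simplifies to t^4
    return t ** 2 * 3 * t ** 2 - t ** 3 * 2 * t

def det_sinusoidal(t):
    # Rows (sin(pi t), sin(2 pi t)) and (pi cos(pi t), 2 pi cos(2 pi t))
    p = math.pi
    return (math.sin(p * t) * 2 * p * math.cos(2 * p * t)
            - math.sin(2 * p * t) * p * math.cos(p * t))

def min_abs_det(det, t_start, t_end, n=1000):
    """Smallest |det| on a grid over [t_start, t_end), right endpoint excluded."""
    return min(abs(det(t_start + (t_end - t_start) * k / n)) for k in range(n))

poly_min = min_abs_det(det_polynomial, -1.0, 0.0)
sine_min = min_abs_det(det_sinusoidal, -0.5, 0.0)
print(poly_min > 0.0, sine_min > 0.0)
```

Both determinants vanish only at t = 0, the right endpoint that the half-open intervals
exclude; this is where all trajectories of the pattern meet.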

In general, for motions that must satisfy a certain number of end conditions, an appropriate
trajectory pattern is readily selected. Such a trajectory pattern can be additionally
selected on the basis of some optimality criterion.

A trajectory pattern $\phi$ defines a control $u_\phi(x,\dot{x},t)$ in the following manner:

$$u_\phi(x,\dot{x},t) = A(x,\dot{x})\,\partial_t^2\phi(a(t),t) + B(x,\dot{x})$$

where $a(t) = \Phi^{-1}(x(t),\dot{x}(t),t)$.

The open loop control system defined by the trajectory pattern $\phi$ is the following:

$$A(x,\dot{x})\,\ddot{x} + B(x,\dot{x}) = u_\phi(x,\dot{x},t)$$




Proposition: Given any initial condition

$$(x(t_0),\dot{x}(t_0)) \in S$$

the solution x(t) of the control-system defined by $\phi$ is the trajectory $\phi(a(t_0),t)$, where the
parameter $a(t_0)$ equals:

$$a(t_0) = \Phi^{-1}(x(t_0),\ldots,x^{(n-1)}(t_0),t_0)$$

Proof: The solution x(t) satisfies the differential equation:

$$\ddot{x} = \partial_t^2\phi(a(t),t)$$

The trajectory $\phi(a(t_0),t)$ satisfies this differential equation and has the same initial
conditions at time $t_0$. Therefore, by uniqueness of solutions to differential equations,
$x(t) = \phi(a(t_0),t)$. Q.E.D.


Remark: Observe that another way of stating this proposition is that the parameters
a(t) are constant and solely determined by the initial condition.

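Example: Consider the simplest control equation:

$$A\ddot{x} = u$$

Consider trajectories:

$$\phi(a_1,a_2,t) = a_1 t + a_2 t^2$$

Then the map $\Phi(a_1,a_2,t)$ equals:

$$\begin{pmatrix} a_1 t + a_2 t^2 \\ a_1 + 2a_2 t \end{pmatrix} = \begin{pmatrix} t & t^2 \\ 1 & 2t \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix}$$

Therefore:

$$\begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = \frac{1}{t^2} \begin{pmatrix} 2t & -t^2 \\ -1 & t \end{pmatrix} \begin{pmatrix} x \\ \dot{x} \end{pmatrix}$$

and

$$u_\phi(x,\dot{x},t) = 2Aa_2(t) = 2A\left(\frac{-x}{t^2} + \frac{\dot{x}}{t}\right)$$

The open loop control equation is then:

$$A\ddot{x} = 2A\left(\frac{-x}{t^2} + \frac{\dot{x}}{t}\right)$$

i.e.

$$\ddot{x} = 2\left(\frac{-x}{t^2} + \frac{\dot{x}}{t}\right)$$

Its solutions are linear combinations of the functions $t$ and $t^2$.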
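
The example can be checked numerically. The sketch below inverts the map for the pattern
$a_1 t + a_2 t^2$, integrates $\ddot{x} = 2(-x/t^2 + \dot{x}/t)$ with a fourth order
Runge-Kutta step, and recovers the parameters at the end of the run. The starting state
and the interval [1, 2] are chosen for illustration only.

```python
def params(x, v, t):
    """Solve (x, v) = (a1 t + a2 t^2, a1 + 2 a2 t) for (a1, a2); requires t != 0."""
    return 2.0 * x / t - v, -x / t ** 2 + v / t

def accel(t, x, v):
    # Second derivative of the pattern: 2 * a2(t), i.e. the control divided by A
    return 2.0 * params(x, v, t)[1]

def simulate(x, v, t0, t1, steps=1000):
    """Integrate the open loop control equation from t0 to t1 with RK4."""
    h = (t1 - t0) / steps
    t = t0
    for _ in range(steps):
        k1x, k1v = v, accel(t, x, v)
        k2x, k2v = v + h / 2 * k1v, accel(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
        k3x, k3v = v + h / 2 * k2v, accel(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
        k4x, k4v = v + h * k3v, accel(t + h, x + h * k3x, v + h * k3v)
        x, v = (x + h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x),
                v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))
        t += h
    return x, v

a_start = params(3.0, 5.0, 1.0)          # parameters fixed by the initial condition
x_end, v_end = simulate(3.0, 5.0, 1.0, 2.0)
a_end = params(x_end, v_end, 2.0)        # parameters recovered at the end
print(a_start, a_end, x_end)
```

The recovered parameters agree with the initial ones up to integration error, which
illustrates the remark that a(t) is constant along solutions.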

Robustness

We are considering systems that are accurately modeled by the equations of motion. In
that case, these equations of motion still depend on system parameters such as physical
parameters (masses, lengths, etc.). The previous discussion then represents the ideal case:
the system parameters, i.e., A and B, are known exactly. The first question then is how to
define the controls if the parameters are known only up to some accuracy and to determine
the robustness of this method. In the simplest implementation of the Trajectory Pattern
Method we attempt to control the real system $(A,B)$, using as control the Trajectory Pattern
Method control for an approximate system $(\bar{A},\bar{B})$. This means that given the trajectory
pattern $\phi$, define the control $u_\phi(x,\dot{x},t)$ as:

$$u_\phi(x,\dot{x},t) = \bar{A}(x,\dot{x})\,\partial_t^2\phi(a(t),t) + \bar{B}(x,\dot{x})$$

Then one obtains as open loop control for the real system:

$$A(x,\dot{x})\,\ddot{x} + B(x,\dot{x}) = \bar{A}(x,\dot{x})\,\partial_t^2\phi(a(t),t) + \bar{B}(x,\dot{x})$$

We now explain why this method is robust. We know the solution of this system exactly
when the approximate model coincides with the real system, $\bar{A} = A$ and $\bar{B} = B$, namely
the trajectory pattern trajectories. By continuous
dependence on parameters of solutions to differential equations, we conclude that for any
compact time interval and any compact set of initial conditions, the solutions to this system
are close to the trajectory patterns provided $(A,B)$ and $(\bar{A},\bar{B})$ are sufficiently nearby on a
neighborhood of the region traversed by the patterns starting in this compact set of initial
conditions during the time interval.
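
The continuity argument can be illustrated with a small simulation. The sketch below
assumes a hypothetical scalar real system $A\ddot{x} + c\dot{x} = u$ whose controller uses a
model with both coefficients off by a relative error eps, together with the pattern
$a_1 t + a_2 t^2$ on the interval [1, 2]. All numerical values are invented for illustration.

```python
def final_deviation(eps, steps=2000):
    """|x(2) - 10| when the controller's model is off by the relative error eps."""
    a_real, c_real = 1.0, 0.5                                   # hypothetical real system
    a_nom, c_nom = a_real * (1.0 + eps), c_real * (1.0 + eps)   # model used for control

    def accel(t, x, v):
        a2 = -x / t ** 2 + v / t                 # pattern parameter from the current state
        u = a_nom * 2.0 * a2 + c_nom * v         # control built from the approximate model
        return (u - c_real * v) / a_real         # response of the real system

    t, x, v = 1.0, 3.0, 5.0                      # starts on the pattern trajectory t + 2 t^2
    h = 1.0 / steps
    for _ in range(steps):
        k1x, k1v = v, accel(t, x, v)
        k2x, k2v = v + h / 2 * k1v, accel(t + h / 2, x + h / 2 * k1x, v + h / 2 * k1v)
        k3x, k3v = v + h / 2 * k2v, accel(t + h / 2, x + h / 2 * k2x, v + h / 2 * k2v)
        k4x, k4v = v + h * k3v, accel(t + h, x + h * k3x, v + h * k3v)
        x, v = (x + h / 6 * (k1x + 2 * k2x + 2 * k3x + k4x),
                v + h / 6 * (k1v + 2 * k2v + 2 * k3v + k4v))
        t += h
    return abs(x - 10.0)                         # pattern trajectory reaches 10 at t = 2

deviations = {eps: final_deviation(eps) for eps in (0.0, 0.01, 0.1)}
print(deviations)
```

In this toy case the deviation from the pattern trajectory shrinks as the model error
shrinks, as the continuous dependence argument predicts for a compact time interval.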

The  previous  robustness  results  are  not  so  useful  when  the  system  runs  for  a  fairly 
long  time,  since  errors  tend  to  build  up  (long  term  memory).  Such  effects  need  particular 
consideration  if  one  wishes  to  regulate  a  system.  We  now  discuss  particular  trajectory 
patterns  which  tremendously  reduce  the  build  up  of  long  term  errors  and  which  are  moreover 
useful  when  careful  aiming  is  necessary.  Those  patterns  focus  on  particular  end  conditions. 

We say that a trajectory pattern $\phi$ on the time interval $[t_0,t_1)$ focuses at time $t_1$ if
and only if the map $\Phi(a,t_1)$ is well-defined and has lower dimensional image.

We present examples of such focusing patterns in dimension n:

1. $\phi(a_1,a_2,t) = x_0 + a_1 t + a_2 t^2$. This pattern focuses so that at time t = 0 all the
trajectories pass through $x_0$, with variable velocity.

2. $\phi(a_1,a_2,t) = x_0 + v_0 t + a_1 t^2 + a_2 t^3$. This pattern focuses at time t = 0 to the point
$(x_0,v_0)$, i.e., all trajectories pass at time t = 0 through the point $x_0$ with velocity $v_0$.

3. $\phi(a_1,a_2,t) = x_0 + a_1\sin(t) + a_2\sin(2t)$. This pattern is periodic and focuses at time
$t = n\pi$ to the point $x_0$.

It is an interesting fact that the property of having focusing trajectories is robust for the
Trajectory Pattern Method:

Given a control system $(A,B)$ and a trajectory pattern which at time $t = t_0$ focuses to a
region. Given a compact set of initial conditions, then if the approximate model $(\bar{A},\bar{B})$ is
sufficiently close to $(A,B)$, all solutions of the control-equations with initial conditions in
the compact set focus at time $t_0$ to the same region.

In particular, if the trajectory patterns focus on a point, the trajectories of $(A,B)$ focus on
the same point. Moreover, if the trajectory patterns focus periodically, so will the trajectories
of $(A,B)$.

The mathematical explanation of this phenomenon resides in a careful study of the
solutions to the singular differential equation. We will illustrate this on a nonlinear scalar
example with constant leading coefficient. Consider control systems of the form:

$$A\ddot{x} + B(x,\dot{x}) = u$$

Assume that $(A,B)$ have nominal values $(\bar{A},\bar{B})$ and assume that we use as trajectory
pattern:

$$\phi(a_1,a_2,t) = a_1 t^2 + a_2 t^3$$

i.e., the goal is to be at time t = 0 at the origin with 0 velocity. Then

$$u_\phi(x,\dot{x},t) = \bar{A}\left(\frac{4\dot{x}}{t} - \frac{6x}{t^2}\right) + \bar{B}(x,\dot{x})$$

The open loop control system given by the Trajectory Pattern Method is then:

$$A\ddot{x} = \bar{A}\left(\frac{4\dot{x}}{t} - \frac{6x}{t^2}\right) + \bar{B}(x,\dot{x}) - B(x,\dot{x})$$

This differential equation is singular at t = 0 and the leading order is determined by the
linear singular differential equation:

$$A\ddot{x} = \bar{A}\left(\frac{4\dot{x}}{t} - \frac{6x}{t^2}\right)$$

Its indicial equation for solutions of the form $t^p$ is the quadratic equation:

$$Ap(p - 1) = \bar{A}(4p - 6)$$

If $A = \bar{A}$, the solutions are $p_1 = 2$ and $p_2 = 3$ (as should be the case). Consequently,
when $A/\bar{A}$ is close to one, the roots $p_1$ and $p_2$ are close to 2 and 3. But then for the original
equation all solutions $(x(t),\dot{x}(t))$ which are for t near zero close to the point (0,0) actually
pass through (0,0) and x(t) goes to zero at least as fast as:

$$t^{p_1} \quad \text{or} \quad t^{p_1}\ln(t)$$
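
The dependence of the roots on the model error can be tabulated directly from the indicial
equation $Ap(p-1) = \bar{A}(4p-6)$, where $\bar{A}$ denotes the nominal value of A. The sketch
below writes the equation in terms of the ratio of the nominal to the real coefficient; the
function name and the sample ratios are illustrative. Complex arithmetic is used because the
quadratic can have complex roots for some ratios, in which case the real part governs how
fast x(t) decays.

```python
import cmath

def indicial_roots(ratio):
    """Roots of p (p - 1) = ratio * (4 p - 6), with ratio = nominal A over real A."""
    b = -(1.0 + 4.0 * ratio)          # p^2 + b p + c = 0
    c = 6.0 * ratio
    disc = cmath.sqrt(b * b - 4.0 * c)
    return (-b - disc) / 2.0, (-b + disc) / 2.0

exact = indicial_roots(1.0)           # model matches the real system
near = indicial_roots(0.99)           # one percent model error
for ratio in (0.9, 0.99, 1.0, 1.01, 1.1):
    print(ratio, indicial_roots(ratio))
```

For a ratio of exactly one the roots are 2 and 3, and for a one percent error they move
only slightly, in line with the statement above.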

Remark: If $p_1 < 2$ then $\ddot{x}(t)$ may become unbounded as $t \to 0$ and forces become
unbounded. This can be fixed by choosing the trajectory patterns so that the acceleration
also vanishes at t = 0. For instance:

$$\phi(a_1,a_2,t) = a_1 t^3 + a_2 t^4$$

Then the roots will be close to 3 and 4.

Constraints

We next illustrate the Trajectory Pattern Method in the presence of constraints. There
are basically two kinds of constraints. Holonomic and nonholonomic constraints give
relations between the variables in the state-space and lessen the dimension of the state-space.
We will discuss these briefly at the end of this section. The constraints we consider first are
those which reduce the dimension of the input-space. Such constraints can for example
occur in vibrational systems with finite dimensional models due to the absence of actuation
in some degrees of freedom. Here the segments are divided into smaller segments which are
considered to be rigid. At most of the joints between these segments one cannot exert any
actuating torque. The joints then fall into two classes: ones where one can exert torque
and ones where one cannot. The dimension of the input-space is then less than the
dimension of the configuration space.

We  assume  that  the  equations  are  again  second  order,  but  that  the  control  vector  u  is  of 
lower  dimension  than  the  configuration  space: 

A{  x,x)x  +  B(  x,x)  =  (q) 

Here  u  is  in  the  m-dimensional  input  space.  Let  us  assume  that  the  configuration  space 
C  =  Rn.  We  then  write  any  vector  x  €  C  in  terms  of  its  m-dimensional  component  xx  and 
its  n  —  m  -dimensional  component  x2: 

We  then  have  the  constraint  equation: 

A2,i(x,x)x1  +  A2,2(x,x)x  2  +  B2(x,x)  =  0 

Contrary to the previous case, the actual construction of trajectories is a problem. If we now
assume that the matrix $A_{2,2}$ is invertible, then this constraint equation can be considered as
a differential equation for $x_2$ driven by $x_1(t)$ and its derivatives, which can then in principle
be solved for $x_2(t)$. Therefore, given a trajectory $x_1(t)$ and initial conditions
$(x_2(t_0), \dot{x}_2(t_0))$, the required computed torque for the system to follow the trajectory
$x_1(t)$ is:

$$u(t) = A_{1,1}(x(t),\dot{x}(t))\,\ddot{x}_1(t) + A_{1,2}(x(t),\dot{x}(t))\,\ddot{x}_2(t) + B_1(x(t),\dot{x}(t))$$

For the implementation of the Trajectory Pattern Method, two related issues need to be
addressed. The first is that one may also wish to control the motion of $x_2(t)$. The second is
that it is in general impossible to determine $x_2(t)$ exactly, let alone algebraically, given $x_1(t)$.
However, within many classes of systems, approximate explicit solutions can be obtained
using algebraic techniques. There is always a trade-off between the length of the time interval
over which the solution is approximated and the degree of accuracy of the approximation.

In most cases of interest, the constraint equation is predominantly linear in $x_2$, because
$x_2$, the "uncontrolled" variable, should be kept small. If that is really impossible, most
engineering designs would in fact add more actuators. If the constraint equation is linear,
the Trajectory Pattern Method then proceeds by algebraically solving it. If for instance
$x_1(t)$ is given as a Fourier series with a fixed period, one determines the Fourier series
expansion of $x_2$ using the same period. The effectiveness of such a procedure for practical
purposes depends on the extent to which this algebraic procedure can be numerically
implemented. Since the structure of the model equations is known, in many instances this can
be done in real time [10]. Granted this procedure, the motion of $x_2$ can then be controlled by
choosing trajectory patterns for $x_1$.

Of course, if the constraint differential equation has constant coefficients, explicit formulae
for $x_2$ in terms of $x_1$ are easily obtained.
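
As a minimal sketch of the constant-coefficient case, suppose (as an illustrative assumption, not a form given in the text) that the scalar constraint equation reads $a_{21}\ddot{x}_1 + a_{22}\ddot{x}_2 + b\,\dot{x}_2 + k\,x_2 = 0$ with nonzero b and k. If $x_1$ is a Fourier series with frequency $\omega$, each harmonic of the periodic $x_2$ then follows from a single division. All function names and numerical values below are hypothetical.

```python
import numpy as np

def harmonics(coeffs, omega):
    """Return i*n*omega for n = -N..N, matching a coefficient array of length 2N+1."""
    N = (len(coeffs) - 1) // 2
    return 1j * omega * np.arange(-N, N + 1)

def solve_x2_coeffs(c1, omega, a21, a22, b, k):
    """Fourier coefficients of the periodic x2 solving the ASSUMED constraint
    a21*x1'' + a22*x2'' + b*x2' + k*x2 = 0."""
    s = harmonics(c1, omega)              # d/dt multiplies harmonic n by i*n*omega
    return -a21 * s**2 * c1 / (a22 * s**2 + b * s + k)

def evaluate(coeffs, omega, t, deriv=0):
    """Evaluate the deriv-th time derivative of a real Fourier series at times t."""
    s = harmonics(coeffs, omega)
    weights = coeffs if deriv == 0 else coeffs * s**deriv
    return np.real(np.exp(np.outer(t, s)) @ weights)

# Conjugate-symmetric coefficients (n = -2..2) describe a real-valued x1(t).
c1 = np.array([0.1 - 0.2j, 0.5j, 1.0, -0.5j, 0.1 + 0.2j])
omega, a21, a22, b, k = 2.0, 0.3, 1.5, 0.4, 2.0
c2 = solve_x2_coeffs(c1, omega, a21, a22, b, k)

# Substitute both series back into the constraint at a few sample times.
t = np.linspace(0.0, 3.0, 7)
residual = (a21 * evaluate(c1, omega, t, 2) + a22 * evaluate(c2, omega, t, 2)
            + b * evaluate(c2, omega, t, 1) + k * evaluate(c2, omega, t, 0))
```

The residual array measures how well the computed $x_2$ satisfies the assumed constraint; because the map from $x_1$ coefficients to $x_2$ coefficients is fixed once the model is known, it can be prepared ahead of time.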

In the case of holonomic and nonholonomic constraints, again one has to synthesize
trajectories that satisfy these constraints. The method of choice is direct elimination of
extraneous variables if possible. Otherwise, one can make local approximations using
techniques such as implicit differentiation.

3.  ADAPTIVE  PARAMETER  BASED  UNCERTAINTY  SCHEMES 

Consider  a  class  of  second  order  differential  control  systems  with  full  control: 

$$A(x,\dot{x},p)\,\ddot{x} + B(x,\dot{x},p) = u$$

Here we assume that the parameter p ranges over a finite-dimensional space of system
parameters: masses, lengths, moments of inertia, etc. In particular, the exact value of the
parameters is usually not known and the object is to devise adaptive control schemes that
compensate for this uncertainty. There are two types of approaches to this. The first type
consists of devising a learning scheme which continuously makes best guesses for the values
of the parameter. Such schemes are usually based on the observation that if we take any
time interval $[t_0, t_1]$ on which we assume that the values of the parameters do not change
and give any input u(t), then the values of the parameters can be determined or estimated by
observing the resulting motion x(t). If we assume that the differential equation depends on
the parameters in an algebraic manner, this leads to algebraic equations which can be solved
to give parameter values which are consistent with the observed motion.

This approach can be implemented in the Trajectory Pattern Method as follows. Assume
a nominal value $\bar{p}$ for the parameter and consider a given trajectory pattern $\phi(a,t)$. Two
distinct situations are usually encountered: regulation and tracking. By regulation we mean
that eventually we want an end goal to be achieved. We assume that the trajectory patterns
have these end goals and the object then is to make the trajectory parameters constant. By
tracking we mean that a specific trajectory pattern described by trajectory parameter a(0)
needs to be followed.

First  consider  the  case  of  regulation.  Apply  Trajectory  Pattern  Control  with 

$$\bar{A}(x,\dot{x}) = A(x,\dot{x},\bar{p})$$

$$\bar{B}(x,\dot{x}) = B(x,\dot{x},\bar{p})$$

If we now monitor the motion x(t) and compute the trajectory parameter a(t), then we have
to consider two cases. The first one is that a(t) is constant. In that case the system follows
the predesigned trajectory and even if $p \neq \bar{p}$ there is no reason for action. The second
case is that a(t) does vary. Then certainly $p \neq \bar{p}$ and one has to determine values for p
which are consistent with the observed variation in a(t). If a(t) depends algebraically on
$(x(t), \dot{x}(t))$ then one again obtains algebraic equations for the parameter p. If these equations
can be solved quickly, the nominal value $\bar{p}$ for p can be replaced by a more accurate value.
Applying the corresponding trajectory control then results in the designed trajectories
actually being followed. This scheme is usually too optimistic because of measurement errors
as well as the complexity of the algebraic equations. Therefore this scheme needs to be
combined with an updating method for $\bar{p}$, for instance one based on Newton's method. The
main point here is that $a(t) = a(t,p,\bar{p})$ and we want a(t) to tend to a constant, i.e.

$$\partial_t a(t,p,\bar{p}) = 0,\quad \partial_t^2 a(t,p,\bar{p}) = 0,\quad \ldots$$

Now  explicit  equations  can  be  obtained  for: 

$$\partial_t a(t,p,\bar{p}),\quad \partial_t^2 a(t,p,\bar{p}),\quad \ldots$$

in terms of $p$ and $\bar{p}$. If we divide the time interval into small enough intervals and
choose a sequence of updates $\bar{p}(i)$, constant in the i-th interval, we can determine an equation
for $\bar{p}(i+1)$ from:

$$\partial_t a(t_i) = \partial_p \partial_t a(t_i, \bar{p}(i), \bar{p}(i))\,(\bar{p}(i+1) - \bar{p}(i))$$

Here  the  left-hand  side  is  explicitly  measured,  while  the  partial  derivatives  with  respect  to 
p  have  to  be  computed  analytically.  More  equations  can  be  obtained  by  taking  higher  time 
derivatives.  For  the  case  of  tracking,  these  equations  are  supplemented  by  the  requirement 
that  a(t)  tend  to  a(0). 
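
The update step can be sketched as a small linear solve. The sketch below assumes that the measured drift $\partial_t a(t_i)$ and the analytically computed sensitivity matrix $\partial_p \partial_t a$ are supplied by the caller. The function name and the toy numbers are hypothetical, and least squares is used only as one reasonable way to handle non-square or noisy systems.

```python
import numpy as np

def update_nominal(p_bar, drift, sensitivity):
    """One Newton-type update of the nominal parameter.

    Solves drift = sensitivity @ (p_next - p_bar) for p_next in the
    least-squares sense and returns p_next.
    """
    step, *_ = np.linalg.lstsq(sensitivity, drift, rcond=None)
    return p_bar + step

# Toy check: if the drift is exactly linear in the parameter error,
# a single update recovers the true parameter.
p_true = np.array([2.0, 0.5])
p_bar = np.array([1.8, 0.6])
S = np.array([[1.0, 0.2], [0.1, 3.0], [0.5, -1.0]])   # hypothetical sensitivities
drift = S @ (p_true - p_bar)
p_next = update_nominal(p_bar, drift, S)
```

Stacking rows from higher time derivatives, as the text suggests, simply adds rows to the sensitivity matrix and the drift vector.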

The major advantage of the Trajectory Pattern Method in this application is that the
structure of these equations is determined in advance. Therefore the equations are available
in exact form, do not introduce numerical errors during computation, and are available
for automated control in preprocessed form.

The other approach is based on the following idea, which we illustrate by its first-order
scheme. Again assume a nominal value $\bar{p}$ for p and assume that p is close to $\bar{p}$: $p = \bar{p} + \Delta p$.
Choose a trajectory pattern $\phi$ and apply as control:

$$u = u^* + \Delta u$$

Here $\Delta u$ still needs to be determined. The basic idea is that if we choose $\Delta u$ to be zero in
an initial time interval, we can learn the error $\Delta p$ by comparing the actual path $x_a$ and the
desired trajectory pattern $x_d$. We now design $\Delta u$ on the remainder of the time interval by
planning that the error between the actual path and the desired path tends to zero in a
prescribed manner, and use computed torque.

Remark: In the previous approach we chose to update $\bar{p}$ to $\bar{p} + \Delta p$ and apply the
trajectory pattern control. In both approaches, a practical way to determine $\Delta p$ is from $\Delta a$.

We  will  illustrate  this  technique  in  a  nonlinear  example  with  constant  coefficients: 

$$A\ddot{x} + B\dot{x}^2 + C x = u$$

Given the initial conditions, the trajectory pattern $\phi$ selects our desired trajectory $x_d(t)$.
Assume that the nominal values for the parameters (A, B, C) are $(\bar{A}, \bar{B}, \bar{C})$:

$$A = \bar{A} + \Delta A,\quad B = \bar{B} + \Delta B,\quad C = \bar{C} + \Delta C$$

Denote at any time the actual trajectory by $x_a(t)$ and the error by e(t):

$$e(t) = x_a(t) - x_d(t)$$

If we write u as $u^* + \Delta u$ then we obtain the linearized equation:

$$\Delta A\,\ddot{x}_d + \Delta B\,\dot{x}_d^2 + \Delta C\,x_d + \bar{A}\,\ddot{e} + 2\bar{B}\,\dot{x}_d\,\dot{e} + \bar{C}\,e = \Delta u$$

In the beginning of the motion we set $\Delta u = 0$. By observing $x_a(t)$, we compute e(t) and
compute $(\Delta A, \Delta B, \Delta C)$. Now choose for the remainder of the time interval a trajectory
pattern for e(t). Then this equation determines the adaptive feedback $\Delta u(t)$.
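
The identification step with $\Delta u = 0$ can be sketched as a linear least-squares problem: sampling the linearized equation at several times gives one linear equation per sample in the unknowns $(\Delta A, \Delta B, \Delta C)$. The sketch assumes that the quadratic term acts on the velocity and that samples of $x_d$, e and their first two derivatives are available (for instance from differentiated measurements). The function and argument names are hypothetical.

```python
import numpy as np

def estimate_deltas(xd, xd_dot, xd_ddot, e, e_dot, e_ddot, A_bar, B_bar, C_bar):
    """Least-squares estimate of (dA, dB, dC) from the linearized equation
    dA*xd'' + dB*xd'^2 + dC*xd + A_bar*e'' + 2*B_bar*xd'*e' + C_bar*e = 0,
    sampled at several times (all signal arguments are 1-D arrays)."""
    # Each row holds the coefficients of (dA, dB, dC) at one sample time.
    regressors = np.column_stack([xd_ddot, xd_dot**2, xd])
    # Known terms moved to the right-hand side.
    rhs = -(A_bar * e_ddot + 2.0 * B_bar * xd_dot * e_dot + C_bar * e)
    deltas, *_ = np.linalg.lstsq(regressors, rhs, rcond=None)
    return deltas
```

At least three samples with linearly independent regressor rows are needed; using more samples averages out measurement noise, which the text identifies as a practical limitation.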

The system in this example:

$$A\ddot{x} + B\dot{x}^2 + C x = u$$

has some additional structure. If x is given by a Fourier series of size N with frequency $\omega$:

$$x(t) = \sum_{n=-N}^{+N} x_n \exp(i n \omega t)$$

then u(t) is given as a Fourier series of size 2N.

Consequently, if $x_d(t)$ is given as a Fourier series of width N and we plan e(t) to be also
a Fourier series of width N, then $\Delta u(t)$ is also given as a Fourier series of width 2N.
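
The width-doubling claim can be checked numerically: squaring a Fourier series corresponds to convolving its coefficient array with itself, so 2N+1 coefficients become 4N+1. The following is a short sketch, assuming the quadratic term is $B\dot{x}^2$ and using hypothetical names and values.

```python
import numpy as np

def control_coeffs(x_coeffs, omega, A, B, C):
    """Fourier coefficients (n = -2N..2N) of u = A*x'' + B*x'^2 + C*x
    for x given by coefficients x_coeffs (n = -N..N)."""
    N = (len(x_coeffs) - 1) // 2
    s = 1j * omega * np.arange(-N, N + 1)
    v = s * x_coeffs                          # coefficients of x'
    linear = A * s**2 * x_coeffs + C * x_coeffs
    u = B * np.convolve(v, v)                 # product of two series = convolution
    u[N:3 * N + 1] += linear                  # harmonics -N..N sit inside -2N..2N
    return u

# Example with N = 2: five coefficients in, nine coefficients out.
x_coeffs = np.array([0.1 - 0.2j, 0.5j, 1.0, -0.5j, 0.1 + 0.2j])
omega, A, B, C = 1.5, 1.0, 0.4, 2.0
u = control_coeffs(x_coeffs, omega, A, B, C)
```

Because the coefficient map is purely algebraic, the feed-forward control for a whole family of trajectory patterns can be tabulated in advance.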

4.  DISCUSSION 

In this paper we have demonstrated the basics of the Trajectory Pattern Method. It
is a feed-forward method and is based on an accurate knowledge of the system and on the
designer's choice of desired trajectories. We have shown that robust long-time control and
accurate aiming are possible using focusing trajectories. We also showed that the Trajectory
Pattern Method extends so as to incorporate parameter uncertainty compensation schemes.

For practical implementations of this method one has to consider the effects of small
delays in the control. Such delays occur because measurements have to be made and
computations have to be performed. We mention briefly to what extent the method is robust
also for small time delays. From the theory of smooth differential equations with delay,
it is known that as the delay tends to zero, solutions converge to those of the differential
equation with delay zero. Therefore, the Trajectory Pattern Method is also robust for small
enough time delays, except in the neighborhood of focusing points, where the differential
equation becomes singular; very near the focusing point, the control after delay may well point
in the opposite direction of what is desired. Near such focusing points the system behaves
linearly and the control can be switched to simple linear controls such as PID control.
A  successful  implementation  of  the  Trajectory  Pattern  Method  generally  has  a  strong
computational  component.  This  explains  why  this  method  was  only  recently  developed.  A 
careful  selection  of  the  trajectory  patterns  tremendously  reduces  the  on-line  computational 
component  [1,  3,  8,  10,  12]. 

Introduction

Advances  in  computer  technology  have  enabled  scientists  and 
engineers  to  analyze  increasingly  sophisticated  problems. 
Supercomputer  access  has  made  three-dimensional  (3D)  finite  element 
modeling  a  tenable  approach  to  problems  once  simplified  to  one  or 
two  dimensions  for  computational  purposes.  Such  refinements  in 
computational  methods  have  compelled  the  technical  community  to 
perform  prototype  testing  of  more  complex  problems  for  the  purposes 
of validating these computational methods. A major obstacle to
overcome in this process is the inability to absorb large amounts of
output from analyses of complex 3D problems. Visualization
techniques,  which  are  able  to  graphically  display  analytical  output 
in  a  concise  manner,  have  become  absolutely  necessary  for 
effectively  handling  these  analyses.  The  U.S.  Army  Corps  of 
Engineers  Waterways  Experiment  Station  (WES)  has  been  actively 
employing  state-of-the-art  visualization  techniques  in  support  of 
its  research  mission.  This  paper  discusses  a  few  areas  in  which 
visualization  techniques  have  proven  beneficial  in  projects 
sponsored  by  the  U.S.  Department  of  Defense,  Defense  Nuclear  Agency 
(DNA)  and  also  discusses  details  of  visualization  hardware  and 
software  utilized  for  these  analyses. 

Background

DNA  supports  research  concerning  the  responses  of  structures 
to  dynamic  loads  caused  by  nuclear  detonations.  This  research 
typically  involves  high  pressure  blast  loading,  soil-structure 
interaction,  nonlinear  deformations  and  failure  mechanisms  such  as 
buckling. Results of DNA-sponsored research have led to an
improvement in the ability to analyze these types of problems;
however, certain structural response phenomena have not been
satisfactorily  analyzed  by  even  the  most  complex  finite  element 
codes.  There  will  continue  to  be  a  need  for  research  to  improve 
computational  methods  in  many  areas  of  complex  structural  response. 

WES  has  been  active  in  analytical  projects  for  DNA  which 
address  their  interests  in  structural  response  phenomena.  This 
research  has  included  a  high  precision  test  program  so  that  the 
response  of  test  structures  could  be  used  to  assess  the  accuracy  of
the  finite  element  calculations.  Visualization  methods  have  proven 
very  beneficial  and  in  some  cases  indispensable  for  3D  finite 
element  analysis  of  these  problems.  Visualization  methods  have 
become,  and  will  become  even  more  so,  an  integral  part  of  the 
analytical  process. 

Visualization  Techniques 

Visualization  techniques  are  graphical  means  of  displaying 
large  amounts  of  data  in  a  manner  that  can  be  easily  and  accurately 
interpreted. Techniques such as rendering and animation lend
themselves very well to displaying complex 3D finite element
models  and  dynamic  analyses  involving  the  interaction  of  moving 
parts.  Many  innovative  visualization  methods  have  been  advanced  in 
recent  years.  Coupled  with  a  graphics  workstation,  visualization 
methods  can  dramatically  improve  post-processing  activities  of 
computer  analyses  using  finite  element  methods.  In  the  following 
sections  several  benefits  of  using  up-to-date  visualization 
techniques  are  discussed.  Unfortunately,  because  visualization 
methods  are  primarily  in  color,  visual  examples  of  these  methods 
could  not  be  included  in  this  paper. 

Increased  Efficiency 

The  use  of  advanced  color  graphics  to  display  data  can 
substantially  reduce  the  time  required  to  understand  analytical 
results  allowing  the  researcher  to  more  thoroughly  address  the 
problem  at  hand.  This  is  especially  true  for  analytical  methods 
which  produce  considerable  output.  In  a  3D  analysis  of  a  complex 
dynamic  soil-structure  interaction  problem  on  a  supercomputer, 
finite  element  models  typically  contain  less  than  60,000  elements 
to  keep  runtimes  reasonable,  i.e.  less  than  10  cpu  hours.  Computer 
runs  such  as  this  can  produce  data  dump  files  which  total  several 
gigabytes.  Using  older  post-processing  methods  a  researcher  would 
have  to  ignore  much  of  this  output  and  limit  post-processing  to  a 
few  chosen  areas  and  time  states.  Older  methods,  for  example  black 
and  white  strain  contour  lines  overlaid  on  a  view  of  the  finite 
element  mesh,  must  often  be  examined  meticulously  just  to  see 
trends  in  an  analysis.  Advanced  visualization  methods,  using  color 
fringing  to  represent  the  same  data,  allow  the  researcher  to  see 
trends  without  getting  lost  in  details.  Because  it  is  easier  to 
spot  trends,  better  understanding  of  the  performance  of  the  model 
as  a  whole  is  attained. 

With  improved  efficiency,  it  is  not  necessary  to  impose 
constraints  that  limit  post-processing  to  a  few  specified  model 
locations  and  points  in  time  simply  because  it  is  too  cumbersome  to 
interpret  results.  It  takes  only  one  glance  to  see  the  color  coded 
location  of  peak  response  of  the  model  and  a  second  glance  at  the 
color  bar  to  know  the  value.  For  complex  geometry  and  responses, 
older  methods  of  data  processing  can  require  several  minutes  of 
examination  just  to  find  the  locations  of  peak  response.  It  is
obvious  that  as  analytical  methods  advance  to  address  more  complex 
problems,  the  improved  efficiency  attainable  with  advanced 
visualization  techniques  will  be  a  necessity. 

Improved  Modeling 

A  better  understanding  of  the  performance  of  a  model  enhances 
efforts  to  improve  the  model.  A  benefit  of  advanced  visualization 
is  the  ability  to  recognize  modeling  errors  or  the  effect  of 
modeling  techniques  used  to  simplify  the  problem.  For  example, 
often  analysts  purposely  increase  element  size  at  model  locations 
away  from  points  of  interest  in  order  to  reduce  the  problem  size 
and  decrease  run  time.  In  the  finite  element  code  DYNA3D 
(Reference 1) this change to coarser meshing can be achieved using
a  tied  surface  between  the  regions  of  fine  meshing  and  coarser 
meshing  without  using  transition  elements.  Before  using  advanced 
visualization  the  effect  of  this  technique  on  the  model  performance 
was  never  fully  appreciated.  In  problems  involving  shock  waves 
passing  through  soil  the  shock  front  was  significantly  smeared  and 
distorted  at  the  location  of  the  mesh  discontinuity  and  in  regions 
of  coarser  meshing.  The  degree  of  smearing  and  distortion  was 
easily  visible  in  color  fringe  plots  of  soil  stress.  Using  older 
visualization  methods  the  effect  of  this  meshing  technique  did  not 
stand  out  in  the  results.  This  experience  highlighted  the 
importance  of  ensuring  that  such  mesh  discontinuities  are  located 
well  away  from  points  of  interest. 

Improved  Visualization 

As  the  name  implies,  advanced  visualization  methods  help  an 
analyst  to  visualize  the  subject  of  an  analysis  and  the  phenomena 
involved.  This  is  especially  helpful  in  a  dynamic  analysis  which 
includes  large  deformations  and  structural  interaction  between 
different  parts.  Animation  is  a  great  tool  for  checking  model 
performance  and  observing  how  the  model  simulates  the  physics  of  a 
problem.  Without  animation  one  must  mentally  reconstruct  the 
sequence of events leading up to the final results. Looking at
individual frames from a dynamic analysis provides much information
for doing this, but putting these frames into motion yields much
more. Animation allows one to visualize the time-dependent aspect
of  data  in  a  dynamic  analysis.  Phenomena  such  as  buckling  and 
vibration  can  be  assessed  intuitively  because  animation  plays  back 
the  data  in  a  natural  format.  There  have  been  several  occasions  in 
which  animation  answered  questions  that  had  been  left  unanswered  by 
examining  analytical  results  in  a  piecemeal  fashion. 

Improved Communication

Visualization  methods  facilitate  better  transfer  of 
information  in  the  technical  community.  One  of  the  most  difficult 
jobs  for  the  analyst  is  to  share  information  with  someone  who  is 
not familiar with the details of an analysis. Advanced
visualization can present a large amount of information in a manner
that is concise and not visually overwhelming. It therefore
requires much less effort to understand a concept or a
calculational result.

Visualization  methods  at  WES 

At  WES,  structural  engineers  from  the  Structural  Mechanics 
Division  (SMD)  are  assisted  in  developing  methods  to  visualize 
analytical  results  by  an  interdisciplinary  visualization  team  of 
computer  scientists  and  engineers  within  the  Information  Technology 
Laboratory  (ITL).  In  the  following  sections  an  example  of  how 
visualization  methods  were  developed  and/or  modified  for  a  specific 
DNA  project  will  be  described  in  detail.  The  project  involved  a 
DYNA3D  finite  element  analysis  of  a  dynamic  soil-structure 
interaction  and  included  large  deformations  and  buckling  of  a 
buried  structure.  The  nature  of  the  problem  required  a  detailed 
structure  and  a  large  island  of  soil.  The  resulting  finite  element 
model  was  relatively  large,  approximately  50,000  nodes,  and 
required  from  2  to  4  hours  of  CPU  time  to  run  on  a  Cray  Y-MP  at 
WES. 


The  initial  goals  of  the  visualization  were  loosely  defined. 
Clearly,  the  project  demanded  some  type  of  finite  element 
compatible  program  capable  of  handling  time-dependent  data.  In 
addition,  the  engineers  desired  a  visual  product  that  could  serve 
as  both  an  effective  workroom  tool  as  well  as  a  means  of  obtaining 
presentation  quality  graphics. 

The  visualization  team  explored  several  alternatives  in  an 
effort  to  satisfy  this  confluence  of  goals.  The  first  option 
explored  was  attempting  to  run  DYNA3D  on  a  graphics  workstation. 
However,  the  extent  of  the  DYNA3D  data  both  in  the  number  of  files 
needed  (two  files  per  state  dump  with  50  state  dumps)  and  the  size 
of  the  files  (approximately  3.6  megabytes  each)  made  this  approach 
untenable.  As  limitations  with  other  alternatives  continued  to 
surface,  it  became  apparent  that  a  distributed  computing 
environment  was  essential  to  gain  the  computational  power,  disk 
capacity,  and  graphics  capabilities  necessary  for  the  task.  Multi- 
Purpose  Graphics  System  (MPGS),  a  software  package  produced  by  Cray 
Research  Inc.,  emerged  as  the  candidate  addressing  the  largest 
number  of  the  project's  requirements.  Residing  on  the 
supercomputer  and  a  local  graphics  workstation,  MPGS  provided  the 
distributed  computing  environment  deemed  requisite  for  the  project. 
Further,  because  it  makes  no  assumptions  about  the  geometry,  the 
software  can  be  used  with  a  variety  of  computer  programs,  including 
those  for  finite  element  analysis. 

Hardware  Configuration 

The  hardware  required  to  visualize  the  finite  element  analysis 
consisted of two distributed components, a Silicon Graphics
workstation and a Cray Y-MP supercomputer, both located at WES in
ITL. The workstation and the supercomputer shared the
computational workload via MPGS to produce the 3D graphical images
for animation. Two 25-MHz MIPS processors enabled the workstation
to perform the local graphics manipulations; the workstation
display has a resolution of 1280 by 1024 pixels with 24 bit-planes
of  color.  The  workstation  and  the  supercomputer  communicated  over 
a  standard  Ethernet  network  connection  using  TCP/IP;  MPGS  uses 
sockets  to  move  data  between  the  workstation  and  the  supercomputer. 
These  network  connections  permitted  compute-intensive  tasks  to  be 
performed  on  the  supercomputer  and  the  results  downloaded  to  the 
workstation from within MPGS.


Software  Issues 


Data  Format 

MPGS  can  process  up  to  three  categories  of  data  for 
visualization.  First,  the  required  geometry  data  describe  nodes, 
lines,  elements,  and  solids.  The  geometry  data,  which  may  comprise 
several  files,  represent  only  the  initial  geometry  of  the 
structure. Second, the optional scalar data contain multiple scalar
values for each node. Third, the optional vector data contain one 3D
vector value for each node. For time-dependent scalar or vector data,
multiple  files  are  necessary,  each  representing  a  state  dump  of 
output  data. 

The  DYNA3D  binary  data  files  requiring  translation  to  MPGS 
format,  each  approximately  3.6  megabytes  in  size,  contain  first  a 
time  stamp  and  control  information,  then  the  geometry  data.  The 
time-dependent  data  follow,  including  nodal  coordinates,  and  scalar 
values.  Multiple  time-dependent  data  sets  cannot  be  contained  in 
one  file.  For  each  new  state  dump,  DYNA3D  opens  a  new  file  and 
writes  the  nodal  and  scalar  data  for  that  state  dump. 

Data  Translation 


Cray  Research  Inc.  furnishes  software  that  translates  DYNA3D 
data  files  to  suitable  MPGS  format.  Specifically,  three 
translators  generate  the  geometry,  scalar,  and  vector  data  files, 
respectively.  The  Cray  Research  translators,  however,  addressed 
only limited cases and required two principal modifications (Figure
1). First, each translator was modified to read and convert DYNA3D
data  residing  in  multiple  files.  Second,  each  of  the  scalar  and 
vector  translators  processed  only  brick  elements,  neglecting  shell 
and  beam  elements.  Both  translators  were  modified  to  also  process 
shell  elements.  Enhancements  to  support  beam  elements  are 
currently  being  pursued. 

VISUALIZING  MISTY  PORT  -  HARDWARE  AND  SOFTWARE  COMPONENTS

Viewing  Model  Results

Once  the  initial  geometry  was  obtained,  it  became  necessary  to 
isolate  specific  parts  of  interest  to  the  animation  and  analysis. 
As  detailed,  MPGS  allows  multiple  element  types  with  attributes 
tied  to  each  part.  Exploiting  this  feature  to  "turn  off"  all 
elements  except  those  representing  the  structure  permitted 
deformations  and  stresses  to  be  viewed  clearly.  To  view 
deformations,  time-dependent  displacement  computations  were 
performed  on  the  supercomputer,  converted  to  graphical  elements, 
and  downloaded  to  the  workstation  via  the  built-in  MPGS 
communications features.

To  further  examine  geologic  influences  on  the  structure, 
vertical,  horizontal,  and  shear  stress  contours  were  computed  and 
rendered.  The  rendering  of  the  stresses  illustrated  problems  with 
tied  surfaces  connecting  zones  of  fine  discretization  with  coarse 
discretization  in  the  soil  adjacent  to  the  structure.  The 
information  led  to  rediscretizing  the  soil-structure  system  and 
moving  the  tied  surfaces  away  from  the  structure.  This  resolved 
problems  with  the  buckling  of  the  structure  due  to  approximations 
associated  with  the  tied  surfaces.  In  other  analyses, 
visualization  has  similarly  proven  useful  not  only  for  interpreting 
results  but  also  for  highlighting  problem  areas  with  the  finite 
element model.

Conclusions 


Through  the  cooperative  endeavor,  the  structural  engineers 
realized  many  of  their  goals,  some  initially  unarticulated. 
Examination  of  deformed  shapes  and  stresses  from  the  analysis  in  a 
dynamic  presentation  provided  details  of  soil-structure  interaction 
which  had  never  been  visualized  before.  Visualization  provided  the 
opportunity  to  see  the  characteristics  of  shear  friction  between 
the  soil  and  the  structure  and  the  role  of  the  soil  in  determining 
the  buckling  mode  of  the  structure.  This  demonstrates  that  complex 
structural  systems  can  be  more  easily  solved  when  visualization 
tools  are  provided  to  assess  numerical  models  and  analytical 
results.

ABSTRACT. The trajectory of an artillery projectile can rarely be
characterized by a single set of meteorological measurements (MET message);
however,  current  practice  is  to  use  a  single  MET  message  to  derive  the  final 
aiming  adjustments  necessary  to  hit  the  target.  Examining  the  meteorological 
effects  upon  projectiles  dynamically  through  modeling  allows  a  better 
understanding  of  the  probability  of  hitting  the  target  the  first  time.  A 
criterion  is  developed  for  utilizing  multiple  MET  messages  and  applying  a  solo 
MET  message  at  particular  projectile  trajectory  locations  to  obtain  a  high 
probability  of  success.  The  Advanced  Battlefield  Environment  Artillery  Model 
(ABEAM),  a  U.S.  Army  Atmospheric  Sciences  Laboratory  version  of  the  U.S.  Army 
Ballistic  Research  Laboratory  General  Trajectory  (GTRAJ)  model,  is  the 
advanced  model  environment  for  developing  this  examination.  ABEAM  is  used  in 
prototyping  new  artillery  meteorological  techniques  for  current  and  future 
artillery  systems.  Preliminary  results  reveal  significant  aiming  adjustment 
differences  computed  from  a  proposed  MET  correction  and  the  current  MET 
doctrine.  Since  multiple  MET  messages  allow  a  more  realistic  description  of 
the  battlefield  atmosphere,  the  proposed  methodology  can  significantly  improve 
the  accuracy  of  artillery  predicted  fire. 

INTRODUCTION. For each mission the field artillery has to quickly
respond  with  accurate,  massed,  and  surprise  fire.  A  highly  efficient  fire  for 
engagement  at  extended  ranges  demands  a  modern  artillery  fire  direction.  The 
U.S.  Army  Atmospheric  Sciences  Laboratory  (ASL)  is  using  applied  mathematics 
and  innovative  computing  algorithms  to  demonstrate  the  possible  artillery 
predicted  fire  accuracy  improvement.  This  note  presents  a  brief  description 
of  a  modern  meteorological  (MET)  adjustment  technique;  artillery  accuracy 
comparisons;  MET  scenario  results;  and  simulated  paired  statistics  of 
significant accuracy improvements.

The current MET aiming adjustment for predicted fire is not sufficient
most of the time when applied to future extended target ranges. Since the
atmospheric  conditions  are  not  homogeneous  along  the  projectile  trajectory, 
the  current  method  of  adjusting  artillery  fire  with  MET  data  collected  from  a 
dedicated  station  is  no  longer  valid.  With  these  extended  ranges  the  expected 
MET  bias  increases  and  continues  to  be  a  major  contributor  in  the  total 
artillery  error  budget  (see  Lillard  et  al,  1990).  The  lengthening  flight  time 
has  significantly  increased  the  unit  MET  corrections.  To  continue  accurate 
artillery  fire  capability  at  the  new  ranges  an  improved  MET  final  aiming 
adjustment  is  required.  ASL  is  researching  new  procedures  for  allowing  a  more 
realistic  battlefield  atmosphere  into  the  artillery  fire  direction  center.  A 
module  that  allows  multiple  MET  message  inputs  and  solo  message  application  at 
particular trajectory portions was incorporated into the U.S. Army Ballistic
Research Laboratory GTRAJ model. This modification provides an advanced model
environment for testing the worth of proposed MET adjustment techniques. The
new configuration, ABEAM, was validated and tested by Blanco and Edwards
(1991).

The  Project  PASS  (Prototype  Artillery  Subsystem)  field  data  (see  Blanco 
and  Traylor,  1976)  is  utilized  for  the  testing  of  proposed  MET  adjustment 
procedures.  Because  of  time  constraints  only  2  of  the  20  data  days  are  used 
to  present  results  from  the  proposed  MET  adjustment  as  compared  to  the  current 
MET  correction.  These  2  selected  data  days  contain  the  most  variable  weather 
conditions.  A  preliminary  conclusion  presents  the  expected  artillery  accuracy 
improvement  afforded  by  proposed  multiple  MET  messages  and  selection  of  a  solo 
MET  message  along  the  launch,  apogee,  and  target  projectile  flight  location. 

MODERN  ARTILLERY  MET  ADJUSTMENT.  In  aiming  artillery  the  fire  direction 
center  converts  weapon  and  target  location  into  firing  data  and  utilizes  a 
ballistic  simulation  model  that  requires  MET  data  input  for  the  final  firing 
angles  solution.  A  single  computer  MET  message  (METCM)  provides  the 
atmospheric description. The header in figure 1 identifies the MET station
location and the body lists values for the following MET parameters: wind
direction, windspeed, virtual temperature, and atmospheric pressure. The raw
MET data are averaged into horizontal, vertically layered computer zones
listed in the first column. The last computer zone is line number 26 with the
bottom  and  top  at  19  and  20  km  above  the  surface.  The  current  fire  direction 
application  assumes  that  the  computer  MET  message  data  is  horizontally 
homogeneous  along  all  portions  of  the  projectile  trajectory.  This  is  not  a 
valid  assumption,  especially  at  extended  ranges. 

Figure 1. Artillery computer MET message.

Note  that  there  are  specific  MET  effects  along  different  portions  of  the 
projectile  trajectory.  For  example,  a  rocket  assisted  round  or  a  thrusting 
rocket  require  a  tailored  launch  cross  wind  adjustment.  This  effect  is 
propagated  throughout  the  whole  flight  time,  but  the  interaction  takes  place 
during initial motion and burnout time, the launch portion. The time from
burnout to the warhead event defines the coasting portion. And the time from
the warhead event to impact defines the target area. Depending on the
relative  MET  station  location  and  the  wind  flow,  the  dedicated  MET  message 
may  not  be  representative  of  the  launch,  coasting,  or  target  area  MET.  For 
this  reason  the  artillery  needs  a  modern  MET  message  that  can  provide  more 
realistic  data  along  these  trajectory  portions. 

Using  the  standard  munitions  aerodynamic  coefficient  tables  extended  to 
Mach  5  (see  Lieske,  1990)  to  represent  future  artillery  capability,  the  GTRAJ 
model  was  utilized  to  compute  four  projectile  trajectories.  Figure  2  details 
the  projectile  position  at  a  2-sec  flight  time  interval.  The  apogee  or 
coasting  area  for  the  15-km  trajectory  is  defined  as  that  portion  above  the  2 
km  where  the  projectile  spent  over  20  sec  flight  time.  Approximately  one-half 
of  all  extended  range  trajectory  flight  time  is  spent  in  the  apogee  portion. 
For  the  coasting  rocket  or  shell  the  most  important  MET  observations  are 
required  between  the  preapogee  and  postapogee  portion.  Normally,  windspeed, 
the  major  MET  contributor,  is  at  a  maximum  for  the  trajectory  during  apogee; 
thus,  the  MET  effect  has  its  best  chance  to  deviate  the  projectile  away  from 
the initial aim point at this apogee portion. And for the accurate delivery
of submunitions the fire direction must use realistic target area MET.
Parachute-delivered submunitions, chemical warheads, and wind gliding warheads
require  accurate  wind  measurements  in  the  target  area. 

Figure 2. Long range artillery trajectories.

Centralizing  all  available  battlefield  METCM  and  tailoring  a  METCM  for  a 
particular  trajectory  portion  provides  a  more  realistic  atmospheric 
description.  The  fire  direction  center  can  receive  multiple  computer  MET 
messages  and  select  the  most  representative  for  application  at  the  different 
portions  of  the  trajectory.  This  procedure  may  be  automated  in  the  fire 
direction  center  or  manually  performed  by  its  operator.  The  same  North 
Atlantic  Treaty  Organization  (NATO)  standardized  format  is  used  and  the  only 
change is to report all available METCMs to the fire direction center and
follow criteria in selecting an appropriate solo METCM for the corresponding
trajectory portion.
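
The selection step can be illustrated with a minimal sketch. This note does not state the selection criteria, so the nearest-station rule below is purely an assumption, and the station names and coordinates (in km) are hypothetical placeholders rather than values from the experiment.

```python
import math

def nearest_station(point, stations):
    """Return the name of the station closest to a ground point (km)."""
    return min(stations, key=lambda name: math.dist(point, stations[name]))

def select_solo_metcms(portions, stations):
    """Pick one (solo) METCM source station for each trajectory portion."""
    return {portion: nearest_station(pt, stations)
            for portion, pt in portions.items()}

# Hypothetical station positions and trajectory-portion ground points (km).
stations = {"STN_A": (0.0, 0.0), "STN_B": (9.0, 1.0), "STN_C": (18.0, -1.0)}
portions = {"launch": (0.5, 0.0), "apogee": (9.0, 0.0), "target": (18.0, 0.0)}

selection = select_solo_metcms(portions, stations)
print(selection)
```

An operational criterion would presumably also weigh message age and the wind flow between the station and the trajectory; distance alone is used here only to keep the sketch short.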

ARTILLERY  ACCURACY  COMPARISONS.  The  worth  of  the  proposed  MET  aiming 
techniques  can  be  evaluated  through  statistical  analysis  of  simulated  and 
actual  impacts.  Figure  3  identifies  a  self-propelled  howitzer  and  a  target 
location  normalized  impact.  The  solid  line  ellipse  represents  the  accuracy 
and  MET  day-to-day  dispersion  afforded  by  simulated  no  MET  corrections.  This 
procedure  of  using  the  standard  MET  conditions  to  represent  current  conditions 
is  known  as  the  "cold  stick"  method.  In  this  case  the  bias  errors  (largest 
displayed)  represent  the  total  uncorrected  MET  effect.  Method  A  represents 
the  actual  surveyed  MET  bias  errors  afforded  by  the  current  doctrine.  If  the 
dedicated METCM was representative of what the shell experienced, then the
dashed line ellipse would be centered about the target with only the MET
day-to-day and hardware precision dispersion. Method B represents the simulated
improvement  afforded  by  using  MET  data  that  is  more  representative  of  the  MET 
experienced  by  the  projectile  as  it  traverses  through  the  different  portions 
of  its  flight  trajectory.  The  smallest  bias  errors  are  not  zeroed  because  of 
the  always  present  time  staleness  and  space  displacement  between  the  MET 
measurement  and  projectile  location.  However,  the  MET  day-to-day  precision 
error  is  significantly  improved  because  the  proposed  technique  consistently 
reduces  the  MET  bias. 


Figure  3.  Expected  MET  effects  on  artillery  fire. 

By  modifying  the  above  methodology  one  can  extract  the  same  comparison 
analysis  while  only  using  simulated  results;  however,  there  are  no  absolute 
target  misses.  The  paired  differences  between  methods  A  (doctrine)  and  B 
(proposed)  are  the  statistics  required.  Through  simulation  one  can  locate  the 
gun  position  and  target  location  so  that  the  projectile  trajectory  apogee 
traverses  through  an  area  where  actual  measurements  are  recorded.  An 
experiment  is  then  designed  to  compare  simulated  impacts  using  a  single  METCM 
and simulated impacts using a solo METCM at the launch, apogee, and target
trajectory  portions. 


Schematically,  figure  4  represents  the  procedure  followed  in  deriving 
these  impact  dispersions.  For  a  given  MET  scenario  the  single  METCM  and  ABEAM 
multiple  messages  are  inputs  to  the  GTRAJ  model  for  deriving  a  pair  of 
simulated  impacts.  Available  replicates  for  each  scenario  are  averaged  to 
represent  statistical  results.  If  the  paired  differences  are  small  then  the 
single  METCM  is  representative  of  the  three  launch,  apogee,  and  target  areas. 
This  was  not  experienced  as  stated  in  the  following  sections. 
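
The paired comparison described above can be sketched as follows. The sketch assumes that the bias is the mean of the paired impact differences and that sigma is their sample standard deviation; the note does not give these definitions explicitly. The impact coordinates are invented for illustration and are not PASS data.

```python
import statistics

def paired_statistics(impacts_a, impacts_b):
    """Bias and sigma of paired impact differences (method A minus method B).

    Each impact is a (range_m, cross_m) tuple; the two lists must be paired
    replicate by replicate.
    """
    d_range = [a[0] - b[0] for a, b in zip(impacts_a, impacts_b)]
    d_cross = [a[1] - b[1] for a, b in zip(impacts_a, impacts_b)]
    return {
        "bias": (statistics.mean(d_range), statistics.mean(d_cross)),
        "sigma": (statistics.stdev(d_range), statistics.stdev(d_cross)),
    }

# Hypothetical simulated impacts (m) for three replicates of one scenario.
method_a = [(18100.0, 40.0), (18250.0, -10.0), (17950.0, 60.0)]  # single METCM
method_b = [(18000.0, 10.0), (18050.0, 0.0), (18000.0, 20.0)]    # solo METCMs

stats = paired_statistics(method_a, method_b)
print(stats)
```

Small bias and sigma values would indicate that the single METCM represents all three trajectory portions well.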


Figure  4.  Schematic  chart  for  accuracy  comparisons. 

MET SCENARIO RESULTS. During November and December 1974 a comprehensive
ballistic MET field experiment was completed at White Sands Missile Range, New
Mexico. Simultaneous rawinsonde data were collected (see D'Arcy, 1977), and all
the data are formatted in the NATO METCM format (see Field Manual FM 6-16,
1983). Figure 5 identifies the location of the 10 MET stations with a star.
During this experiment actual rounds were aimed and fired at a 14-km target
range. The selected MET scenario involves only five MET stations
simultaneously  releasing  rawinsonde  balloons  to  achieve  a  multilocation  model 
of  the  MET  effects  for  an  example  battlefield  environment. 

Through  simulation  one  can  relocate  the  gun  position  and  identify  a 
target  such  that  the  simulated  apogee  flight  will  traverse  over  a  measuring 
station.  For  example,  figure  6  presents  the  MET  scenario  for  the  five 
simultaneous  rawinsonde  releases  and  a  simulated  18-km  range  firing.  At  this 
target  range  the  apogee  is  less  than  3  km  above  the  surface.  The  concatenated 
arrows  represent  the  1015  wind  conditions  at  each  of  the  six  artillery 
computer  zones  included  under  the  projectile's  apogee.  This  top  view  displays 
the  balloon  drift  and  the  gun/target  line  of  fire.  Each  arrow  represents  the 
wind  vector  at  the  line  number  listed  in  figure  1. 

Figure 5. Field experiment at White Sands Missile Range, New Mexico.


Figure  6.  Space  displacement  MET  effects. 

The  current  doctrine  is  represented  by  superimposing  the  WAR  (site  on 
Martin Luther King Road, formerly War Road) METCM (dedicated station) along the
flight  trajectory.  Data  from  zone  1  is  applied  at  the  launch  and  target  areas 
because  of  the  atmospheric  homogeneity  assumption.  As  stated  in  the  above 
section  the  most  important  trajectory  portion  for  a  coasting  shell  is  the 
apogee.  Comparing  the  apogee  (line  6)  wind  vector  from  WAR  and  TSX  (tower 
site) MET stations reveals a better understanding of the space variance effect
between  simultaneous  wind  measurements.  Following  the  current  doctrine  the 
gun  is  aimed  with  WAR  data,  but  the  shell  experienced  data  measured  at  TSX. 

TSX  reports  a  total  cross  wind  vector  at  apogee,  while  WAR  reports  both  cross 
and  range  wind  components.  The  largest  expected  miss  is  along  the  range 
because  current  doctrine  required  a  higher  quadrant  elevation  to  compensate 
for  the  WAR  reported  head  wind.  In  reality  there  was  no  head  wind  at  apogee; 
therefore,  the  current  method  of  aiming  involves  a  long  miss  in  the  range,  and 
since the WAR cross wind does not match the TSX cross wind there is a left
miss  in  the  cross  component.  So  even  with  real  time  data  because  of  space 
separation  between  the  measurement  and  the  point  of  application  there  will  be 
an  expected  error.  The  lengthening  of  the  apogee  flight  time  will  continue  to 
increase  expected  errors  because  the  atmosphere  is  not  always  homogeneous. 
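
The range and cross wind components discussed above can be obtained by resolving a reported wind along the line of fire. The sketch below assumes angles in degrees measured clockwise from north, a wind direction given as the direction the wind blows from, a positive range wind for a tail wind, and a positive cross wind for wind blowing from left to right across the line of fire. These sign and unit conventions are assumptions made for illustration and are not taken from the METCM format.

```python
import math

def wind_components(speed, direction_from_deg, fire_azimuth_deg):
    """Resolve a wind into (range_wind, cross_wind) along a line of fire.

    range_wind > 0 is a tail wind, < 0 a head wind.
    cross_wind > 0 blows from left to right across the gun/target line.
    """
    rel = math.radians(direction_from_deg - fire_azimuth_deg)
    range_wind = -speed * math.cos(rel)
    cross_wind = -speed * math.sin(rel)
    return range_wind, cross_wind

# Firing due north (azimuth 0) into a westerly wind (blowing from 270).
range_wind, cross_wind = wind_components(10.0, 270.0, 0.0)
print(range_wind, cross_wind)
```

With these conventions a purely westerly wind on a northward line of fire is all cross wind, which mirrors the situation reported at TSX in this scenario.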

The  time  staleness  between  the  MET  measurement  and  the  time  of 
application  presents  the  largest  expected  error.  Figure  7  displays  the 
concatenated  arrows  for  the  1415  wind  conditions  except  for  the  data  collected 
at  the  dedicated  WAR  station.  For  some  battlefield  constraint  the  gun  is  now 
aimed  and  fired  using  the  1015  data  or  with  a  4-hr  old  METCM.  Comparing  these 
results  with  those  from  figure  6  reveals  that  for  this  time  span  the  wind 
profiles shifted from westerly to northerly direction (blowing from). Examine
the  wind  vectors  at  line  6,  the  apogee  of  our  simulated  firing.  The  gun  is 
aimed  with  the  same  WAR  cross  and  range  wind  components  described  in  the  above 
scenario.  However,  in  this  case,  the  wind  experienced  by  the  shell  is  more  in 
the range component and less in the cross component. Both WAR and TSX stations
report a head wind for the range component, but for the cross component they
report significantly different winds. By simulating impacts using these
METCMs, one can define the value of expected errors.




SIMULATED  PAIRED  STATISTICS.  The  artillery  aiming  MET  effects  due  to 
space and time variabilities have been presented with no assigned accuracy
error results. Since this note presents no actual firing comparisons, only
statistical  results  can  be  reported.  Following  the  above  description  and  only 
using  the  2  most  variable  weather  days  of  the  20  data  days  a  set  of  replicates 
was  defined.  The  2  days  contain  five  and  four  simultaneous  releases  every  2 
hours.  For  the  space  variability  there  are  nine  replicates  for  firing  from 
WAR over TSX towards SMR (small missile range). In representing the
statistical sample for the time staleness variability, one needs to pair the
results from these nine replicates into 2- and 4-hr staleness pairs. The total
2-hr stale paired results are 4 and 3 for each corresponding day. Similar
results for the total 4-hr stale paired results are 3 and 2. Table 1 lists
the paired statistics comparing simulated impacts using the current doctrine
of a single dedicated MET station (WAR) and the proposed multiple METCMs
applying only selected solo METCM at the critical trajectory portions
(launch-WAR, apogee-TSX, and target-SMR).
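
The pair counts quoted above can be reproduced with a short sketch, assuming that the two days contain five and four release times, respectively, spaced 2 hours apart. The clock hours used below are hypothetical; only their spacing matters.

```python
def stale_pairs(release_hours, staleness_hr):
    """Pair each release with the earlier release that is staleness_hr older."""
    times = set(release_hours)
    return [(t - staleness_hr, t)
            for t in sorted(times) if t - staleness_hr in times]

# Hypothetical release times (hours); five on the first day, four on the second.
day1 = [8, 10, 12, 14, 16]
day2 = [8, 10, 12, 14]

counts = {lag: (len(stale_pairs(day1, lag)), len(stale_pairs(day2, lag)))
          for lag in (2, 4)}
print(counts)
```

Under this assumption the two days give 4 and 3 pairs at 2-hr staleness and 3 and 2 pairs at 4-hr staleness, for totals of 7 and 5 paired results.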

TABLE 1. PAIRED STATISTICS BETWEEN DEDICATED AND PROPOSED METCM.

                   BIAS (m)       FLIGHTS      SIGMA (m)
                Range   Cross        n       Range   Cross

  MET SPACE VARIABILITY EFFECT
  0 hrs           93      37         9        149      43

  MET TIME VARIABILITY EFFECT
  2 hrs          293     102         7        204      86
  4 hrs          375     158         5        312     159

With  an  increase  in  replicates,  the  statistical  results  (especially  the 
event  to  event  sigma  results)  will  stabilize  and  be  more  representative  of  a 
general  conclusion.  However,  from  these  preliminary  results  one  can  summarize 
a trend in variability and the relative accuracy effects between space
displacement  and  time  staleness.  The  100-m  rms  space  bias  for  an  18-km  target 
range  is  too  much  of  a  difference  to  ignore  in  aiming  artillery.  These 
results  represent  real-time  MET  corrections  and  demonstrate  that  a  significant 
improvement  is  very  possible.  The  310-m  rms  time  bias  for  2-hr  staleness  is 
also  not  acceptable  in  aiming  artillery.  Battlefield  data  must  be  centralized 
and  fused  into  improved  and  modern  MET  correction  techniques.  The  406-m  rms 
time  bias  for  4-hr  staleness  is  definitely  not  acceptable.  Current  MET  aiming 
adjustment  can  be  significantly  improved. 
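
The rms values quoted above are consistent with combining the range and cross biases of Table 1 as a root sum of squares. That reading is an assumption, since the note does not state how the rms figures were formed; the sketch below simply checks the arithmetic.

```python
import math

# (range bias, cross bias) in metres, copied from Table 1.
table1_bias = {"0 hrs": (93, 37), "2 hrs": (293, 102), "4 hrs": (375, 158)}

# Root-sum-square combination of the two bias components.
rms_bias = {label: math.hypot(rng, crs)
            for label, (rng, crs) in table1_bias.items()}

for label, value in rms_bias.items():
    print(f"{label}: {value:.1f} m")
```

The computed values are close to 100 m, 310 m, and 407 m, which agree with the quoted 100, 310, and 406 m to within rounding or truncation.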

SUMMARY.  ASL  is  performing  applied  mathematics  and  developing  computing 
algorithms  that  allow  dynamic  simulated  examination  of  meteorological  effects 
upon  projectiles  traversing  through  a  battlefield  atmosphere.  Since  only 
software product improvements are required, the artillery MET aiming
adjustment can be significantly improved, thereby enhancing the
first-round-hit probability. ABEAM is the advanced model environment used to examine new
artillery  MET  correction  techniques. 

Preliminary  results  demonstrate  the  expected  MET  space  and  time 
variability  effects  on  aiming  extended  range  artillery.  Paired  statistics 
between  the  current  doctrine  and  the  proposed  technique  contain  too  much  of  a 
difference.  These  statistics  are  summarized  as  follows:  100  m  for  space 
variability,  and  310  m  (2-hr  stale)  and  406  m  (4-hr  stale)  time  variability. 
The  atmosphere  is  not  homogeneous,  especially  along  the  extended  artillery 
firing  ranges.  If  the  U.S.  Army  wishes  to  continue  with  accurate  and 
effective  artillery  fire  at  these  new  ranges,  the  current  method  of  adjusting 
for  nonstandard  MET  conditions  needs  to  be  improved.  ASL  is  investigating 
several  possibilities,  and  this  note  describes  a  model  development  quantifying 
the  expected  MET  errors.  More  replicates  to  increase  the  statistical  sample 
size  will  be  defined  using  all  20  PASS  data  days. 

In  conclusion,  all  battlefield  METCM  need  to  be  centralized.  Data  fusion 
and  tailoring  for  launch,  apogee,  and  target  area  applications  need  to  be 
investigated.  Analytic  functional  approximations,  optimum  interpolation,  and 
mesoscale  modeling  can  be  used  to  define  a  solo  composite  METCM.  More 
frequent  battlefield  MET  measurements  are  needed.  Artillery  balloon-borne 
sensor  data  need  to  be  augmented  with  remote  sensor  data. 

Abstract

Techniques  for  visualizing  scientific  data  have  advanced  remarkably  in  recent  years. 
These techniques remain inadequate, however, for interpreting certain classes of
problems, such as those including multivariate or complex, time-dependent data. In addition
to the requirements of existing problems, the scope and size of problems in the near
future will extend into the teraflop range, demanding innovative solutions for interpreting
results.  Investigations  of  two  experimental  technologies  for  data  interpretation,  virtual 
reality  and  data  sonification,  are  presented  in  this  paper. 

1. Introduction


The  traditional  scientific  computing  environment  consisted  of  a  large  mainframe  computer 
or  supercomputer  supporting  a  substantial  user  base,  connected  via  dial-up  asynchronous 
lines  and  dumb  terminals.  Typically,  large  simulation  programs  were  run  in  batch  mode. 
To  some  extent,  this  model  continues  to  exist  due  to  the  high  cost  of  the  central  resources. 
Enhancements  to  this  environment,  however,  have  significantly  extended  the  capabilities 
available  to  modern  users. 

Communications  are  faster  and  more  reliable.  Dial-up  lines  have  been  replaced  by 
higher  bandwidth  networks.  Ethernet,  with  a  bandwidth  of  10  megabits  per  second,  and 
Transmission Control Protocol/Internet Protocol (TCP/IP) are the current standards for
interconnecting  systems  [9,  10].  Distributed  computing  and  scientific  visualization  require 
even  higher  transmission  capacities.  These  requirements  are  being  met  with  a  combination 
of  new  networking  architectures,  protocols,  and  transport  media.  For  example,  the  fiber 
distributed  data  interface,  FDDI,  is  replacing  Ethernet  due  to  its  bandwidth  of  100  megabits 
per  second. 

Workstations  have  supplanted  dumb  terminals  on  the  researcher’s  desktop  as  a  means 
of  interacting  with  the  central  computer.  The  workstations  vary  from  personal  computers 
to high-performance graphics workstations. These systems possess significant
computational and storage resources which rival those of first generation supercomputers. Now,
client/server  applications  and  distributed  computing  environments  allow  clusters  of  these 
systems  to  work  together  to  solve  large,  complex  problems. 

Such an environment, in addition to making large numerical simulations
computationally feasible, has fostered the development of more sophisticated techniques necessary for
interpreting simulation results. Visualization software, integrating the computational power
of supercomputers with specialized graphics hardware, has extended the set of
interpretation tools available to the researcher. Two-dimensional line contours have been expanded to
surfaces. Surfaces may have constant value (iso-surfaces) or may be shaded, displaying
multiple values using a continuous range of colors. In addition, researchers can examine vector
data  using  particle  traces  or  through  time-dependent  animations  on  their  workstations. 

Clearly, the interpretive techniques reviewed above represent a significant advancement
beyond methods available only five years ago. Nonetheless, certain classes of problems
require interpretive solutions for which current techniques remain inadequate. Examples
include studies with multivariate data as well as complex time-dependent problems. Research
indicates  that  technologies  engaging  a  fuller  realm  of  human  perception,  incorporating  such 
senses  as  hearing  and  touch  as  well  as  sight,  can  be  effective  for  interpreting  these  classes 
of  problems  [14].  In  addition  to  the  requirements  of  existing  problems,  the  scope  and  size 
of  problems  in  the  near  future,  extending  into  the  teraflop  range,  will  demand  innovative 
solutions  for  interpreting  results.  The  remainder  of  this  paper  examines  two  experimental 
technologies for data interpretation, virtual reality and data sonification, and discusses the
results  of  our  investigations. 

2.  Experimental  Data  Interpretation 
2.1.  Virtual  Reality 

Virtual  reality  has  generated  profound  interest  in  the  technical  community  for  its  potential 
in  analyzing  scientific  data.  Employed  for  such  a  purpose,  it  seeks  to  engage  the  researcher 
more  fully  in  the  interpretive  process,  allowing  interaction  with  the  data  through  viewing, 
touching, and actually moving in the physical space or "world" of the data. While the
technology  is  receiving  a  great  deal  of  attention  presently,  certain  concepts  embodied  in 
virtual  reality  date  back  to  the  1960’s  when  Ivan  Sutherland  developed  the  first  head- 
mounted  display  system  [24]. 

The  original  head-mounted  display  consisted  of  a  helmet  containing  a  pair  of  CRTs  with 
left-eye  and  right-eye  views,  adjusted  by  computer  according  to  the  user’s  head  movements. 
Researchers  at  NASA  Ames  Research  Center  refined  the  head-mounted  display  to  use  liquid 
crystal  displays  (LCDs)  and  electronic  sensors,  making  it  practical  and  affordable  [8]. 

Two companies exemplify the commercial viability of virtual reality: VPL Research Inc.
and Fake Space Labs. The companies utilize similar hardware configurations but with some
variation.  VPL  has  chosen  to  use  the  head-mounted  display  in  their  EyePhone  System.  The 
display  employs  two  color  LCDs  with  a  pixel  resolution  of  360  x  240.  Motion  is  detected 
by  the  3Space  Isotrak  Sensor  System  developed  by  Polhemus  Navigation  Sciences  [27].  The 
3Space  system  contains  two  components,  a  source  and  a  sensor.  The  source  generates  two 
hemispherical  electromagnetic  fields,  each  of  which  has  a  radius  of  approximately  33  inches. 
The  sensor,  mounted  on  the  headset,  continuously  transmits  its  spatial  location  to  an  A/D 
converter,  which  then  transmits  this  information  to  the  computer. 

The Binocular Omni-Orientation Monitor (BOOM) system offers an alternative to the
VPL head-mounted system. Manufactured by Fake Space Labs, the BOOM is an extension
of  a  prototype  developed  at  the  NASA  Ames  View  Laboratory  [4].  The  BOOM  supports 
two  small  CRTs,  each  with  a  pixel  resolution  of  720  x  486,  on  a  counterbalanced  yoke 
attached  through  six  joints  to  a  base.  The  apparatus  uses  optical  encoders  located  in  the 
joints  to  determine  spatial  position  and  orientation. 

A  VPL  DataGlove  is  used  in  both  systems  as  the  means  for  directly  interacting  with  the 
virtual  world.  The  DataGlove  provides  two  types  of  information.  First,  spatial  information 
is  generated  using  a  Polhemus  3Space  Isotrak  system,  described  above.  Second,  information 
concerning the hand, i.e., gestures, is determined through the attenuation of light transmitted
along  custom  optical  fibers  which  span  the  length  of  each  finger  on  the  glove.  These  fibers 
are  quite  sensitive,  so  that  even  such  precise  gestures  as  American  Sign  Language  can  be 
accurately  transmitted  to  the  computer. 

2.1.1.  Working  with  Virtual  Reality:  Observations 

Although the VPL EyePhone System and the Fake Space Labs BOOM system share
certain similarities, they differ notably in several respects, particularly the display technology
and  the  software  interface  for  creating  virtual  worlds.  The  ensuing  discussion  presents  a 
comparative  analysis  of  the  two  systems,  based  on  our  experiences.  A  report  on  the  focus 
of  current  efforts  as  well  as  comments  on  future  directions  conclude  the  discussion. 

2.1.2.  Hardware  Configuration 

The  display  technologies  of  the  two  virtual  reality  systems  present  a  distinct  choice  to  the 
potential  user.  The  VPL  system  offers  the  authenticity  of  color,  while  the  BOOM  provides 
increased  detail  due  to  the  higher  resolution  display.  For  simulation,  color  is  a  necessary 
component  of  realism.  Conversely,  data  interpretation  benefits  from  the  higher  resolution. 
Eventually,  high  resolution  color  displays  will  be  cost-effective  when  virtual  reality  becomes 
generally  available  and  economies  of  scale  can  be  achieved.  One  must  also  consider  the 
advantages  of  the  VPL  head-mounted  display  versus  the  counterbalanced  viewing  apparatus 
of  the  BOOM.  The  headset  imparts  a  feeling  of  being  immersed  in  the  virtual  world,  whereas 
the  BOOM  is  more  accessible  for  scientific  data  exploration. 

2.1.3. Software Interface


More  significant  than  the  differences  in  the  two  display  technologies  are  the  dissimilarities 
in  the  software  interface  for  creating  virtual  environments.  The  VPL  system  provides  a 
program  called  Swivel  to  create  virtual  worlds  [22].  Essentially  a  drawing  program,  Swivel 
allows  the  user  to  create  objects  and  place  them  in  the  virtual  world,  with  dimensions  of 
1000  x  1000  x  1000  units.  Individual  constraints  and  relationships  between  objects  can  be 
specified.  Though  it  is  possible  to  detect  collisions  between  objects,  interference  checking  is 
not  automatic;  hence,  creating  solid,  impassable  objects  requires  explicit  testing  for  collision. 
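
The kind of explicit collision test mentioned above can be sketched with axis-aligned bounding boxes. This is a generic illustration, not the Swivel or Body Electric interface; the `Box` type and the object coordinates are hypothetical and only assume a world measured in the same arbitrary units.

```python
from dataclasses import dataclass

@dataclass
class Box:
    """Axis-aligned bounding box given by its minimum and maximum corners."""
    lo: tuple  # (x, y, z) minimum corner
    hi: tuple  # (x, y, z) maximum corner

def boxes_collide(a, b):
    """True when the two boxes overlap (or touch) on all three axes."""
    return all(a.lo[i] <= b.hi[i] and b.lo[i] <= a.hi[i] for i in range(3))

# Hypothetical objects in a 1000 x 1000 x 1000 unit world.
wall = Box((500, 0, 0), (510, 1000, 1000))
hand = Box((495, 100, 100), (505, 110, 110))
far_object = Box((0, 0, 0), (10, 10, 10))

print(boxes_collide(wall, hand))        # the hand overlaps the wall
print(boxes_collide(wall, far_object))  # no overlap
```

An application would run such a test each time an object moves and reject any move that produces an overlap, which is what makes an object behave as solid.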

Posing  another  restriction,  Swivel  provides  no  translators  for  importing  models,  such 
as  CADD.  For  these  cases,  the  user  must  completely  reconstruct  the  model,  using  Body 
Electric, a programming language specific to Swivel [3]. Externally developed codes
cannot be linked into the system. Consequently, the VPL software provides a rather closed
programming  environment. 

No  software  is  provided  with  the  BOOM.  Working  code  can  be  obtained,  nonetheless, 
from  various  organizations  which  have  a  BOOM.  In  general,  such  software  is  based  on 
functions  originally  developed  at  NASA  Ames  [4].  These  functions  read  information  from 
the  headset  and  the  DataGlove  and  control  the  stereo  mode  of  the  display.  This  relatively 
low  level  of  functionality  allows  total  flexibility  for  creating  virtual  worlds:  functions  for 
displaying  the  desired  data  must  be  externally  developed.  This  makes  it  possible  to  view 
essentially  any  type  of  data  which  can  be  drawn  on  the  computer,  including  numerical 
simulation  results,  physical  or  CADD  data.  Thus,  the  BOOM  provides  a  more  open,  flexible 
programming  environment  than  that  offered  by  the  VPL  system. 

2.1.4.  Applications  of  Virtual  Reality 

Our  goal  is  to  apply  virtual  reality  to  the  problems  associated  with  data  interpretation. 
These  problems  differ  from  those  for  simulation  in  which  realism,  in  the  realm  of  the  virtual 
world,  is  the  dominant  requirement.  Employing  virtual  reality  to  analyze  data  should  enable 
the  researcher  to  perceive  the  data  in  new  ways,  discerning  structure  that  is  difficult,  if  not 
impossible,  to  detect  using  traditional  computer  displays. 

To  date,  we  have  created  three  models  for  examination  using  the  BOOM.  The  first 
project  entailed  importing  a  three-dimensional  finite  element  model  of  steady-state  flow 
through a channel. For the second project, a volume model was created using data
collected from accelerometers buried in soil. Finally, the flight of a projectile was examined.
Each  of  the  projects  involved  differing  types  of  data,  yet  in  each,  a  need  existed  for  the 
scientist  to  interact  with  the  model.  For  example,  in  studying  flow  through  the  channel,  it 
is  difficult,  using  two-dimensional  displays,  to  observe  the  helical  flow  patterns  that  develop 
in  the  curves.  Employing  particle  traces  in  virtual  reality,  these  details  can  be  more  readily 
observed. 

Our attitude towards the use of virtual reality for data interpretation remains
ambivalent. Virtual reality provides an intuitive model for moving around in a three-dimensional
world.  However,  visualization  techniques  must  be  adapted  to  facilitate  exploring  the  data 
from  a  perspective  of  immersion  in  the  virtual  world,  rather  than  the  conventional  approach, 
wherein  the  researcher  is  positioned  outside  that  world.  New  paradigms  must  be  developed, 
similar  to  the  shift  involved  in  parallel  versus  sequential  programming. 

2.2. Data Sonification

Researchers  have  been  interested  in  the  applications  of  sound  in  computer  technology  for  the 
last  decade.  Interest  in  the  area  has  increased  as  sound  synthesis  techniques  have  improved, 
making  it  possible  to  produce  digital  audio  signals  in  real  time.  Much  of  the  initial  research 
in the area was motivated by the need to employ sound in the development of
computer-human interfaces for the visually impaired [16]. Subsumed within this larger goal was the
desire  to  use  sound  to  interpret  data,  also  called  data  sonification.  However  one  chooses  to 
categorize  the  research,  the  literature  indicates  common  issues  to  be  addressed  and  shared 
findings  upon  which  to  draw  to  effectively  employ  sound  in  computer-human  interaction. 

Among the fundamental findings of the initial investigations was that the human auditory
system possesses unique attributes which could be used more fully in developing the
computer-human interface [1]. First, analogous to its visual
counterpart, the human auditory system can perceive certain physical dimensions of sound,
such  as  pitch,  volume,  and  duration.  These  lower  level  physical  dimensions  can  be  isolated 
and  manipulated  to  present  information  similar  to  the  way  in  which  the  visual  attributes  of 
an  object  such  as  shape,  size,  or  color  may  be  used  [1,5].  Secondly,  at  a  higher  level,  human 
hearing  represents  a  continually  open  channel  and  can  thus  function  in  the  background, 
detecting  information  without  requiring  full  conscious  attention  [13].  Audio  cues  have  been 
used  for  some  time  to  develop  auditory  warning  systems  which  monitor  aircraft  or  nuclear 
power plants.

Early studies also revealed that certain kinds of information are possibly better understood
aurally than visually, such as time-varying, logarithmic, and multivariate data
[1, 19, 26]. Finally, initial research demonstrated that any efforts to employ sound in
computer-human interaction must address certain fundamental questions: What are the
most appropriate mappings of information to sound and how can these mappings be determined?
Also, how do human auditory and visual perception interact and how can this be
maximally used to convey information [16, 18]?

2.2.1.  Current  Sound  Technology 

Lack  of  necessary  hardware  was  a  genuine  impediment  to  early  research  efforts  involving 
computer-generated  sound.  As  sound  synthesis  techniques  have  improved,  the  applications 
of  sound  in  computer  technology  have  accelerated.  This  section  briefly  reviews  current 
technology  for  generating  computer-synthesized  sound. 

The Musical Instrument Digital Interface, or MIDI, is a standard adopted by the music
industry for interfacing electronic sound synthesis and processing equipment to computers.
MIDI can be described as something akin to the printer language PostScript [20]. However,
instead  of  sending  page-description  instructions,  MIDI  conveys  music-description  signals 
to  a  synthesizer.  The  standard  has  made  a  wider  range  of  equipment  available  to  the 
researcher. 
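
As a rough illustration of the mapping question raised above, the sketch below maps a data series linearly onto MIDI note numbers and packs each one into a three-byte Note On message. The message layout (a status byte of 0x90 plus the channel number, followed by note and velocity values in the range 0 to 127) follows the MIDI standard. The linear pitch mapping, the default note range, and the function names are assumptions made for illustration only; they are not taken from the studies cited in this section.

```python
def value_to_note(value, lo, hi, note_lo=48, note_hi=84):
    """Linearly map a value in [lo, hi] to a MIDI note number (assumed 3-octave range)."""
    if hi == lo:
        return note_lo
    frac = (value - lo) / (hi - lo)
    frac = min(1.0, max(0.0, frac))          # clamp out-of-range samples
    return note_lo + round(frac * (note_hi - note_lo))


def note_on(note, velocity=64, channel=0):
    """Build a 3-byte MIDI Note On message: status 0x9n, note, velocity."""
    if not (0 <= channel <= 15 and 0 <= note <= 127 and 0 <= velocity <= 127):
        raise ValueError("channel must be 0..15; note and velocity 0..127")
    return bytes([0x90 | channel, note, velocity])


def sonify(series):
    """Return one Note On message per sample of a (hypothetical) data series."""
    lo, hi = min(series), max(series)
    return [note_on(value_to_note(v, lo, hi)) for v in series]


if __name__ == "__main__":
    for msg in sonify([0.0, 2.5, 5.0, 10.0]):
        print(msg.hex())
```

Other physical dimensions of sound mentioned earlier, such as volume and duration, could be driven by additional data variables in the same way, for example by mapping a second variable to the velocity byte.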

It  should  be  noted  that  prior  to  development  of  MIDI  synthesizers,  some  experimentalists 
performed  sound  synthesis  in  software  [17].  The  early  sound  synthesis  languages  (Music 
N  languages)  were  modular  and  flexible,  but  could  not  produce  sound  in  real  time  [15]. 
MIDI  synthesizers  overcame  this  limitation,  implementing  the  sound  synthesis  algorithms 
in hardware to produce digital audio in real time. However, because the hardware is preprogrammed,
MIDI synthesizers alone do not offer the flexibility of software synthesis and
require  additional  software  and  hardware  components  to  perform  experimental  research. 

Digital  signal  processors  (DSPs)  offer  another  alternative  for  sound  synthesis.  DSPs  are 
microprocessors  specifically  designed  for  high-speed  digital  signal  processing  operations. 
Implementing  the  sound  synthesis  algorithms  in  software  to  be  executed  on  one  or  several 
DSPs  in  parallel  can  yield  the  flexibility  of  software  synthesis  and  the  real-time  response  of 
MIDI  synthesizers. 


2.2.2.  Current  Data  Sonification  Research 


Current research efforts have attempted to address the various issues raised in the initial
investigations. In one such project, researchers at the National Center for Supercomputing
Applications (NCSA) have employed a sound specification system to test the effectiveness of
sonification in assisting researchers to analyze complex data [7]. The research team explored
the problem of generating optimum data-to-sound mappings by testing what they
term both "abstract" and "data-related" mappings. To realize the goals of the project,
the investigators employed a sound specification system to analyze data sets from selected
NCSA-produced scientific visualizations. The sound specification system incorporated both
a general-purpose hardware component and an object-oriented software component. The
project culminated in the production of sonifications which augment the selected visualizations.
Results of the project indicated the significance of cognitive factors in determining
effective data-to-sound mappings. The investigators point out, however, that more thorough
exploration and testing of data mappings is required to develop sonification as a data
interpretation technique [7].

Researchers at the University of Lowell have examined the effectiveness of sound in multivariate
data interpretation, particularly the interaction between visual and aural perception
in the interpretive process [14]. The project has entailed construction of an exploratory
visualization tool, Exvis. The tool seeks to exploit the human ability to perceive texture,
both visually and aurally. The developers argue that a texture, made up of numerous
discriminable elements, permits deliberate analysis of individual elements, yet also, without
requiring deliberation, provides an overall impression of the data [14]. Each data sample
is represented visually by a stick-figure graphical unit called a glyph or "icon" [14]. The
data parameters determine the attributes of an icon, such as the size or position of its parts.
Variations in shape, size, spacing, or orientation create textural gradients or contours, which
indicate to the viewer structures of potential significance in the data. Each icon has auditory
as well as visual attributes. An icon may generate a single tone or other sound, depending
on values assigned to its attributes. Multiple icons sounding simultaneously produce
auditory textures similar to visual textures.

2.2.3. Applications of Sound Technology

Of the two studies, the NCSA project adopted the more general focus by employing
a  sound  specification  system  to  manipulate  audio  attributes.  Combining  such  an  approach 
with  the  Exvis  emphasis  on  multivariate  data  appears  to  be  optimal  for  employing  sound 
to  interpret  complex  data.  Presently,  we  are  in  the  initial  stages  of  investigating  a  sound 
specification  system  and  possible  data  mapping  techniques. 

Several studies are being evaluated for the application of data sonification techniques.
Those  involving  multivariate  data  sets  from  both  physical  and  numerical  systems  appear 
particularly  well-suited.  Examples  include  the  Chesapeake  Bay  study  and  groundwater 
investigations:  employing  sound  while  moving  through  three-dimensional  vector  and  scalar 
fields  could  yield  valuable  insight.  A  final  application  would  be  the  development  of  user 
interfaces  to  scientific  programs  for  the  visually  impaired. 

3.  Conclusions 

Scientific visualization has extended the boundaries of traditional graphics. Nonetheless,
new methods must be developed which address the interpretive requirements of more complex,
time-dependent systems. This paper has examined two possible technologies, virtual
reality and data sonification. Each offers potential benefits not offered by conventional
methods. Through ongoing studies, we expect to determine the viability of these technologies
for exploring scientific data.

1. INTRODUCTION

There were three major goals of this paper. The first was to utilize the massively parallel
Connection Machine (CM) for computation of flow about Army projectiles and missiles using a zonal
code. Multi-zone grids on the order of several million points have been used to model real-world
three-dimensional geometries with discontinuities. This allows breaking up of a complex
computational domain into blocks of simple grids. Overlaid multi-zone grids were considered for the
study. Since the quality of the grid has a direct impact on the quality of the solution, the grid is a
dominant factor.

We  have  also  retained  the  complex  boundary  conditions  associated  with  practical  applications.  In 
particular,  the  boundary  conditions  were  neither  simplified  nor  hard  coded  inside  the  code  for 
computational  efficiency.  They  are  set  up  in  modular  blocks  that  make  it  easy  to  use  the  same  code 
for  a  variety  of  applications. 

Our second goal, again emphasizing practical applications, was to compare performance of the
whole application to that of the code kernel. The importance of the communication overhead on the CM,
and the communication issues that distinguish multi-zone from single-zone grid applications, are addressed.

Our  third  goal  was  to  demonstrate  the  usefulness  of  a  remote  visualization  capability  developed  in 
conjunction  with  the  massively  parallel  computations.  Applications  involving  millions  of  grid-points 
on  a  remotely  located  CM  require  the  transfer  of  enormous  amounts  of  data  to  a  local  machine  for 
visualization.  This  requires  special  consideration  when  one  is  interested  in  capturing  real-time 
dynamic  or  unsteady  behavior  of  the  flow  solution. 

2.  CONNECTION  MACHINE 

The CM is a single instruction multiple data (SIMD) parallel computer. A full size Connection
Machine of this class has 65,536 (= 2^16) bit serial processors, 16 processors to a chip. The chip also
contains router circuitry for inter-processor communication. The CM has one floating-point chip for
every 32 processors. A "node" consists of two processor chips, a floating point chip and memory.
Thus a full size CM has 2048 nodes. The CM located at the Army High Performance Computing
Research Center (AHPCRC, University of Minnesota) has 1024 nodes or 32,768 bit serial processors.

Because of the SIMD nature of the CM, it is not possible to overlap computation with
communication. Thus communication, if not implemented properly, can significantly degrade the
overall performance. The CM has a regular grid communication feature within its hypercube
topology. This grid or "NEWS" communication is designed in such a way that grid neighbors are
assigned to hypercube neighbors in the communication network. This allows every processor, in
parallel, to pass data to its neighbor, all in the same direction. Since in our finite-difference
application the communication pattern is regular, we have used the above described grid
communication in the code. The cost of grid communication is typically very competitive with the
cost of the basic arithmetic. On the other hand, random access or long-distance inter-processor
communication is far more expensive than regular grid communication. Although it is possible to pass
data from each node to all four neighbors simultaneously, using a special micro-coded communication
primitive, this has not been explored because it is not considered a general purpose feature at the
present time.

The  CM  located  at  the  AHPCRC  has  128  Kbytes  of  memory  per  processor  for  a  total  of  4  Gbytes 
of  memory.  It  has  a  SUN  front  end,  consisting  of  two  4/490  systems,  to  provide  compiles,  loads  and 
communications.  It  also  has  a  10  Gbyte  parallel  data  vault  storage  system  which  provides  disk  storage 
accessible  at  a  rate  of  24  MBytes/second.  This  consists  of  42  SCSI  disks  operating  in  parallel  and  the 
special  software  libraries  and  utilities  needed  to  use  the  system. 
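
The machine sizes quoted in this section follow from a few ratios given in the text (16 processors per chip, one floating-point chip per 32 processors, two processor chips per node, 128 Kbytes per processor at the AHPCRC). The short sketch below simply checks that arithmetic; the function and constant names are ours and are not part of any CM software.

```python
PROCESSORS_PER_CHIP = 16     # bit serial processors on one chip
PROCESSORS_PER_FPU = 32      # one floating-point chip per 32 processors
CHIPS_PER_NODE = 2           # a node = two processor chips + FPU + memory


def cm_configuration(processors, kbytes_per_processor=128):
    """Derive chip, node, and memory totals from the processor count."""
    chips = processors // PROCESSORS_PER_CHIP
    return {
        "chips": chips,
        "nodes": chips // CHIPS_PER_NODE,
        "fpus": processors // PROCESSORS_PER_FPU,
        # Kbytes -> Gbytes (1 Gbyte = 1024 * 1024 Kbytes)
        "memory_gbytes": processors * kbytes_per_processor / (1024 * 1024),
    }


if __name__ == "__main__":
    print("full size:", cm_configuration(2 ** 16))
    print("AHPCRC   :", cm_configuration(32 * 1024))
```

The 128 Kbytes per processor is stated only for the AHPCRC machine, so the memory total printed for the full size configuration is an extrapolation, not a figure from the text.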

3.  NAVIER-STOKES  ALGORITHM 

Since the architecture of the SIMD CM is well suited for an explicit type method, an explicit
multi-stage finite-difference method has been adopted [Ref. 1]. The code solves the Navier-Stokes
equations in generalized coordinates with no thin-layer assumption. The three-dimensional
Navier-Stokes equations in generalized curvilinear coordinates can be found in [Ref. 1]. We can write the
equations as:

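$$
\frac{\partial Q}{\partial t} + \frac{\partial (E - S)}{\partial \xi} + \frac{\partial (F - T)}{\partial \eta} + \frac{\partial (G - R)}{\partial \zeta} = 0
$$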

where  Q  is  the  vector  of  the  conserved  variables,  E,  F,  G  are  the  inviscid  flux  terms  and  S,  T,  R  are 
the  viscous  terms.  The  code  is  capable  of  performing  both  steady  and  unsteady  computations.  For 
steady  flows,  a  local  time  stepping  convergence  acceleration  scheme  is  used. 

We  have  added  adaptive  artificial  dissipation  terms  to  the  equations.  These  terms  are  required  to 
damp  out  high  frequency  oscillations  associated  with  the  odd-even  decoupling  in  central-difference 
schemes.  Also,  the  artificial  dissipative  terms  are  required  to  capture  shock  and  contact  discontinuities 
without  undesirable  oscillations.  An  adaptive  blend  of  second  and  fourth  differences  has  been  used  for 
artificial  dissipation.  The  dissipation  terms  have  been  improved  by  a  directional  eigenvalue  scaling 
[Ref. 2], which has been found effective for highly stretched grids in both inviscid and boundary
layer/wake regions.
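
The text does not give the dissipation formula itself. One widely used form of an adaptive blend of second and fourth differences is the Jameson-Schmidt-Turkel construction, and the one-dimensional sketch below follows that form as an assumption; it is not necessarily the authors' implementation. It omits the spectral-radius factor and the directional eigenvalue scaling of [Ref. 2], and the constants k2 and k4 are typical illustrative values rather than values from the paper. Pressure is assumed positive.

```python
def pressure_sensor(p):
    """Normalized second difference of pressure: large near shocks, ~0 in smooth flow."""
    n = len(p)
    nu = [0.0] * n
    for i in range(1, n - 1):
        nu[i] = abs(p[i + 1] - 2.0 * p[i] + p[i - 1]) / (p[i + 1] + 2.0 * p[i] + p[i - 1])
    return nu


def dissipation_flux(q, p, k2=0.25, k4=1.0 / 64.0):
    """Dissipative flux d[i] at face i+1/2, for faces with a full four-point stencil.

    d = eps2 * (first difference) - eps4 * (third difference); faces next to
    the ends of the array are left at zero in this sketch.
    """
    nu = pressure_sensor(p)
    n = len(q)
    d = [0.0] * (n - 1)
    for i in range(1, n - 2):
        eps2 = k2 * max(nu[i], nu[i + 1])    # second-difference term, active near shocks
        eps4 = max(0.0, k4 - eps2)           # fourth-difference term, switched off near shocks
        first = q[i + 1] - q[i]
        third = q[i + 2] - 3.0 * q[i + 1] + 3.0 * q[i] - q[i - 1]
        d[i] = eps2 * first - eps4 * third
    return d


if __name__ == "__main__":
    step = [1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 2.0]
    print(dissipation_flux(step, step))
```

Every face uses the same stencil, so the computation maps naturally onto one (virtual) processor per grid point with nearest-neighbor communication only.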

For a time accurate solution, the explicit scheme requires a limited time step size that must be
determined by the numerical stability criterion of the scheme. The time step size is normally taken
as the minimum, over all grid cells, of the step allowed by the Courant-Friedrichs-Lewy (CFL) condition.
However,  convergence  to  the  steady  state  solution  can  be  accelerated  by  sacrificing  the  time  accuracy 
of  the  scheme  and  advancing  the  solution  at  each  cell  in  time  by  the  maximum  possible  local  time  step 
in  that  cell.  Local  time  stepping  is  also  well  suited  for  parallel  processing  because  it  allows  concurrent 
computation  of  the  time  step  size  in  all  cells  and  avoids  long-distance  communication.  The  code 
allows  the  use  of  multi-zone  overlaid  grids.  The  multi-zone  grid  has  the  potential  to  reduce  the  grid 
generation  complexities  and  to  improve  the  quality  of  the  grids  associated  with  complex  body 
geometries.  Since  the  application  involves  grid  points  of  the  order  of  millions,  the  grid  must  be  saved 
on  the  data-vault,  a  mass  storage  device,  for  fast  parallel  I/O  in  the  code.  If  the  grids  and  solution 
fields  are  not  saved  on  the  data-vault,  the  I/O  time  will  be  prohibitive  for  both  visualization  and 
application. 
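
The difference between the global and local time step can be made concrete with a small sketch. It uses the one-dimensional inviscid estimate dt = CFL * dx / (|u| + c) for each cell, which is a simplifying assumption; the code described here works in three-dimensional generalized coordinates and its actual stability bound is not given in the text. The function names and the CFL value are illustrative.

```python
def local_time_steps(dx, speed, sound_speed, cfl=0.9):
    """CFL-limited step for each cell: dt_i = cfl * dx_i / (|u_i| + c_i).

    Each entry depends only on data in its own cell, so all cells can be
    evaluated concurrently with no communication.
    """
    return [cfl * h / (abs(u) + c) for h, u, c in zip(dx, speed, sound_speed)]


def global_time_step(dx, speed, sound_speed, cfl=0.9):
    """Time-accurate step: the minimum local step, which needs a global reduction."""
    return min(local_time_steps(dx, speed, sound_speed, cfl))


if __name__ == "__main__":
    dx = [0.1, 0.01, 1.0]            # a stretched grid: fine near the body, coarse far away
    u = [1.0, 1.0, 1.0]
    c = [1.0, 1.0, 1.0]
    print("local :", local_time_steps(dx, u, c, cfl=1.0))
    print("global:", global_time_step(dx, u, c, cfl=1.0))
```

On a highly stretched grid the smallest cell dictates the global step, so coarse cells advance far more slowly than stability requires; local time stepping removes that restriction at the cost of time accuracy.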

For  simple  body  shapes,  such  as  a  body  of  revolution,  the  grid  generation  has  been  carried  out  on 
the  CM  itself  and  grid  files  have  been  saved  on  the  data-vault.  For  complex  body  shapes  requiring  the 
use  of  a  more  involved  grid  generation  code,  grids  have  been  generated  on  a  local  machine  and  the 
binary  grid  files  transferred  to  the  CM.  These  grids  are  placed  on  the  data-vault  by  a  pre-processing 
code  on  the  CM  using  different  file  names  for  each  grid  zone. 

The solution process requires the computation of flux vectors, including artificial dissipation,
from the dependent variables. This can be done concurrently for all interior grid points in a given
zone. In the case of a multi-zone grid, the above process is carried out for each zone until all zones are
finished. Then the solution vectors are obtained by solving the first stage in parallel for all interior
points of a given zone. Again the process repeats for any additional zones. Next, appropriate
boundary conditions are updated, each in parallel. The major concern when applying the complex
boundary conditions, which take longer, is that processors representing interior grid points do not
participate in any useful work. Thus boundary conditions must be carefully considered. The
inter-zone data exchange is done next for overlaid zones. Since each grid zone is separately mapped on the
CM, involving different sizes, a substantial amount of long-distance communication may be involved in
the exchange process. Although no computation is involved in this exchange process, the
communication overhead is substantial. This cycle repeats for the next stage. Variants of this cycle
can be explored by applying boundary conditions and/or inter-zone exchanges less frequently in each
stage, to achieve better performance.
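
The ordering of operations described above can be summarized in a schematic sketch. It models only the sequence of steps within one multi-stage iteration, not the numerics, and every name in it (the operation labels, the zone names, and the bc_every and exchange_every parameters used to express the "less frequent" variants) is hypothetical.

```python
def run_iteration(zones, n_stages, bc_every=1, exchange_every=1):
    """Return the ordered list of operations for one multi-stage iteration.

    Each entry is (stage, operation, zone); zone is None for steps that are
    applied across all zones.
    """
    log = []
    for stage in range(1, n_stages + 1):
        for zone in zones:                    # flux vectors incl. dissipation, zone by zone
            log.append((stage, "flux", zone))
        for zone in zones:                    # stage update of all interior points
            log.append((stage, "update", zone))
        if stage % bc_every == 0:             # boundary conditions, each in parallel
            log.append((stage, "boundary", None))
        if stage % exchange_every == 0:       # inter-zone exchange for overlaid zones
            log.append((stage, "exchange", None))
    return log


if __name__ == "__main__":
    for entry in run_iteration(["nose", "body", "base"], n_stages=2):
        print(entry)
```

With the default arguments, boundary conditions and exchanges follow every stage, as in the cycle described in the text; setting exchange_every=2, for example, halves the number of inter-zone exchanges.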

4.  FLOW  APPLICATION 

Depending  on  the  application,  the  computational  domain  consists  of  either  a  single-block  or  a 
multiple-block  grid.  Each  block  consists  of  a  large  number  of  grid  points.  For  code  implementations, 
the CM has been configured as a 3-D grid with one (virtual) processor per grid point and each
processor connected to its nearest neighbors. The code was designed in such a way that each
grid-block  has  its  own  local  variables.  This  approach  allowed  us  to  map  each  grid  block  on  the  entire 
CM.  Each  grid  block  should  contain  a  sufficient  number  of  grid  points  (in  multiples  of  the  physical 
processors)  to  give  a  high  virtual  processor  ratio  for  efficient  computation.  That  is,  each  physical 
processor  will  process  several  grid  points  of  a  single  block  grid.  Thus  for  a  multi-zone  grid,  storage 
requirements  per  physical  processor  can  become  very  large. 

The inter-processor communication within each grid block is relatively efficient because of the
previously  mentioned  dedicated  "NEWS"  network.  Applications  involving  multi-block  grids  typically 
have  grid  blocks  of  different  sizes.  Thus  there  is  substantial  "long-distance"  communication  involved 
in  inter-block  data  exchange.  This  is  one  of  the  major  issues  involved  in  comparison  of  a  multi-block 
application  with  a  single-block  application.  Hence,  long-distance  interprocessor  communication  is  the 
first  important  issue.  Because  the  CM  is  SIMD,  it  is  not  practical  to  overlap  computation  with  the 
required  interprocessor  communication  and  thus  the  overhead  generated  by  communication  has  a 
direct  effect  on  performance. 

During  the  computation  of  boundary  conditions,  un-needed  processors  will  not  be  performing  useful 
work. Since our goal is to maximize the rate of useful computations, the implementation of boundary
calculations  is  the  second  important  issue. 

For  some  applications,  the  boundary  stencil  can  be  made  part  of  the  interior  stencil  by  using  an 
array  that  has  desired  factor  values  on  the  appropriate  boundaries  and  interior  factor  values 
corresponding  to  the  physical  position  in  the  computational  field.  The  overhead  imposed  by  the 
inclusion  of  additional  terms  when  solving  the  interior  and  boundary  points  simultaneously  is 
relatively  small  on  the  CM.  Hence  the  application  of  difference  stencils  to  virtual  processors 
representing  boundary  points  can  be  performed  concurrently  with  interior  points.  Also  the  Dirichlet 
and  periodic  type  of  boundary  conditions  are  easy  to  implement  on  the  CM.  For  many  applications, 
the  type  of  boundary  conditions  required  is  rather  complex.  There  is  little  or  no  similarity  in  the 
difference  stencils  between  the  interior  and  boundary  points.  This  is  because  of  the  nature  of  the 
boundary  conditions  or  higher  order  approximations  applied  to  the  boundary  conditions.  Also,  the 
boundary conditions are evaluated after the interior points have been updated. Thus separate
evaluation of the boundary conditions imposes an overhead on a SIMD massively parallel
architecture, and the size of that overhead depends mostly on the type of application involved and the
complexity of the boundary condition. Although all boundary points in a given boundary condition
can  be  executed  in  parallel,  still  a  large  ratio  of  virtual  processors  might  not  be  doing  useful  work.  So 
the  type  of  applications  and  the  type  of  boundary  conditions  used  play  an  important  role  in  comparing 
performance  with  other  machines. 
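
The idea of folding a simple boundary condition into the interior stencil by means of a factor array can be illustrated with a small one-dimensional example. The sketch below is not taken from the zonal code: it applies a Jacobi relaxation for a model Laplace problem, with a factor of 1 at interior points and 0 at Dirichlet boundary points, so that every point executes the identical update. The circular neighbor fetch stands in for a grid shift, and all names are illustrative.

```python
def masked_jacobi_step(u, factor):
    """Apply one identical stencil update at every point, boundaries included.

    Where factor[i] == 0 the value is left unchanged (a Dirichlet point);
    where factor[i] == 1 the point relaxes toward the mean of its neighbors.
    """
    n = len(u)
    new = []
    for i in range(n):
        left = u[(i - 1) % n]       # circular shift: same fetch at every point
        right = u[(i + 1) % n]
        new.append(u[i] + factor[i] * (0.5 * (left + right) - u[i]))
    return new


def relax(u, factor, sweeps):
    """Repeat the masked update a fixed number of times."""
    for _ in range(sweeps):
        u = masked_jacobi_step(u, factor)
    return u


if __name__ == "__main__":
    u0 = [0.0] * 8 + [1.0]                  # boundary values 0 and 1
    factor = [0.0] + [1.0] * 7 + [0.0]      # 0 on the boundaries, 1 in the interior
    print(relax(u0, factor, 2000))
```

No processor sits idle during the boundary treatment in this arrangement, at the cost of a few extra multiplications per point. As the text notes, this works only when the boundary and interior stencils are similar enough to share one form.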

A  number  of  applications  have  been  computed  to  validate  the  accuracy  of  the  Zonal  code  on  the 
CM. Both 2-D axisymmetric and 3-D flows have been solved and compared with the experimental
data and results from Cray computations. The parallel processing effort at BRL was initiated in late
1983 with the acquisition of a 64 processor Denelcor HEP MIMD machine which provided an initial
platform for numerical experiments with parallel algorithms. Some of the techniques implemented in
the Connection Machine code, such as multi-zone data exchange, local variables for each zone, and
data management, were dealt with experimentally on that computer. Also, programming experience
with the Cyber-200 series using explicit vector directives contributed to the CM code, because it had
many directives similar to CM Fortran. With the availability of the Crays, the code has been
validated and applied to real world problems. The code has consistently sustained a parallel efficiency
of about 98% for 3-D applications on 4 and 8 processor Crays [Ref. 3].

We  have  tried  to  satisfy  our  primary  aim  of  performance  by  using  efficient  numerical  schemes  and 
simple data management based on modular building blocks for flexibility. The code has been written
from scratch for the CM, working with one module at a time in order to simplify the development and
enhance portability. Once the blocks are identified and the parallel directives are implemented, then
an efficient code is developed by putting these modules together in an appropriate way. Thus we have
been able to port the parallel code efficiently from one computer to another [Ref. 4]. The code has
modules that are capable of computing aerodynamic forces and coefficients such as drag and pitching
moments acting on the body. These coefficients along with the surface pressure distribution and
velocity profiles (when available) are compared with the experimental data.

We  now  mention  some  of  the  application  studies  that  have  been  performed  using  the  zonal  code. 
All  of  the  following  studies  used  multi-zone  (block)  grids  involving  2  to  15  zones.  The  effects  of 
projectile nose bluntness have been reported in [Ref. 1]. References [5] and [6] present detailed
discussion of studies of spike-nosed HEAT (High Explosive Anti-Tank) projectiles. The non-steady
behavior of the flow is interesting and real-time visualization may provide more insight into the flow
behavior. Viscous flow computations have been performed to study missiles with delta wings at
various angles of attack [Ref. 7]. The formation of leading edge vortices and their effect on the wing
pressure distribution have been investigated. The projectile with a cylindrical wrap-around fin, which
allows efficient use of the available space, has been investigated in [Ref. 8]. Computations for a fully
integrated engine including inlet, combustor and nozzle for the National Aerospace Plane or X-30 are
described in [Ref. 9].

In the remainder of this section we show a sampling of flow computations that have been performed
on  the  CM  at  the  AHPCRC  using  the  zonal  code.  First  we  consider  the  Army  M549  artillery  shell. 
This  is  the  155mm  projectile  with  a  flat  nose  and  a  flat  base.  Our  interest  is  in  the  computation  of 
supersonic  flow  and  determination  of  static  aerodynamic  coefficients.  Because  of  the  sharp  edged 
configuration  at  the  nose-forebody  junction  and  at  the  boattail-base  junction,  a  three  zone  grid  has 
been  used.  The  3-zone  grid  for  the  M549  is  shown  in  Figure  1. 

The  projectile  shape  consists  of  a  flat  nose,  forebody  section  of  ogive  and  cylinder  and  aftbody 
section  of  the  boattail  and  flat  base.  The  zonal  grid  preserves  the  actual  corners  in  the  nose  and  base 
area.  The  zonal  grids  are  overlaid  by  a  single  cell  with  the  neighboring  grids  without  any  mismatch 
and  thus  inter-zone  boundary  conditions  only  require  data  transfer  at  the  interfaces.  A  non-reflecting 
boundary  condition  was  imposed  on  the  outer  boundary  [Ref.  1].  This  approach  allows  setting  of  the 
outerfield  close  to  the  body.  From  the  computational  point  of  view,  it  allows  the  removal  of  a 
relatively  large  number  of  grid  points  from  the  outer  region  for  supersonic  flow  computations.  This 
enables  us  to  save  memory  space  since  the  memory  is  proportional  to  the  total  number  of  grid  points. 
On  the  Cray  computers,  the  eliminated  grid  points  result  in  decreased  computation  time.  However, 
the  computation  involved  in  non-reflecting  type  boundary  conditions  is  relatively  more  involved  than 
simple  free-stream  type  boundary  conditions.  The  overhead  imposed  by  complex  boundary  conditions 
is  relatively  low  on  Crays  compared  to  the  CM. 

For  Mach  =  1.5  and  a  3°  angle  of  attack,  the  solution  clearly  shows  the  asymmetry  in  the  shock 
wave in the nose area, the region of over-expansion and recompression [Figure 2]. Notice that the use
of non-reflecting boundary conditions allows the shock to pass through the outer boundary. A sequence
of  frames  from  the  visualization  of  the  computed  flow  fields  reveals  unsteady  vortex  shedding  with 
vortices  being  formed  and  shed  in  the  wake  which  raises  questions  about  dislocation  of  events.  Thus 
real-time  visualization  has  opened  up  an  area  that  needs  further  investigation.  It  is  interesting  to  note 
that  unsteady  flow  behavior  has  been  reported  in  [Ref.  5]  and  [Ref.  6]  which  was  not  previously  noticed 
in  experiments.  As  the  Mach  number  is  increased,  the  recirculation  region  becomes  entrapped  in  the 
base  area.  Again  the  real-time  visualization  may  prove  a  key  element  in  this  type  of  study. 

Two three-zone grids have been used in this study. The grid dimensions used for the first grid are
(30x94x39), (249x65x39) and (120x46x39) respectively. The second three-zone grid has dimensions
of (32x128x64), (256x128x64), and (128x64x64). The performance of the entire code for the M549 on
the CM with 32K processors or 1024 nodes is about 1400 Mflops. This compares to performance of the
same application on a Cray-2 at 90 Mflops using a single processor and 340 Mflops using four
processors. A Cray-2 was used because of the large memory required for the 3-D applications.
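
For orientation, the sketch below works out the number of grid points in each zone of the two grids quoted above, the resulting number of grid points per physical processor on a 32K-processor machine, and the ratios of the quoted Mflops figures. The virtual processor ratio is computed here simply as points divided by processors, rounded up; any padding that the machine's layout rules may impose on the first grid is ignored, which is an assumption on our part. The text does not say on which of the two grids the 1400 Mflops figure was measured.

```python
import math

PHYSICAL_PROCESSORS = 32 * 1024      # the 32K-processor CM

GRID_1 = [(30, 94, 39), (249, 65, 39), (120, 46, 39)]
GRID_2 = [(32, 128, 64), (256, 128, 64), (128, 64, 64)]


def points(dims):
    """Number of grid points in one zone."""
    return dims[0] * dims[1] * dims[2]


def vp_ratio(dims, processors=PHYSICAL_PROCESSORS):
    """Grid points handled per physical processor, rounded up (padding ignored)."""
    return math.ceil(points(dims) / processors)


def speed_ratio(cm_mflops, other_mflops):
    """How many times faster the CM figure is than the comparison figure."""
    return cm_mflops / other_mflops


if __name__ == "__main__":
    for name, grid in (("grid 1", GRID_1), ("grid 2", GRID_2)):
        total = sum(points(z) for z in grid)
        print(name, [points(z) for z in grid], "total", total,
              "VP ratios", [vp_ratio(z) for z in grid])
    print("CM vs one Cray-2 processor :", round(speed_ratio(1400, 90), 1))
    print("CM vs four Cray-2 processors:", round(speed_ratio(1400, 340), 1))
```

The zones of the second grid divide evenly among the 32,768 processors, whereas those of the first grid do not, which is consistent with the earlier remark that each block should contain grid points in multiples of the number of physical processors.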

The entire M549 application, including various boundary conditions and inter-zone
communications, takes about 12 sec/iteration on a 32K processor machine. The detailed breakdown
of the timing reveals that the code spends about 4 sec/iteration solving the kernel (interior grid points)
and 8 sec/iteration applying the boundary conditions including zonal interface data transfer (or
working on the boundary points). As discussed previously, the work involved on the boundary can
impose substantial overhead. So there can be a huge gap when comparing the performance of the
kernel with the performance of the entire code. In our case inter-zone communication has turned out to
be one of the most expensive operations on the CM because of the long-distance communication
involved. Still, the performance of the M549 application on the CM is very encouraging and, more
importantly, there is room for future applications that involve larger grids. It is not difficult to come
up with applications involving a single-block grid and simple boundary conditions that can be executed
concurrently with the interior points and obtain much better performance than what we have obtained
for the M549 application. We are focusing on the issues that we encountered while keeping the code
flexible and practical enough for a variety of applications.
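
The timing breakdown above can be turned into a simple estimate of how much the iteration time would improve if only the boundary and inter-zone work were made cheaper. The 4 and 8 second figures come from the text; the speedup factors are hypothetical, and the model assumes the kernel time is unaffected.

```python
KERNEL_SEC = 4.0      # interior grid points, per iteration (from the text)
BOUNDARY_SEC = 8.0    # boundary conditions + zonal interface transfer (from the text)


def boundary_fraction(kernel=KERNEL_SEC, boundary=BOUNDARY_SEC):
    """Share of each iteration spent outside the kernel."""
    return boundary / (kernel + boundary)


def time_per_iteration(boundary_speedup, kernel=KERNEL_SEC, boundary=BOUNDARY_SEC):
    """Iteration time if only the boundary/exchange part ran `boundary_speedup` times faster."""
    return kernel + boundary / boundary_speedup


if __name__ == "__main__":
    print("fraction outside kernel:", round(boundary_fraction(), 3))
    for factor in (1, 2, 4):                  # hypothetical improvement factors
        print("boundary work x%d faster -> %.1f sec/iteration"
              % (factor, time_per_iteration(factor)))
```

Two thirds of each iteration is spent outside the kernel, so even a large improvement in kernel speed alone could not reduce the iteration time below 8 seconds in this model.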

We have performed 3-D flow computations on the CM for the U. S. Army Missile Command
(MICOM). A missile configuration associated with a sled test flow field has been considered [Ref. 10].
The missile configuration has been gridded using a single zone grid of 256x64x64 [Figure 3]. The high
Reynolds number computations have been performed for the Mach 1.9 case at a 0° angle of attack
[Figure 4].

An application involving a somewhat more complex body shape is shown in Figure 5. This is the full
F-15 aircraft configuration. Because of the body shape, the multi-zone gridding of this configuration
is more involved and three institutions have been involved (National Science Foundation
Engineering Research Center, McDonnell Douglas Corp. and the U. S. Air Force) [Ref. 11]. Once the
grid has been obtained, the computations about the entire aircraft configuration have been carried out
using the Zonal code on the CM. The surface pressure distribution for the F-15 at Mach = 1.5 and a 0°
angle of attack is shown in Figure 6. A more interesting area of research would be to make a video
similar to that for the M549, capturing a series of flow frames involving an aircraft maneuvering such
that roll or pitch are changing. The visualization capabilities discussed in the next section may prove an
indispensable tool for the above mentioned study. A detailed discussion of the F-15 results will be
documented in an appropriate future report.

5.  VISUALIZATION 

Visualization of flow data demands rendering large quantities of data, which will require
improvements to the traditional techniques. While machines like the CM have attached frame buffers,
these machines are typically located at a remote site, making it desirable to transfer the data to a local
machine for visualization. The massive amount of data makes it difficult to transfer files, particularly
to different architectures or workstations with modest disk capacity.

One  approach  is  to  transfer  the  data  to  a  local  machine  with  sufficient  disk  capacity  and  do  the 
visualization  almost  entirely  in  software.  This  allows  the  visualization  to  be  accomplished  on 
machines  without  special  graphics  hardware.  If  the  transfer  and  visualization  steps  are  separated,  the 
computation,  visualization,  and  storage  can  happen  concurrently,  and  each  process  can  be  performed 
on  the  most  appropriate  architecture. 

For these reasons, the visualization of flow data from the Connection Machine has been broken into
two steps: 1) transferring the data [Ref. 13] and 2) displaying and manipulating images. To allow
flexibility in choosing the platforms for transfer and visualization, both steps are portable, relying on
no special hardware or architecture-specific software.

Each  timestep  of  the  Navier-Stokes  calculation  produces  several  scalar  values  at  each  grid  point 
(density,  X,  Y,  Z  momentum  and  energy).  Every  tenth  timestep  is  dumped  to  a  file  on  the  Data  Vault. 
The  filename,  along  with  the  size,  shape,  and  normalizing  values  is  written  to  a  shared  memory 
segment  queue  on  the  front-end  processor.  The  Zonal  code  is  then  free  to  resume  processing,  resulting 
in  a  minimal  impact  on  code  performance. 

A separate process, on another quadrant, reads the information from the shared memory queue,
opens the Data Vault file, reads it into the CM, and normalizes the data. Then, using RdT, a data
transfer library, selected planes are transferred to a local processor. The data is transferred in
eXternal Data Representation (XDR) format, developed by Sun Microsystems [Ref. 12], and stored on
the local machine in host binary format. Once on the local machine, the data is visualized in an X11
window using ShAYD, a collection of hardware-independent routines that is used to visualize
curvilinear, multi-zone grids and their scalar values. Since all of the rendering is done in software,
massive grids can be handled, one small section at a time, to produce a final image. Each scalar value
is mapped to a hue, and a light source can optionally be applied. The hue is interpolated across each
polygon and used, along with the light value, to produce either a value in the X11 color map or a
24-bit RGB value. The 24-bit output is used to produce videos of many timesteps.
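
The scalar-to-color step described above can be illustrated with a short sketch. This is not the ShAYD implementation; the function name, the blue-to-red hue ramp, and the way the light value scales brightness are all assumptions made for illustration, using only Python's standard colorsys module.

```python
import colorsys

def scalar_to_rgb24(value, vmin, vmax, light=1.0):
    """Map a scalar to a packed 24-bit RGB integer (0xRRGGBB).

    Assumed convention (not taken from the paper): low values map to blue,
    high values to red, and `light` in [0, 1] scales the brightness.
    """
    span = vmax - vmin
    t = 0.0 if span == 0 else (value - vmin) / span
    t = min(1.0, max(0.0, t))            # clamp the normalized scalar
    hue = (1.0 - t) * (2.0 / 3.0)        # 2/3 is blue, 0 is red in HSV
    brightness = min(1.0, max(0.0, light))
    r, g, b = colorsys.hsv_to_rgb(hue, 1.0, brightness)
    return (round(r * 255) << 16) | (round(g * 255) << 8) | round(b * 255)

if __name__ == "__main__":
    for v in (0.0, 0.5, 1.0):
        print(v, hex(scalar_to_rgb24(v, 0.0, 1.0)))
```

Interpolating the hue rather than the RGB components keeps intermediate colors fully saturated, which is one plausible reason for working in hue space before converting to the color map or to 24-bit output.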

6.  SUMMARY 

A parallel, multi-zone, three-dimensional Navier-Stokes code has been developed on the
Connection Machine (CM) for computing steady and unsteady flows. Results are presented which
clearly demonstrate that real-world applications in one area can be handled efficiently on this
massively parallel architecture. The applications also demonstrate viscous flow computations at
high Reynolds number. The graphics capabilities developed in conjunction with the parallel
computation are very useful for visualizing the flow and debugging the computation.

The performance of the algorithm results more from simplicity and data management than from
sophisticated  numerics.  We  have  highlighted  issues  that  we  encountered  in  addressing  real-world 
applications  on  the  CM  machine.  There  can  be  a  big  gap  in  the  performance  of  different  applications 
depending  on  the  type  of  boundary  conditions  and  long-distance  communications  involved. 

The  basic  theme  of  the  research  effort  is  simplicity.  By  putting  simple  modules  together  in  an 
appropriate  way  and  avoiding  unnecessary  complexity  (the  law  of  "diminishing  returns"),  we  have 
produced  a  useful  and  efficient  algorithm  on  the  massively  parallel  architecture. 
ABSTRACT

The  flow  of  groundwater  in  the  vicinity  of  certain  critical  military  arsenals  is 
of  increasing  importance  because  of  environmental  impact.  Further,  there  are  many 
techniques  used  in  military  applications  that  can  be  used  to  solve  problems  in  other 
disciplines  (knowledge  transfer).  Therefore,  this  paper  describes  new  advances  in  the 
computational  modeling  of  groundwater  and  seepage  using  the  finite  element  method 
(FEM)  in  conjunction  with  tools  and  techniques  typically  used  by  aerospace 
engineers.  First,  an  extension  of  the  technique  of  generating  an  orthogonal 
structured  grid  to  automatically  generate  a  flow  net  for  two-dimensional  problems 
is presented. Second, a complete implementation of a three-dimensional (3-D)
seepage/groundwater model is described where (1) grid generation is accomplished
using the Eglin Arbitrary Geometry Implicit Euler (EAGLE) program developed
by Eglin Air Force Base and Mississippi State University, (2) the seepage and
groundwater analysis for either confined or unconfined flow, homogeneous or
inhomogeneous media, and isotropic or anisotropic soil is done with no restriction
on the FE grid or requirement of an initial guess of the free surface for unconfined
flow problems, and (3) scientific visualization is performed using the program Flow
Analysis Solver Toolkit (FAST) developed by NASA Ames. Special emphasis is
placed on the proper development of boundary conditions at exit faces and wells for
unconfined flow or unsaturated flow problems, which are often done incorrectly.
Finally,  examples  showing  both  generated  flow  nets  for  2-D  problems  and  results  for 
3-D  applications  are  presented. 
INTRODUCTION


The modeling of seepage under dams and groundwater flow in aquifers is of
significant interest. This becomes even more important in modern times with
increasing interest in the flow of pollutants. The unsolved environmental issues
regarding hazardous and toxic waste must be resolved, and significant
resources must be placed on this effort. Some military bases are contaminated with
hazardous waste that has entered the groundwater domain. A groundwater model
that takes into account contaminant flow is therefore critical. The state of the art
has advanced in various ways over the years to achieve better and better solutions.
However, it is unusual for the tools that engineers in one discipline have developed
to be applied to problems of other disciplines. What is said is, "We don't
do it that way." Because of the authors' diverse backgrounds, a unique feature of the
work in this paper is that the tools developed by structural and aerospace engineers
are applied to a problem typically addressed by others.

Difficulty  of  Problem 

One  profound  aspect  of  seepage  and  groundwater  flow  is  the  problem  of 
modeling  flow  through  materials  of  significantly  different  characteristics 
(permeabilities,  hydraulic  conductivities,  etc.).  This  is  compounded  when  unconfined, 
unsaturated,  or  multiphase  flow  exists.  One  example  of  this  is  an  earth  dam  with  a 
relatively  impervious  clay  core  and  a  highly  pervious  drain  installed  around  it.  The 
rest  of  the  dam  is  composed  of  moderately  porous  material.  Figure  1  shows  an 
example  with  the  soil  properties  given  in  Table  1. 
Figure 1. Zoned earth dam

Soil   Material   k1           k2           Angle, deg
1      Rock       9.3(10^-2)   1.7(10^-2)   140
2      Sand       9.8(10^-2)   2.0(10^-2)   0
3      Drain      9.8(10^-2)   2.0(10^-2)   0
4      Shell      9.8(10^-1)   9.8(10^-1)   0
5      Random     9.8(10^-3)   9.8(10^-3)   0
6      Core       9.2(10^-5)   2.0(10^-5)   0
7      Random     9.8(10^-3)   9.8(10^-3)   0
8      Grout      9.8(10^-3)   2.0(10^-3)   140

Table 1. Material properties


COMPUTER  GENERATED  FLOW  NETS 

The  graphical  construction  of  flow  nets  by  hand  to  compute  the  quantity  of 
flow,  exit  gradient,  etc.  is  a  standard  engineering  tool  of  soils  engineers.  However, 
these  are  extremely  tedious  to  construct  by  hand  because  equipotential  lines  and  flow 
lines  must  be  drawn  in  such  a  way  that  curvilinear  squares  result.  One  significant 
aspect  of  this  research  effort  is  that  numerical  grid  generation  techniques  of 
aerospace  engineers  used  to  generate  an  orthogonal  grid  (Thompson,  Warsi,  and 
Mastin  1985)  can  be  extended  to  construct  a  flow  net  for  various  boundary  conditions 
using  the  Cauchy-Riemann  Equations  (Crowder  and  McCuskey  1964).  This  section 
shows  how  the  FEM  has  been  successfully  applied  to  generate  flow  nets  with 
emphasis  also  given  to  differences  in  approach  from  previous  work.  The  major 
advantage  of  the  techniques  described  in  this  section  is  that  they  improve  the  quality 
of  the  resulting  flow  nets. 

Governing  Equations  and  Basic  Approach 

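The total head or potential $\phi$ for a homogeneous, isotropic medium for 2-D,
steady-state flow satisfies Laplace's equation as follows:

$$\frac{\partial^2 \phi}{\partial x^2} + \frac{\partial^2 \phi}{\partial y^2} = 0 \tag{1}$$

The stream function $\psi$ also satisfies Laplace's equation,

$$\frac{\partial^2 \psi}{\partial x^2} + \frac{\partial^2 \psi}{\partial y^2} = 0 \tag{2}$$

Therefore, a complex potential $\Phi$ exists as follows:

$$\Phi = \phi + i\psi$$

Since $\phi$ and $\psi$ are conjugate harmonic functions, the Cauchy-Riemann equations now
hold.

$$\frac{\partial \phi}{\partial x} = \frac{\partial \psi}{\partial y} \quad \text{or} \quad \phi_x = \psi_y, \qquad \frac{\partial \phi}{\partial y} = -\frac{\partial \psi}{\partial x} \quad \text{or} \quad \phi_y = -\psi_x \tag{3}$$

It should be noted here that the stream function is often defined as a velocity-type
term. However, in this work it is a gradient-type term. That is, $\psi$ is defined by

$$\psi = \int \phi_x \, dy - \phi_y \, dx$$

as compared to

$$\psi = \int u \, dy - v \, dx$$

where u is the x component of velocity, and v is the y component of velocity.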
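
As a quick numerical illustration of the Cauchy-Riemann relations for a gradient-type stream function, the sketch below checks them by central differences for one conjugate harmonic pair. The pair (x^2 - y^2 and 2xy) and all function names are chosen here purely as an example and do not come from the paper.

```python
def phi(x, y):
    """Example potential: the real part of (x + iy)^2."""
    return x * x - y * y

def psi(x, y):
    """Example stream function: the imaginary part of (x + iy)^2."""
    return 2.0 * x * y

def d_dx(f, x, y, h=1e-5):
    """Central-difference estimate of the x derivative of f."""
    return (f(x + h, y) - f(x - h, y)) / (2.0 * h)

def d_dy(f, x, y, h=1e-5):
    """Central-difference estimate of the y derivative of f."""
    return (f(x, y + h) - f(x, y - h)) / (2.0 * h)

def cauchy_riemann_residuals(x, y):
    """Return (phi_x - psi_y, phi_y + psi_x); both vanish for a conjugate pair."""
    r1 = d_dx(phi, x, y) - d_dy(psi, x, y)
    r2 = d_dy(phi, x, y) + d_dx(psi, x, y)
    return r1, r2

if __name__ == "__main__":
    print(cauchy_riemann_residuals(0.7, -1.3))
```

Both residuals are zero to within rounding error, which is the property the flow net construction relies on: contours of the two functions cross at right angles.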

For a point on the curve C, a local tangent-normal (T-N) coordinate system
can be established as shown in Figure 2. Equation 3 for this system then becomes

$$\phi_T = \psi_N, \qquad \phi_N = -\psi_T \tag{4}$$

Figure 2. T-N coordinate system

A property of such functions is that lines of constant $\phi$ are orthogonal to
lines of constant $\psi$. The flow net consists of $\phi$ = constant lines and $\psi$ = constant
lines constructed in such a way that the resulting picture consists of curvilinear
squares. The concept of automatically generating the flow net is fairly
straightforward and involves the following three steps:

a. Compute total head. Perform a normal FEM solution determining the total
head h (same as potential $\phi$) at each node and the quantity of flow, Q,
passing through the system. Also compute the shape factor, f, from

$$Q = k \, (h_u - h_d) \, f \tag{5}$$

where $h_u$ is the upstream head, $h_d$ is the downstream head, k is the
permeability, and f is the shape factor.
b. Compute stream function. Determine the boundary conditions for the
stream function using Equation 4 and perform a second FEM solution to
obtain values of $\psi$ at each node.

c. Contour results. Contour the two sets of data to construct the flow net.
The intervals for each are determined using the shape factor which, by
definition, is

$$f = \frac{N_f}{N_e} \tag{6}$$

where $N_e$ is the number of equipotential drops, and $N_f$ is the number of
flow paths.
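
The arithmetic behind step c can be sketched as follows. The sketch assumes the usual flow net convention that the shape factor equals the number of flow paths divided by the number of equipotential drops, and that the total range of the gradient-type stream function is Q/k; the function name and the sample numbers are hypothetical.

```python
def flow_net_intervals(Q, k, h_up, h_down, n_flow_paths):
    """Return (f, n_drops, d_phi, d_psi) for contouring a flow net.

    Q            : quantity of flow from the first FEM solution
    k            : permeability
    h_up, h_down : upstream and downstream heads
    n_flow_paths : number of flow paths chosen by the user
    """
    head_drop = h_up - h_down
    f = Q / (k * head_drop)             # shape factor from Equation 5
    n_drops = n_flow_paths / f          # assumed definition f = Nf / Ne
    d_phi = head_drop / n_drops         # contour interval for total head
    d_psi = (Q / k) / n_flow_paths      # contour interval for stream function
    return f, n_drops, d_phi, d_psi

if __name__ == "__main__":
    # Hypothetical values, not taken from the paper.
    print(flow_net_intervals(Q=0.032, k=1.0e-3, h_up=100.0, h_down=20.0,
                             n_flow_paths=4))
```

Under these assumptions the two contour intervals come out equal, which is what produces curvilinear squares when both sets of contours are drawn.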

Earlier work (Christian 1980a, 1980b, 1983; Aalto 1984; Christian 1987)
determined the boundary values for the stream function solution (step b) numerically,
whereas in this work a more fundamental technique is used. Here, the
Cauchy-Riemann equations are used to determine the correct boundary conditions.

Dupuit's  Problem 

The procedure will now be applied to Dupuit's problem, which is unconfined
flow in an earth dam with vertical sides (Figure 3). Line segment AB is impervious,
line segments AF and BC have constant head specified, and line segment CD has the
boundary condition

$$\phi = y \tag{7}$$

The position of the free surface (FD) must be determined from the FEM solution.
Once determined, line segment FD becomes a flow line and is treated exactly like
an impervious boundary. Also, the region above the free surface (triangular region
FDE) is not used for the second solution. Rather, a new grid with the phreatic
surface being a new boundary is used. We will now determine the $\psi$ boundary
conditions.

Figure 3. Dupuit's problem

On the $\phi = H_1$ and $\phi = H_2$ boundaries

$$\phi_T = 0$$

since $H_1$ and $H_2$ are constants. But from Equation 4

$$\phi_T = \psi_N$$

So the new boundary condition is

$$\psi_N = 0 \tag{8}$$

On the impervious boundaries no flow enters, so the normal component of
velocity, $v_N$, is zero. Thus, using Darcy's law for a homogeneous, isotropic medium,

$$v_N = -k \, \phi_N = 0$$

it follows that

$$\phi_N = 0$$

Applying Equation 4 to the above equation yields

$$\phi_N = -\psi_T = 0$$

or

$$\psi_T = 0 \tag{9}$$

Now

$$d\psi = \psi_T \, dT + \psi_N \, dN$$

Substituting Equation 9 into the above equation and noting that dN = 0 on the
boundary gives

$$d\psi = 0$$

or

$$\psi = \text{constant}$$

The total amount of stream function can be shown to be

$$\psi_{total} = \frac{Q}{k} = f \, (h_u - h_d) \tag{10}$$

Therefore, apply the boundary conditions as follows:

$$\psi = \psi_1 \ \text{on AB}, \qquad \psi = \psi_2 = \psi_1 + \psi_{total} \ \text{on FD} \tag{11}$$

where $\psi_1$ is an arbitrary constant set big enough so the FEM program has a confined
flow condition when computing stream function.


On the boundary CD, Equation 7 is insufficient information to determine the
new boundary conditions. Therefore, the normal component of discharge velocity $v_N$
is first computed for each node on the boundary CD with $v_N$ at D being zero. It is
assumed that points C and D are node points, and there are intermediate node
points I and J (I is closest to C) as well. Then for each node,

$$v_N = -k \, \phi_N, \qquad \psi_T = \frac{v_N}{k} \tag{12}$$

A piecewise linear assumption for $v_N$ (and therefore $\psi_T$) yields

$$\psi_J = \frac{1}{2} \left( \psi_{T,I} + \psi_{T,J} \right) \left( T_J - T_I \right) + \psi_I \tag{13}$$

Start I at point D and process consecutive nodes using Equation 13 until point C is
done. The boundary conditions are now fully determined for the stream function
calculation.
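
The node-by-node accumulation of Equation 13 amounts to trapezoidal integration of the tangential derivative of the stream function along CD. A minimal sketch follows; the function name, the argument layout, and the sign convention for the tangential coordinate are assumptions, and the sample values are illustrative only.

```python
def stream_function_on_exit_face(t, psi_t, psi_start):
    """Accumulate stream function values along the exit face.

    t         : tangential coordinates of the nodes, ordered from D toward C
    psi_t     : tangential derivative of the stream function at each node
                (v_N / k under the gradient-type definition)
    psi_start : known stream function value at the first node (point D)
    """
    if len(t) != len(psi_t):
        raise ValueError("t and psi_t must have the same length")
    psi = [psi_start]
    for i in range(len(t) - 1):
        # Trapezoid rule over one segment, as in a piecewise linear psi_T.
        step = 0.5 * (psi_t[i] + psi_t[i + 1]) * (t[i + 1] - t[i])
        psi.append(psi[-1] + step)
    return psi

if __name__ == "__main__":
    # Hypothetical data: derivative grows linearly from zero at D.
    print(stream_function_on_exit_face([0.0, 1.0, 2.0, 3.0],
                                       [0.0, 1.0, 2.0, 3.0], 5.0))
```

Because the derivative is assumed piecewise linear, the trapezoid rule integrates it exactly on each segment, so no additional quadrature error is introduced at this step.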

The above formulation is not restricted to Dupuit's problem but can also be
applied to a wide variety of quadrilateral-type earth dams. The only restriction is
that the problem should have the five basic boundaries of impervious base, specified
headwater and tailwater, free surface, and surface of seepage. Figure 4 shows the
results for Dupuit's problem where, referring to Figure 3, AB = AF = 100 ft and BC = 20 ft.
Figure 5 shows the computer generated flow net for an earth dam.


SURFACE  OF  SEEPAGE  BOUNDARY  CONDITIONS 

The free surface (FD in Figure 3) and the exit point (point D) must be
determined by an iterative process, and the boundary conditions along CDE are often
done incorrectly. What is typically done is that a no-flow boundary condition is imposed
along the entire boundary. This is incorrect because water must be allowed to exit
along CD. So each iteration for the free surface must also include the following
switches for all surface of seepage nodes:

a. Impervious node. If the pressure is greater than zero, switch the node to
a specified head node.

b. Specified head node. If the flow Q is not flowing out, switch the node to
an impervious node.
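
The two switches can be sketched as a single pass over the surface of seepage nodes. The node representation (a dictionary with `kind`, `pressure`, and `outflow` keys) and the reading of "not flowing out" as a non-positive outflow are assumptions made for illustration; they are not taken from the authors' program.

```python
IMPERVIOUS = "impervious"
SPECIFIED_HEAD = "specified_head"

def update_seepage_nodes(nodes):
    """Apply the two switching rules once; return how many nodes changed.

    Each node is a dict with keys:
      kind     : IMPERVIOUS or SPECIFIED_HEAD
      pressure : pressure at the node from the latest solution
      outflow  : nodal flow, positive when water leaves the domain
    """
    switched = 0
    for node in nodes:
        if node["kind"] == IMPERVIOUS and node["pressure"] > 0.0:
            # Rule a: positive pressure on a no-flow node means water must exit.
            node["kind"] = SPECIFIED_HEAD
            switched += 1
        elif node["kind"] == SPECIFIED_HEAD and node["outflow"] <= 0.0:
            # Rule b: a seepage node that is not discharging becomes no-flow.
            node["kind"] = IMPERVIOUS
            switched += 1
    return switched

if __name__ == "__main__":
    face = [
        {"kind": IMPERVIOUS, "pressure": 0.4, "outflow": 0.0},
        {"kind": SPECIFIED_HEAD, "pressure": 0.0, "outflow": -0.1},
        {"kind": SPECIFIED_HEAD, "pressure": 0.0, "outflow": 0.3},
    ]
    print(update_seepage_nodes(face), [n["kind"] for n in face])
```

In an iterative free-surface solver, a pass like this would run after each solution, and the iteration could stop once no node changes type and the free surface has converged.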
2-D SEEPAGE PACKAGE


A 2-D FEM seepage program with boundary conditions implemented correctly
and with the flow net capability has been written and incorporated into a 2-D
seepage package which contains (1) grid generation, (2) analysis, and (3)
postprocessor capability. A PC version has been widely distributed and is available
from the authors.


3-D  MODEL 

A 3-D seepage and groundwater model was created from the 2-D FEM
program. Structured pieces of the 3-D grid for a problem are generated by EAGLE
(Thompson 1987, Thompson and Gatlin 1988a, 1988b, 1988c), which has extensive
algebraic grid generation capabilities, as well as state-of-the-art elliptic smoothing
capabilities. A program was then written to apply boundary conditions to individual
blocks, combine blocks, and apply bandwidth minimization to preserve the element
numbering. By decoupling the geometry into blocks, grid generation and boundary
condition application are much less complicated. Also, the running time for solving
the system of nonlinear equations can be significantly reduced by using a structured
multi-block system. Further, the original structure can be preserved to output results
in the Flow Analysis Solver Toolkit (FAST) (Bancroft, Kelaita, McCabe, Merritt,
Plessel, Globus, and Semans 1991) format for scientific visualization.

Scientific  Visualization  Examples 

The  problem  of  two  partially  penetrating  wells  in  a  rectangular  aquifer  (Figure 
6)  bounded  by  a  river  and  three  impervious  boundaries  will  first  be  given  to  show 
how  groundwater  can  be  visualized. 

An  analytic  solution  to  a  partially  penetrating  well  (Figure  7)  of  thickness  t*, 
penetration  b,  well  radius  r^  permeability  k,  and  flow  Q  has  been  partially  described 
(Muskat  1946),  and  the  complete  solution  will  now  be  given.  First,  define 


where  (r,  z)  =  (0,  0)  is  located  at  the  top  of  the  aquifer  and  at  the  center  of  the  well. 
Next,  define  the  function 
Figure 7. Partially penetrating well

$$Z(s, y) = \sum_{n=0}^{\infty} \frac{1}{(n + y)^{s}} \tag{15}$$

For small values of a,

$$0 \le a \le 0.5$$

the total head, $\phi$, becomes


4»  =  g{-  log 


Hi  ♦  C  ♦  pim  -  C  + 

m  -  c  -  p>rd  +  c  - 


P) 


♦  loac  i  p  w«2  nc  ug 

c  -  P  +  A2  +  (C  -  P)2 


-  [Z  (2 ,  1  -  C  -  p)  -  Z (2 ,  1  -  c  +  P) 

♦  Z (2 ,  1  +  c  -  p>  -  Z (2 ,  1  +  c  +  P)1 

♦  0(a4)} 
(16)

where q is a flux density given by

$$q = \frac{Q}{4 \pi k b} \tag{17}$$

For the remaining larger values of a


4>  =  4g  [-T  -  K„(2nna)  cos(2.n*C)  sin(2mrP) 

(18) 

+  plog-|] 

where $K_0(x)$ is the modified Bessel function of the second kind. For relatively large
values of x (Press, Flannery, Teukolsky, and Vetterling 1989),

$$K_0(x) \approx \sqrt{\frac{\pi}{2x}} \, e^{-x}$$

Thus, an implementation of Equation 18 requires relatively few terms.
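
The claim that few terms are needed follows from the exponential decay of the modified Bessel function. The sketch below compares the large-argument approximation with a direct numerical evaluation of the standard integral representation (the integral of exp(-x cosh t) over t from zero to infinity) and counts how many terms exceed a tolerance when the argument grows linearly with the term index. The function names, the truncation limits, and the sample argument are assumptions for illustration.

```python
import math

def k0_integral(x, upper=8.0, steps=4000):
    """K0(x) from its integral representation, by the trapezoid rule."""
    h = upper / steps
    total = 0.5 * (math.exp(-x) + math.exp(-x * math.cosh(upper)))
    for i in range(1, steps):
        total += math.exp(-x * math.cosh(i * h))
    return total * h

def k0_asymptotic(x):
    """Leading-order large-argument approximation of K0(x)."""
    return math.sqrt(math.pi / (2.0 * x)) * math.exp(-x)

def terms_above_tolerance(x0, tol=1e-8, max_terms=1000):
    """Count leading terms n = 1, 2, ... with K0(n * x0) still above tol."""
    n = 0
    while n < max_terms and k0_asymptotic((n + 1) * x0) > tol:
        n += 1
    return n

if __name__ == "__main__":
    for x in (1.0, 5.0, 10.0):
        print(x, k0_integral(x), k0_asymptotic(x))
    print(terms_above_tolerance(math.pi))
```

The approximation improves as the argument grows, and each successive term in such a series is smaller by roughly a factor of exp(-x0), so truncating after a handful of terms loses very little.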

Figure 8. Image wells

An infinite series of images, the first eight being shown in Figure 8, is now
used to determine the solution. If a well is located at $(x_0, y_0)$ and the top of the
aquifer is at $z_t$, then use


•  *  25' 


C  -  <Z„  - 


in  Equations  16  and  18. 
The above solution was computed using 29 image wells on a large algebraic grid
of 21,266 nodes and had a maximum error of 5.3 percent. Some
of this error is attributed to the truncation in the number of image wells and some
is due to numerical imperfections. It seemed plausible that this grid could be
significantly reduced in size by applying the elliptic grid generation techniques of
EAGLE while getting close to the same result. A smaller elliptic grid of 13,266 nodes
yielded a maximum error of 5.7 percent, which substantiates the
hypothesis.

Figure 9 shows a K level shaded contour plot of potential for one of the wells,
with the highest potential being at the river (white) and the lowest potential being
at the wells (black). Figure 10 shows several I level surfaces with shaded contour
plots of potential. Figure 11 shows an isolevel plot, and Figure 12 shows a particle
trace plot of flow from the river to the wells. One surprising result is that the
leftmost flow line skips the smaller well and goes to the larger one.

An  extension  of  the  first  example  is  unconfined  groundwater  flow  in  a  part  of 
an  aquifer  (Figure  13)  with  a  river,  three  partially  penetrating  wells,  and  a  slurry 
trench  (EF).  Here,  16  subregions  (13  going  the  full  depth  of  the  aquifer  and  3  being 
below  the  partially  penetrating  wells)  are  used.  Figure  14  shows  a  shaded  contour 
plot  of  total  head  for  this  problem  with  the  river  painted  dark  for  visualization 
purposes.  It  is  interesting  to  see  the  effect  of  the  impervious  slurry  trench  on  the 
head  distribution. 


CONCLUSION 

The techniques and tools developed by aerospace engineers can be extremely
helpful in groundwater flow problems. Also, visualization techniques are an essential
and valuable tool for understanding and validating the results. An area of future
research is the application of finite volume techniques, used to solve aerospace
problems, to multiphase flow of groundwater with contaminant
transport.
Figure 9. K level shaded contour plot

Figure 10. I level shaded contour plot

Figure 11. Isolevel plot

Figure 12. Particle trace plot

Figure 13. Part of an aquifer

Figure 14. Shaded contour of total head